The Large Data Onboarding Pod is building Filecoin-native tooling and market mechanisms to reliably onboard, index, preserve, and retrieve large-scale public datasets through paid on-chain deals. The project advances a peer-to-pool PoRep workflow, lifecycle management and observability for popular datasets, and an MVP for paid retrievals, enabling archival-grade “library level” access where data remains verifiable and available when requested. By piloting on-chain data trusts and a curated commons dataset directory, it aims to strengthen the storage market’s sustainability, improve Fil+ efficiency, and accelerate adoption by institutions and data-driven applications.