Many of the data packages in the EDI Data Repository contain metadata listing the data sources that the data package is derived from. Data sources may be data packages in the EDI Data Repository or links to data sets that are external to the repository. The EDI Data Portal displays these provenance relationships on the data package summary page. A good example is LAGOS-NE-LIMNO v1.087.1: A module for LAGOS-NE, a multi-scaled geospatial and temporal database of lake ecological context and water quality for thousands of U.S. Lakes: 1925-2013. When you view this data package in the data portal, scroll down to the Provenance section to see the list of ninety data packages that this data package is derived from! All ninety source data packages can be viewed by clicking on the links provided. The repository keeps track of these provenance relationships in both directions, so when you view any of the source data packages (for example: Acadia National Park, U.S. National Park Service using Lakes and Stream Monitoring Protocol for National Parks in the Northeast Temperate Network, Version 1.1 (2006-2011) you will see links back to any derived data packages that use this data package as one of its data sources!
EDI received feedback for its Request For Comments (RFC) for establishment of a data registry, from participants of the ESIP working group along with EDI’s email list of environmental information managers. Subsequent updates on the code registry development can be found here.
The Environmental Data Initiative is now on YouTube! Stop by for past tutorials and discussions on information management in the environmental sciences here.
The Ecological Metadata Language (EML) is a metadata standard developed by the ecological community for persistence, discoverability, and reuse of ecological data. EML extends the data lifespan allowing new insights and knowledge to be derived many years from now. While the highly detailed .xml structure of EML is necessary to organize and make searchable the metadata content, it confronts the inexperienced data provider with a steep learning curve that few have time to tackle. To overcome this we’ve created the EML Assembly Line, a linear process helping you craft high quality EML yourself! The EML Assembly Line is useful for publishing one-off datasets or providing a foundation for an EML factory. The EML Assembly Line R code package is available for download and development at our GitHub (https://github.com/EDIorg/EMLassemblyline). See you there!
EDI’s data policy can be found under the folling URL on our webpage: https://environmentaldatainitiative.org/data/edi-data-policy/.