|OVERVIEW||1. ORGANIZE||2. CLEAN||3. DESCRIBE||4. UPLOAD||5. CITE|
Phase 4: Submit your data package
There are two methods for data submission: either you have created EML metadata yourself or EDI has created metadata for you (refer to Phase 3. Create EML Metadata). The first pathway is faster, because you know your data best, there is less back-and-forth and the process is less asynchronous. This is the better choice for those who know a small bit of the R programming language, and can perform one or two trial uploads. Usually, data managers use this pathway.
Get help from EDI
In case you are not versed in metadata creation or working with the repository, EDI will help you, and you will have already worked with an EDI data manager on the metadata in Phase 3. At the submittal stage, your main task is to review and approve data package drafts.
- EDI personnel will help by means of reviewing/approving drafts.
EML Congruence Checker
The quality of a data package (data and metadata) is a reflection of how well it may be used for a specific purpose. The EDI Repository ingests EML-described datasets for presentation, syntheses and analyses. To be useful, a data package must include a minimum amount of supporting information (metadata) that details how to access the data, and adequately describes the data. For metadata to be effective, there must be strong and accurate agreement (i.e., congruence) between the metadata and the data they describe. EML Congruence Checker (ECC) runs a series of checks that examine the congruence of metadata and data. The ECC reports on details of submitted data packages to help ensure that they meet a high standard for quality. Details on the ECC can be found here.
The ECC acts as the ‘gatekeeper’ for datasets entering the PASTA repository. The checks themselves are designed by the data management community. The group meets weekly. Please visit the ECC on GitHub for access to meeting notes, and outlines and assessments for potential new checks.
The EDI Data Portal can be accessed programmatically via the PASTA+ API. Anything you can do manually in the data portal can also be done via the API. Use the API to streamline data publication and data use workflows.
The EDI Dashboard
As a user of the EDI Data Repository you might receive an error message when trying to upload a data package or access data indicating that PASTA+ or another subsystem is not responding, or you might want to see if your data package has finished being processed by PASTA+ and was registered by PASTA+ as a published and archived data package, or for that matter, if it was successfully synchronized and indexed by DataONE? There is some help out there to answer them in the form of the “EDI Dashboard” website. Continue reading.
- See here for a video on “An overview of the EDI data repository and data portal (and how to use it for publishing data).” Presentation slides are available here