The EDI Repository API facilitates automated data processing and publication workflows, thereby enabling reproducible and efficient data package management. Four new R functions have been added to the EDIutils R library supporting package reservation (pkg_reserve_id.R), evaluation (pkg_evaluation.R), upload (pkg_upload.R), and update (pkg_update.R). The full suite of PASTA+ API calls from the R environment will be available soon!
EDI technical staff recently upgraded our virtualization infrastructure to the latest version of VMware’s ESXi software (from version 5 to 6.5). All of EDI’s servers, including those that run the PASTA+ data repository software, operate as virtual clients across six ESXi host systems. These virtual hosts are configured to operate between 6 and 12 clients at one time, with some room left over to shuffle systems and for testing. The ESXi host systems are located on the campus of the University of New Mexico and connect directly to a dedicated 10Gb/s connection using UNM’s Science DMZ research network. Wide-area Internet connectivity to UNM includes 100Gb/s connections to the DOE Energy Sciences Network (ESNet) and the Western Regional Network, both through the Albuquerque Gigapop. EDI’s data storage capacity is currently at 30TB, with an equivalent 30TB mirror storage device for near-time backups and smaller SSD disks that are used for off-site backup purposes. EDI also uses the AWS Glacier storage as a long-term “cold data” archive.
The International Data Week (IDW) 2018 is hosted in Gaborone, Botswana (5-8 November, 2018) by the Botswana Open Science and Open Data Forum, bringing together data scientists, researchers, industry leaders, entrepreneurs, policymakers and data stewards from all disciplines and geographies across the globe. In connection with the IDW 2018, the Enabling FAIR Data Project is celebrating and promoting its Commitment Statement in the international earth, space, and environmental sciences community. You can support the #FAIRData movement by signing up to the Commitment Statement: as an individual, repository, publisher, community, society, institution or funder. “FAIR” is defined as Findable, Accessible, Interoperable, and Reusable as defined by the FAIR Guiding Principles. Continue reading “Enabling FAIR Data”
- All data package “landing pages” in our test environment Data Portal are now labeled with a “Test Data Package” watermark so they will not be mistaken for the real McCoy. Only data packages displayed on the primary Data Portal (https://portal.edirepository.org/nis) should be considered bone-fide and a published (with a “real” DOI) data package.
The EDI data portal has added a new informational check to the suite of data congruence checks it runs on every data package submitted. This check is information only and the community felt it was valuable, so we released it outside the regular twice-yearly schedule on July 11th, 2018. For detailed information contact firstname.lastname@example.org. Continue reading “New informational check released for EDI data portal”