Data Publication Training

Discussing a wide range of topics related to open data and the science it supports.

Workshop at Virginia Polytechnic Institute (Blacksburg, VA), Dr. Cayelan Carey Research Lab, May 24 – 25, 2018.

Data publication facilitates open science and collaboration, and it introduces new sources of data for scientific synthesis. Several research journals and funding sources recognize these benefits and now require data publication from researchers in the environmental sciences, many of whom may be unfamiliar with the process.

EDI member Colin Smith traveled to the research lab of Dr. Cayelan Carey at Virginia Polytechnic Institute to conduct a condensed 2-day workshop (May 24-25, 2018) covering the 5 phases of data publication and to consult with the group on the design and implementation of a data publication workflow.

During the event we discussed issues ranging from the role of controlled vocabularies in improving data discoverability to the efficiencies a research lab can gain by adopting an open data policy and archiving its data in a publicly accessible repository. The core of our work focused on the 5 phases of data publication, and we ultimately published 5 datasets in the EDI data repository by noon on the second day. In the remaining time, EDI developers (Mark Servilla and Duane Costa), the Carey Lab, and principal investigators of The Smart Reservoir Project met by video conference to design an automated data publication workflow that delivers valuable data products to local water resource managers and the public in a timely fashion. All parties involved considered the event a success, and it was a great learning experience for EDI.

This was the first application of our new data training approach.

Sketching data publication workflows for long-term monitoring.