Data is becoming increasingly citable as first-order objects, including data archived in the EDI repository. One indication is that data package publications are indexed in personal Google Scholar user profiles, along with other scholarly articles, as for example in the profile of Paul Hanson (Research Professor at the Center for Limnology, University of Wisconsin-Madison).
There is also an increase in the number of cited data packages in scholarly articles. The figure below shows the annual number of EDI data package citations in scholarly articles over the past seven years as derived from Google Scholar.
In order for a data publication to be discoverable by search engines, including Google Scholar and Google’s Data Set Search, the data package needs to be “indexed”. A while ago EDI implemented sitemaps.org and schema.org metadata (often called Search Engine Optimization) to support search engine discovery and indexing of data packages archived in the EDI repository. Sitemaps metadata serves as a table of contents for high-value information found on websites so that search engines may more easily discover relevant web pages to index. For EDI, the sitemaps metadata points to the most recent data package versions, accessible through the EDI Data Portal, and is refreshed hourly.