EASYDAB Guideline

Earth system research datasets published with a DataCite DOI that fulfil the EASYDAB quality requirements can be highlighted with an EASYDAB logo on the landing page. The EASYDAB logo indicates that the datasets have an open license, comply with the FAIR Data Principles, and were reviewed by the responsible repository for compliance with a quality guideline. Repositories can use their own quality guidelines, provided these achieve a quality of the published data comparable to that defined in the ATMODAT standard.

Using the EASYDAB label, repositories can indicate that they carefully curate datasets with the FAIR principles in mind and make them reusable. Researchers can more easily find and evaluate well-described datasets and reuse data that is relevant to them.


EASYDAB can be used to highlight only those Earth system datasets that are or have been published with a DataCite DOI including a machine- and human-readable landing page.


Definitions

We use the definitions of the RDA Data Foundation and Terminology Working Group. For easier readability, we define the following types of metadata for scientific data that is published with a DataCite DOI:

  1. DOI metadata: metadata submitted to DataCite for the assignment of a DOI to a digital object. If one DOI has been assigned to multiple digital objects (digital collection), the DOI metadata describe the digital collection.
  2. Landing page metadata: metadata that are listed on the human-readable and the machine-readable landing page. They describe the digital objects or the digital collection published with the corresponding DOI.
  3. File metadata: description of the data in a single file.

Requirements

The following requirements are intended to make data and metadata FAIR:

  1. DOI metadata:  
    • A subject-specific list of mandatory DOI metadata is provided, which is more extensive than the DataCite list of mandatory metadata in the DataCite Metadata Schema. This is intended to capture not only the bibliographic information but also the generation and content description of the data.
    • The property Subject of the DataCite Metadata Schema always contains the keywords "EASYDAB" and the name of the quality guideline used.
  2. Landing page metadata:
    • Both the human-readable and the machine-readable landing page list at least all DOI metadata.
    • If one DOI is assigned to multiple digital objects (digital collection), then the individual digital objects in the collection should also be described on the landing page. These descriptions may also be placed on subpages of the landing page.
  3. File metadata:
    • All digital objects have descriptive metadata. Data and metadata can be written into separate files, but a description on the landing page alone is not sufficient.
  4. All metadata:
    • For individuals, institutions, and funders, a Persistent Identifier (PID) should be provided if possible.
    • The use of controlled vocabulary is recommended.
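
The keyword requirement for the Subject property can be checked automatically. The following is a minimal sketch, not an official EASYDAB tool. It assumes the DOI metadata are available as a dictionary shaped like the DataCite JSON representation, where `subjects` is a list of objects with a `subject` field. The function names, the case-insensitive matching, and the guideline name "ATMODAT standard" used in the example are assumptions made for illustration.

```python
def subject_terms(attributes):
    """Collect the subject strings from a DataCite-style attributes dict."""
    terms = []
    for entry in attributes.get("subjects", []):
        term = entry.get("subject", "")
        if term:
            terms.append(term.strip())
    return terms


def has_easydab_subjects(attributes, guideline_name):
    """Return True if both 'EASYDAB' and the guideline name are subjects.

    Matching is case-insensitive, which is an assumption of this sketch.
    """
    terms = {t.casefold() for t in subject_terms(attributes)}
    return "easydab" in terms and guideline_name.casefold() in terms


# Illustrative metadata; the subject values are placeholders.
example = {
    "subjects": [
        {"subject": "EASYDAB"},
        {"subject": "ATMODAT standard"},
        {"subject": "atmospheric model data"},
    ]
}

print(has_easydab_subjects(example, "ATMODAT standard"))  # True
```

A repository using its own quality guideline would pass that guideline's name as `guideline_name` instead.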

Datasets should be published as openly as possible. Therefore, they have:

  1. an open license, preferably machine-readable, which is available in English.
  2. an embargo only if it is limited in time.
  3. open and non-proprietary file formats, if possible. If proprietary file formats are used, there must be open software to read and fully convert the files so that the data can also be processed with non-proprietary software.
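
The three openness conditions above can be expressed as a simple check. This is a sketch under stated assumptions: the `OpennessRecord` class and all of its field names are hypothetical and do not correspond to any DataCite or EASYDAB schema. A curator or an upstream tool would have to fill in the fields.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class OpennessRecord:
    """Hypothetical summary of a dataset's openness properties."""
    license_is_open: bool
    license_in_english: bool
    embargoed: bool
    embargo_end: Optional[date]      # None means no end date is set
    format_is_open: bool
    open_converter_available: bool   # open software can read and fully convert the files


def openness_problems(rec: OpennessRecord) -> List[str]:
    """Return the list of violated openness conditions (empty if none)."""
    problems = []
    if not rec.license_is_open:
        problems.append("license is not open")
    if not rec.license_in_english:
        problems.append("license is not available in English")
    if rec.embargoed and rec.embargo_end is None:
        problems.append("embargo is not limited in time")
    if not rec.format_is_open and not rec.open_converter_available:
        problems.append("proprietary format without open conversion software")
    return problems


# A proprietary format is acceptable here because open conversion software exists.
record = OpennessRecord(
    license_is_open=True,
    license_in_english=True,
    embargoed=True,
    embargo_end=date(2030, 1, 1),
    format_is_open=False,
    open_converter_available=True,
)
print(openness_problems(record))  # []
```

The check does not cover the preference for a machine-readable license, since that is a recommendation rather than a condition.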