Policies

The Data Repositories associated with Earthchem and The Astromaterials Data System archive, publish and make accessible user-contributed data and other digital content from Earth and Planetary Science research.

When using EarthChem or Astromaterials Data Systems repository services your dataset will be reviewed by curators with domain-specific knowledge, as outlined below.

By submitting your dataset, you hereby agree to and accept the following policies:

The repositories are domain-specific, offering curation and review of user-submitted data in a variety of Earth and Planetary Science disciplines.

Upon curatorial review, if the submitted data type is considered outside of the repository scope, your data can be rejected from publication.

Please check with our curatorial staff before you start your data submission if your data are within the scope of the EarthChem Library and the Astromaterials Data Repository.

Whenever possible, our staff will provide alternative repositories that would best suit the data type.

The repositories offer data submission templates which are tailored to specific data types.

We highly recommend the use of templates to ensure reproducibility of the data and easy conversion to machine-readable formats.

If the data type can not be aligned with pre-existing templates, it should at least be supplied in one of the following formats:

Generic Resource TypesRecommended Format(s)Additional Format(s)Comment(s)
Tabular DataComma Separated Values (.csv), Tab Separated Values (.tsv), netCDF (.nc), OpenOffice Calc (.ods)Microsoft Excel (.xls or .xlsx), JavaScript Object Notation (.json)ASCII or UTF/UTF8/UNI See guidance for using NetCDF 
TextPortable Document Format/Archival (.pdf), OpenOffice Document (.odt), ASCII (.txt)Microsoft Word (.doc, .docx)Text documents must conform with Section 508 Standards.
MetadataISO 19139 (.xml) Geospatial MetadataFGDC Content Standard for Digital Geospatial Metadata (CSDGM), NASA Directory Interchange Format (DIF), Hierarchical Data Format (.hdf or .h5), Resource Description Framework (.rdf)the preferred representation of metadata in these standard formats is to be in eXtensible Markup Language (xml)
ImagesPortable Network Format (.png), JPEG (.jpeg), Tagged Image File Format (.tiff)BitMap (.bmp), Graphics Interchange Format (.gif), High Efficiency Image (.HEIC)
Geo-referenced dataGeoJSON (.geojson), GeoJPEG (.jgw), Keyhole Markup Language (.kml)
MiscellaniousZip (.zip), Webpage (.html), Jupyter Notebook (.ipynb), Matlab (.m), PowerPoint (.ppt or .pptx)

Our repositories strive to deliver the best quality metadata for datasets. Our curators will review metadata associated with your submitted data, as outlined below.

The completeness and correctness of the (meta)data is the responsibility of the contributor.

 

Metadata TypeRequired MetadataOptional MetadataComment(s)
Data SourceTitleTitle is concise and describes the dataset content
AbstractAbstract uses full sentences to describe what measurements were
done, the locations, and for what purpose. Avoid abbreviations &
acronyms
SampleSample nameUse of the International Geo Sample Numbers (IGSN) is recommended
When applicable include CAPTEM ID
Latitude and Longitude in decimal degreesInclude geospatial information unless not applicable, i.e. experimental samples, extraterrestrial samples
Sample DescriptorsInclude classification, age, stratigraphic position and any additional information on how, where and when the sample was collected
Data and MethodsEach analyte (parameter) in the data table(s) is linked to a sample name or number.
All analytes and units are clearly stated and defined.Avoid or define abbreviations. Relevant standards and reference measurements should be included
when possible.
Technique-Instrument-LaboratoryUse these metadata to define the method code for each analyte (parameter)
Compilation DatasetReference sheet/tab with full citations of all referred publications
Related Publication InformationAuthors, Year, Title, Publisher, (Status)Please include a citation or DOI for the manuscript in which the dataset will be used. If your manuscript is not published yet, include the status (in prep., submitted, in review, accepted)
Funding InformationNSF or NASA Award numbersThis will allow tracking of datasets by Award Number and show compliance with Data Management policies of funding agencies

The EarthChem Library is a curated file repository, you will receive your dataset DOI only after your files and submission has been reviewed and accepted. Please keep in mind that this entire process can take up to two weeks. Complete datasets following all requirements are often published within a few business days.

Datasets in the Library can be identified, shared, published and cited by using a Digital Object Identifier (DOI). The EarthChem Library is part of IEDA, a publication agent with the DataCite consortium.

We ask that you cite the dataset with its dataset DOI. The citation should be formatted as follows:

  • Creator(s) (PublicationYear): Title. Publisher. Identifier. Data Accessed.

For example, a dataset published by IEDA in the EarthChem Library in the DOI system is Whole Rock Helium Data from ODP Site U1256D. Its data DOI is 10.1594/IEDA/100734. Its citation would be:

  • Kurz, M.D.; Curtice, J. (2018): Whole Rock Helium Data from ODP Site U1256D. Interdisciplinary Earth Data Alliance (IEDA). http://dx.doi.org/10.1594/IEDA/100734. Accessed 29 March 2018.

Access to the EarthChem Library is open and free but you may not circulate or publish materials obtained from this site unless you adhere to the licensing requirements as stipulated by the provider of the dataset(s).

The EarthChem Library guarantees long-term availability of its content through Amazon S3 Glacier.

Use of digital materials obtained from the EarthChem Library is licensed under varying Creative Commons licenses that can be selected by the data contributor. EarthChem Library strongly recommends use of the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license. The license that is applicable for a given dataset can be found on its landing page.

 

Contributors have 4  Creative Commons data file license options to choose from:


Submission Guidelines

Data can be contributed to ECL in any format, but needs to be documented with relevant information regarding the analytical data quality. Please confirm that your dataset follows the below requirements and suggestions. Incomplete datasets will take longer to process and publish.

The EarthChem Library conducts a review for quality of sample and analytical metadata, not a review of geochemical data. The author of the dataset is responsible for ensuring geochemical data submitted to PetDB is quality data.

File Format

  • If you are submitting tabular data it should be in an ECL template but must at the very least be in a .csv, .txt, or .xlsx file (not in .pdf, .docx, .jpeg, etc.). For more information on file formats please consult our file policies.
  • We strongly recommend the use of data submission templates. Templates are available for various data types such as bulk analysis (majors, traces, isotopes), mineral analysis (in-situ analysis), and melt inclusion analysis. If a template does not yet exist for your data type, you may contact us about creating one at info@earthchem.org.

Data & Metadata

  • All analytes, units, and technique, instrument and laboratory  must be clearly stated and defined. Relevant standards, reference measurements, and additional information about instrument calibration should also be included when possible.
  • Every analyte must be linked to a sample with a  sample name and geospatial information (unless not applicable, i.e. experimental samples).
  • We strongly recommend use of IGSNs (International Geo Sample Number) as persistent identifiers to identify samples. Obtain IGSNs by registering samples in the System for Earth Sample Registration (SESAR).

References
If submitting a compilation dataset, the ECL requires a reference sheet or tab with full citations of all referred publications.

The EarthChem Library offers multiple Data Submission Templates, the different workbooks have different fields customized to the material analyzed and the method used.

There are several tabs in this workbook to help catalog necessary metadata for our geochemical databases.

1. Mandatory fields are labeled in boldface. The mandatory fields are the minimum requirements for the data to be entered into our searchable databases. The Requirements for the Publication of Geochemical Data document describes why certain data are necessary.

2. Longitude/Latitude should be reported in decimal degrees ideally to at least four decimal places (if the precision is that good),
and negative values for South and West.

3. We strongly recommend using the EarthChem Vocabularies to complete your parameters, technique and unit names, as well as mineral abbreviations.

4. To manipulate data in Excel, the ‘Transpose’ option is useful to transpose rows to columns or vice versa. Use “Paste Special” then check the box for Transpose.

5. More info about IGSNs (International Geo Sample Number) can be obtained at www.geosamples.org/aboutigsn.

6. The Method Code is a numerical value assigned by you that links the column of measured values (Data Tab) with the Method information (Primary Analytical Metadata tab).

File TabDescription
Data SourceInformation to identify the dataset: title, author(s), creator (form-filler)
Mandatory fields: Title, Author(s), Abstract, Contact Info
SamplesInformation to identify, locate, and describe the samples, including IGSNs. Extra fields can be added.
Mandatory fields: Sample name, Latitude, Longitude, Elevation (or Location keywords), Lithology
DataThe measured geochemical data values go here, along with basic information about how that value was derived (what was the material analyzed and what was done to it before the measurement?)
Mandatory fields: Sample name, Analyzed Material, Parameter(s), Method Code(s)
Primary Analytical MetadataInformation about the analytical procedure and accuracy and reproducibility, including technique, laboratory, and reference sample information.
Mandatory fields: Method Code, Parameter, Technique, Instrument, Laboratory
Method-specific MetadataInformation about detection limit, total procedural blank, normalization, fractionation (if applicable). Additional fields can be added.

Title

Your dataset title must contain concise and descriptive information about the content of the dataset (the “what” and “where”, for example “Gakkel Ridge basalt melt inclusion and mineral chemistry”). If submitting data from a publication, the dataset title may be the same as your publication title, but likely will be different since your ECL dataset will paint a different story than your full publication.

Abstract

Your abstract must describe in full sentences the measurements, location, and purpose of the dataset. The abstract should be <250 words and different from your related publication abstract. Avoid abbreviations and acronyms. Include relevant keywords, such as the geographic area or cruise or field program name, for example “This dataset presents analyses of volatile, major, and trace elements for a suite of glasses and melt inclusions from the 85 degrees E segment of the ultra-slow spreading Gakkel Ridge”.

By default the release date is set to the date of submission. If this is not desirable, you may select a release date within the next two years. Every approved ECL dataset receives a DOI and is searchable in the ECL, even if the release date has not yet been reached. This allows investigators to submit datasets in compliance with data management policies of funding agencies and journals without releasing the actual data immediately.

Use of digital materials obtained from the EarthChem Library is licensed under varying Creative Commons licenses that can be selected by the data contributor. EarthChem Library strongly recommends use of the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license. The license that is applicable for a given dataset can be found on its landing page.

 

Contributors have 4  Creative Commons data file license options to choose from:

  • Include as much documentation as possible, taking particular care to include unique identifiers such as IGSNs, related publication DOIs, volcano numbers, cruise DOIs, etc.
  • We strongly encourage US-based researchers to link their NSF award numbers with their ECL datasets. This will allow tracking of datasets by Award number and show compliance with Data Management policies of funding agencies.
  • Make sure your uploaded file is in its final form – the EarthChem Library is a long-term repository, treat the dataset submission like a journal article submission. Once a file has been released for public access, it cannot be changed, instead you would have to publish a new version with a new DOI.

PetDB is a searchable database of published geochemical data for igneous and metamorphic rocks. If you are submitting a geochemical dataset to EarthChem Library derived from igneous and metamorphic rocks you may also suggest your dataset for inclusion in PetDB. PetDB only includes geochemical data that has been published. This includes data from peer-reviewed publications and data that has been published online through the minting of a dataset DOI (such as the ones provided by the EarthChem Library).
If an EarthChem Library dataset is suggested for PetDB by the contributor and has a related manuscript that is in preparation or under review, the dataset will be reviewed for inclusion in PetDB upon publication of the related manuscript. A dataset submitted to the EarthChem Library without a related publication is immediately available for consideration.

The ECL is a curated file repository, you will receive your dataset DOI only after your files and submission has been reviewed and accepted. Please keep in mind that this entire process can take up to two weeks. Complete datasets following all requirements are often published within a few business days.

Submit Data