Biodiversity data: data paper

Describe and share your {meta}data

November 2024

Olivier Norvez

Sophie Pamerlon

Yvan Le Bras

Animation coordinator
@PNDB  
@DataTerra  

Data engineer
@GBIF-France  

Scientific and technical coordinator
@PNDB  
@DataTerra  

Table of contents

Table of contents








Context and issues


Description


Process


Resources

Data paper : reminder of the context and the issues

Data paper : reminder of the context and the issues

Heterogeneity (data types, origin, standards) &
Diversity of “objects” to be linked together
1

Loss of information over time2

Data paper : reminder of the context and the issues

Computational reproducibility frequently refers to the ability to generate equivalent analytical outcomes from the same data set using the same code and software1.

[…] all raw data and metadata, code, programming scripts, and bespoke software necessary for fully replicating any analyses that lead to inferences made in a published study2.

Data paper : reminder of the context and the issues

Research data are defined as factual records in the form of figures, texts, images and sounds which are used as the main sources for scientific research and which the scientific community generally recognizes as being necessary to validate research results1.

Metadata, which can be simply defined as “data about data,” is a way of naming things and representing data and their relationships […] Metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use or manage an information resource2.

Data paper : reminder of the context and the issues

Data paper : reminder of the context and the issues


Data life cycle


FAIR Principles




Flux and stocks of data


Data paper description

data paper : Description

A data paper is a scientific publication that precisely describes a data set, and informs the scientific community of its existence, its methods and its potential for reuse1

data paper : Description

A scientific publication whose primary purpose is to describe a data set or group of data sets, rather than to report a research investigation.

  • DOI: indexing and citation : Data Papers are indexed by Web of Knowledge (ISI), PubMedCentral, Scopus, Zoological Record, Google Scholar, CAB Abstracts, Directory of Open Access Journal (DOAJ), EBSCO.

  • Title : Promotion and publicizing data

  • Authors: Acknowledgement and credit for data publishers through scientific publication

  • Abstract: description of data in a structured, human-readable form

  • Other sections : material and methods, taxonomic coverage, geographic coverage, descriptive statistics, …

data paper : Description

data paper from Research Le Bras et al., 2017

data paper from Policies Lepareur et al., 2022

data paper from citizen sciences Coché et al., 2021

data paper : Description

Adding statistical analyses or graphical representations is possible (and recommanded)

data paper : Description


The Integrated Publishing Toolkit (IPT) developed by the GBIF:

- is a free open-source software and used by organizations around the world to create and manage repositories for sharing biodiversity datasets.

- facilitates the filling of metadata and the automated production of a Data Paper manuscript.

Data paper : description

back to the standardization

data paper : description

Data paper : process

Data paper : process

good Level of FAIRness

  • Open data (CC-BY 4.0 compatible with Etalab)
  • Mandatory license
  • Direct link to download raw datasets
  • Thematic scope (All biodiversity including paleo- and archaeo-biodiversity)
  • Geographic scope (Data produced by France)
  • Temporal coverage (at least one data acquisition date)
  • Abstract
  • Title, authors and contacts
  • Acquisition framework (at least via a text field)
  • DOI / unique identifiers
  • taxonomic coverage (if taxa are present)
  • keywords related to the Thesaurus
  • Data attributes (Dictionary of data attributes with units and descriptions)
  • Semantic annotation (Keywords and attribute names, unlimited usable resources)

Data paper : process

Pipeline to describe and to publish a data paper

Data papers : resources

Data Share

by the CEntre for the Synthesis and Analysis of Biodiversity (CESAB) - French Foundation for Biodiversity Research (FRB).

The aim of this call is to accelerate the sharing of open-access and large scale ‘novel’ biodiversity related datasets.

 For more information: Data Share

OpenMetaPaper

by the French Biodiversity data hub (PNBD - Data Terra) and the French nodal point of the GBIF

The aims are to increase the opening of research data around the use of the EML standard and its links with other data and metadata standards in ecology

 For more information: OpenMetaPaper

Support and training by th best teams ever ;)

Data paper : resources

listing of journals accepting data paper on GBIF.org

Data paper : resources

Ecological journals with “ethical” aspects on Dafnee ISEM

Data paper : resources