Publish with PLOS:
- Rapid publication
- Unlimited readership
- High impact
Signup to get PLOS news
Latest from PLOS
Data are any and all of the digital materials that are collected and analyzed in the pursuit of scientific advances. In line with Open Access to research articles themselves, PLOS strongly believes that to best foster scientific progress, the underlying data should be made freely available for researchers to use, wherever this is legal and ethical. Data availability allows replication, reanalysis, new analysis, interpretation, or inclusion into meta-analyses, and facilitates reproducibility of research, all providing a better ‘bang for the buck’ out of scientific research, much of which is funded from public or nonprofit sources. Ultimately, all of these considerations aside, our viewpoint is quite simple: ensuring access to the underlying data should be an intrinsic part of the scientific publishing process.
PLOS journals have requested data be available since their inception, but we believe that providing more specific instructions for authors regarding appropriate data deposition options, and providing more information in the published article as to how to access data, is important for readers and users of the research we publish. As a result, PLOS is now releasing a revised Data Policy that will come into effect on March 1, 2014, in which authors will be required to include a data availability statement in all research articles published by PLOS journals; the policy can be found below. This policy was developed after extensive consultation with PLOS in-house professional and external Academic Editors and Editors in Chief, who are practicing scientists from a variety of disciplines.
We now welcome input from the larger community of authors, researchers, patients, and others, and invite you to comment before March. We encourage you to contact us collectively at email@example.com; feedback via Twitter and other sources will also be monitored. You may also contact individual PLOS journals directly.
Theo Bloom, Editorial Director for Biology
Emma Ganley, Senior Editor, PLOS Biology
Margaret Winker, Senior Research Editor, PLOS Medicine
for the PLOS Data Group
We thank all the members of the PLOS Data Policy team, PLOS staff, and Academic Editors and Editors in Chief for all their invaluable contributions to this policy and process. We particularly thank Emma Veitch, Senior Editor, PLOS ONE, for her leadership in bringing this policy to fruition.
PLOS Data Policy
from March 1, 2014
Data in supporting information files:
For smaller datasets and certain data types, authors may upload data as supporting information files accompanying the manuscript. Authors should take care to maximize the accessibility and reusability of the data by selecting a file format from which data can be efficiently extracted (for example, spreadsheets are preferable to PDF when providing tabulated data).
If data deposition or provision in supporting information is not ethical or legal (e.g., underlying data pose privacy or legal concerns, or include human participants3), the following two methods may be acceptable alternatives, subject to case-by-case evaluation:
Data made available to all interested researchers upon request. Data Availability Statement must specify “Data available on request” and identify the group to which requests should be submitted (e.g., a named data access committee or named ethics committee). The reasons for restrictions on public data deposition must also be specified. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.
Data available from third party. In the case of a primary dataset that was not originally generated by the authors of the submitted manuscript, appropriate data sharing may require that interested researchers obtain third-party data independently from the named original source. In this case, the Data Availability Statement must state the source of the data with full citation and, if the dataset cannot be provided, indicate “Data available from (named source).” The reasons for restrictions on public data deposition must also be specified.
Unacceptable data access restrictions: PLOS journals will not consider manuscripts where the following factors influence ability to share data:
- Authors will not share data because of personal interests, such as patents or potential future publications.
- The conclusions depend solely on the analysis of proprietary data (e.g., data owned by commercial interests, or copyrighted data). If proprietary data are used, the manuscript must include an analysis of public data that validates the conclusions so others can reproduce the analysis and build on the findings.
1. Definition of data that must be shared
PLOS defines the “minimal dataset” to consist of the dataset used to reach the conclusions drawn in the manuscript with related metadata and methods, and any additional data required to replicate the reported study findings in their entirety. Core descriptive data, methods, and study results should be included within the main paper, regardless of data deposition. PLOS does not accept references to “data not shown”. Authors who have datasets too large for sharing via repositories or uploaded files should contact the relevant journal for advice.
2. Guidance on data repositories
PLOS requires that authors comply with field-specific standards forpreparation and recording of data and to select repositories appropriate to their field, for exampledeposition of microarray data in ArrayExpress or GEO; deposition of gene sequences in GenBank, EMBL or DDBJ; and deposition of ecological data in Dryad. Authors are encouraged to select repositories that meet accepted criteria as trustworthy digital repositories, such as criteria of theCentre for Research Libraries orData Seal of Approval. Large, international databases are more likely to persist than small, local ones. Copyright licensing for data held in repositories may be unclear. If authors use repositories with stated licensing policies; the policies should not be more restrictive than CC-BY.
3. Guidance on sharing datasets that derive from clinical studies or other work involving human participants
For studies involving human participants, data must be handled so as to not compromise study participants’ privacy. PLOS recommends that researchers follow established guidance and applicable local laws in ensuring they do not compromise participant privacy. Resources which researchers may consult for guidance include:
Steps necessary to protect privacy may include de-identification, blocking portions of the database, or license agreements directed specifically at privacy concerns. Authors should indicate, as part of the ethics statement, the ways in which the study participants’ privacy was preserved. If license agreements apply, authors should note the process necessary for other researchers to obtain a license.
Latest from PLOS Blogs
This Week in PLOS NTDs and PLOS Pathogens: NTDs and the End fo the UN MDG, Understanding Defense Mechanisms at the Maternal-fetal Interface, Low-cost Field Microscopy, and More
The following new articles are publishing this week in PLOS NTDs: The United Nations Millennium Development Goals (MDGs) expire in 2015, and there is currently much debate, discussion and jostling for position regarding what should replace them. The results of …
Jeffrey Sachs: On climate, more ‘now’ and ‘how’ is needed
Read more from PLOS Blogs.
PLOS Articles In The News
- Robotic Ants Mimic Real Colonies
- Nature Documentaries are Good for the Brain
- How We Learn with Sesame Street
- Functional Cure for HIV Effective in Some Patients
- One Ciliate, Seven Sexes
- Konzo: A Neglected Tropical Disease Reviewed
- Diagnosing a 500 Year-Old Mummy
- Murder Rate of Women in South Africa Unchanged
- Brown-Eyed Boy: Trustworthiness and Eye Color
- Insights into Breast, Ovarian and Testicular Cancers