The FAIR Guiding Principles for scientific data management and stewardship.

PubWeight™: 5.76‹?› | Rank: Top 1%

🔗 View Article (PMID 26978244)

Published in Sci Data on March 15, 2016

Authors

Mark D Wilkinson1, Michel Dumontier2, I Jsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton3, Arie Baak4, Niklas Blomberg5, Jan-Willem Boiten6, Luiz Bonino da Silva Santos7, Philip E Bourne8, Jildau Bouwman9, Anthony J Brookes10, Tim Clark11, Mercè Crosas12, Ingrid Dillo13, Olivier Dumon, Scott Edmunds14, Chris T Evelo15, Richard Finkers16, Alejandra Gonzalez-Beltran17, Alasdair J G Gray18, Paul Groth, Carole Goble19, Jeffrey S Grethe20, Jaap Heringa21, Peter A C 't Hoen22, Rob Hooft23, Tobias Kuhn24, Ruben Kok21, Joost Kok25, Scott J Lusher26, Maryann E Martone27, Albert Mons28, Abel L Packer29, Bengt Persson30, Philippe Rocca-Serra17, Marco Roos31, Rene van Schaik32, Susanna-Assunta Sansone17, Erik Schultes33, Thierry Sengstag34, Ted Slater35, George Strawn, Morris A Swertz36, Mark Thompson31, Johan van der Lei37, Erik van Mulligen37, Jan Velterop38, Andra Waagmeester39, Peter Wittenburg40, Katherine Wolstencroft41, Jun Zhao42, Barend Mons43,26,37

Author Affiliations

1: Center for Plant Biotechnology and Genomics, Universidad Politécnica de Madrid, Madrid 28223, Spain.
2: Stanford University, Stanford 94305-5411, USA.
3: Nature Genetics, New York 10004-1562, USA.
4: Euretos and Phortos Consultants, Rotterdam 2741 CA, The Netherlands.
5: ELIXIR, Wellcome Genome Campus, Hinxton CB10 1SA, UK.
6: Lygature, Eindhoven 5656 AG, The Netherlands.
7: Vrije Universiteit Amsterdam, Dutch Techcenter for Life Sciences, Amsterdam 1081 HV, The Netherlands.
8: RCSB Protein Data Bank, San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.
9: TNO, Zeist 3700 AJ, The Netherlands.
10: Department of Genetics, University of Leicester, Leicester LE1 7RH, UK.
11: Harvard Medical School, Boston, Massachusetts MA 02115, USA.
12: Harvard University, Cambridge, Massachusetts MA 02138, USA.
13: Data Archiving and Networked Services (DANS), The Hague 2593 HW, The Netherlands.
14: GigaScience, Beijing Genomics Institute, Shenzhen 518083, China.
15: Department of Bioinformatics, Maastricht University, Maastricht 6200 MD, The Netherlands.
16: Wageningen UR Plant Breeding, Wageningen 6708 PB, The Netherlands.
17: Oxford e-Research Center, University of Oxford, Oxford OX1 3QG, UK.
18: Heriot-Watt University, Edinburgh EH14 4AS, UK.
19: School of Computer Science, University of Manchester, Manchester M13 9PL, UK.
20: Center for Research in Biological Systems, School of Medicine, University of California San Diego, La Jolla, California 92093-0446, USA.
21: Dutch Techcenter for the Life Sciences, Utrecht 3501 DE, The Netherlands.
22: Department of Human Genetics, Leiden University Medical Center, Dutch Techcenter for the Life Sciences, Leiden 2300 RC, The Netherlands.
23: Dutch TechCenter for Life Sciences and ELIXIR-NL, Utrecht 3501 DE, The Netherlands.
24: VU University Amsterdam, Amsterdam 1081 HV, The Netherlands.
25: Leiden Center of Data Science, Leiden University, Leiden 2300 RA, The Netherlands.
26: Netherlands eScience Center, Amsterdam 1098 XG, The Netherlands.
27: National Center for Microscopy and Imaging Research, UCSD, San Diego 92103, USA.
28: Phortos Consultants, San Diego 92011, USA.
29: SciELO/FAPESP Program, UNIFESP Foundation, São Paulo 05468-901, Brazil.
30: Bioinformatics Infrastructure for Life Sciences (BILS), Science for Life Laboratory, Dept of Cell and Molecular Biology, Uppsala University, S-751 24, Uppsala, Sweden.
31: Leiden University Medical Center, Leiden 2333 ZA, The Netherlands.
32: Bayer CropScience, Gent Area 1831, Belgium.
33: Leiden Institute for Advanced Computer Science, Leiden University Medical Center, Leiden 2300 RA, The Netherlands.
34: Swiss Institute of Bioinformatics and University of Basel, Basel 4056, Switzerland.
35: Cray, Inc., Seattle 98164, USA.
36: University Medical Center Groningen (UMCG), University of Groningen, Groningen 9713 GZ, The Netherlands.
37: Erasmus MC, Rotterdam 3015 CE, The Netherlands.
38: Independent Open Access and Open Science Advocate, Guildford GU1 3PW, UK.
39: Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Micelio, Antwerp, 2180 Antwerp, Belgium.
40: Max Planck Compute and Data Facility, MPS, Garching 85748, Germany.
41: Leiden Institute of Advanced Computer Science, Leiden University, Leiden 2333 CA, The Netherlands.
42: Department of Computer Science, Oxford University, Oxford OX1 3QD, UK.
43: Leiden University Medical Center, Leiden and Dutch TechCenter for Life Sciences, Utrecht 2333 ZA, The Netherlands.

Articles citing this

Making sense of big data in health research: Towards an EU action plan. Genome Med (2016) 1.55

BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences. Database (Oxford) (2016) 0.98

The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation. J Biomed Semantics (2016) 0.97

Measures for interoperability of phenotypic data: minimum information requirements and formatting. Plant Methods (2016) 0.86

Discovering and linking public omics data sets using the Omics Discovery Index. Nat Biotechnol (2017) 0.86

Identifying ELIXIR Core Data Resources. F1000Res (2016) 0.85

DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res (2016) 0.84

Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base. Front Plant Sci (2016) 0.84

PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database. Nucleic Acids Res (2016) 0.84

Data Citation in Neuroimaging: Proposed Best Practices for Data Identification and Attribution. Front Neuroinform (2016) 0.81

Developing a strategy for computational lab skills training through Software and Data Carpentry: Experiences from the ELIXIR Pilot action. F1000Res (2017) 0.80

Let referees see the data. Sci Data (2016) 0.80

Redefining 'stress resistance genes', and why it matters. J Exp Bot (2016) 0.78

The health care and life sciences community profile for dataset descriptions. PeerJ (2016) 0.78

Perspectives and expectations in structural bioinformatics of metalloproteins. Proteins (2017) 0.77

A community proposal to integrate proteomics activities in ELIXIR. F1000Res (2017) 0.76

Precision annotation of digital samples in NCBI's gene expression omnibus. Sci Data (2017) 0.76

Propelling the paradigm shift from reductionism to systems nutrition. Genes Nutr (2017) 0.75

Reproducibility will only come with data liberation. Sci Transl Med (2016) 0.75

A metadata-driven approach to data repository design. J Cheminform (2017) 0.75

Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open (2017) 0.75

Cyberinfrastructure for Open Science at the Montreal Neurological Institute. Front Neuroinform (2017) 0.75

A fast and efficient python library for interfacing with the Biological Magnetic Resonance Data Bank. BMC Bioinformatics (2017) 0.75

Towards an open grapevine information system. Hortic Res (2016) 0.75

Molecular phenotyping of multiple mouse strains under metabolic challenge uncovers a role for Elovl2 in glucose-induced insulin secretion. Mol Metab (2017) 0.75

FAIR principles for data stewardship. Nat Genet (2016) 0.75

Guideline recommendations for diagnosis and clinical management of Ring14 syndrome-first report of an ad hoc task force. Orphanet J Rare Dis (2017) 0.75

FAIRDOMHub: a repository and collaboration environment for sharing systems biology research. Nucleic Acids Res (2016) 0.75

Reproducibility of histopathological findings in experimental pathology of the mouse: a sorry tail. Lab Anim (NY) (2017) 0.75

Should biomedical research be like Airbnb? PLoS Biol (2017) 0.75

Embracing Complexity beyond Systems Medicine: A New Approach to Chronic Immune Disorders. Front Immunol (2016) 0.75

Irreproducibility of published bioscience research: Diagnosis, pathogenesis and therapy. Mol Metab (2016) 0.75

Reproducible and reusable research: are journal data sharing policies meeting the mark? PeerJ (2017) 0.75

Empowering pharmacoinformatics by linked life science data. J Comput Aided Mol Des (2016) 0.75

An assessment of the quality of the I-DSD and the I-CAH registries - international registries for rare conditions affecting sex development. Orphanet J Rare Dis (2017) 0.75

Fairness in scientific publishing. F1000Res (2016) 0.75

Integration of EGA secure data access into Galaxy. F1000Res (2016) 0.75

Open data: support from Swiss funder. Nature (2017) 0.75

Envisioning the Future of 'Big Data' Biomedicine. J Biomed Inform (2017) 0.75

Supporting evidence-based analysis for modified risk tobacco products through a toxicology data-sharing infrastructure. F1000Res (2017) 0.75

The future of metabolomics in ELIXIR. F1000Res (2017) 0.75

Blowing a breath of fresh share on data. J Comput Aided Mol Des (2016) 0.75

ACCESS climate data management. Ambio (2017) 0.75

A Case for Data Commons: Toward Data Science as a Service. Comput Sci Eng (2016) 0.75

Compliance with minimum information guidelines in public metabolomics repositories. Sci Data (2017) 0.75

Introducing the Brassica Information Portal: Towards integrating genotypic and phenotypic Brassica crop data. F1000Res (2017) 0.75

The TB Portals: An open-access, web-based platform for global drug-resistant tuberculosis data sharing and analysis. J Clin Microbiol (2017) 0.75

Building the biomedical data science workforce. PLoS Biol (2017) 0.75

Developing a framework for digital objects in the Big Data to Knowledge (BD2K) commons: Report from the Commons Framework Pilots workshop. J Biomed Inform (2017) 0.75

Big Data in radiation therapy: challenges and opportunities. Br J Radiol (2016) 0.75

Towards a systems approach for chronic diseases, based on health state modeling. F1000Res (2017) 0.75

Scientific user requirements for a herbarium data portal. PhytoKeys (2017) 0.75

The ELIXIR-EXCELERATE Train-the-Trainer pilot programme: empower researchers to deliver high-quality training. F1000Res (2017) 0.75

Tapping the Vast Potential of the Data Deluge in Small-scale Food-Animal Production Businesses: Challenges to Near Real-time Data Analysis and Interpretation. Front Vet Sci (2017) 0.75

Comprehending the Health Informatics Spectrum: Grappling with System Entropy and Advancing Quality Clinical Research. Front Public Health (2017) 0.75

Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session. J Digit Imaging (2017) 0.75

Looking back: forward looking. Gigascience (2017) 0.75

Articles cited by this

The Protein Data Bank. Nucleic Acids Res (2000) 187.10

Announcing the worldwide Protein Data Bank. Nat Struct Biol (2003) 17.06

UniProt: a hub for protein information. Nucleic Acids Res (2014) 16.72

Macromolecular Crystallographic Information File. Methods Enzymol (1997) 11.94

GenBank. Nucleic Acids Res (2012) 10.89

Toward interoperable bioscience data. Nat Genet (2012) 4.72

The RCSB Protein Data Bank: views of structural biology for basic and applied research and education. Nucleic Acids Res (2014) 3.57

Ten simple rules for reproducible computational research. PLoS Comput Biol (2013) 3.06

Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format. Nucleic Acids Res (2011) 2.57

PDBe: Protein Data Bank in Europe. Nucleic Acids Res (2013) 2.16

From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics. PLoS One (2015) 1.95

openBIS: a flexible framework for managing and analyzing complex data in biology research. BMC Bioinformatics (2011) 1.91

Public Data Archiving in Ecology and Evolution: How Well Are We Doing? PLoS Biol (2015) 1.71

Achieving human and machine accessibility of cited data in scholarly publications. PeerJ Comput Sci (2015) 1.54

SEEK: a systems biology data and model management platform. BMC Syst Biol (2015) 1.30

The center for expanded data annotation and retrieval. J Am Med Inform Assoc (2015) 1.26

linkedISA: semantic representation of ISA-Tab experimental metadata. BMC Bioinformatics (2014) 1.25

Articles by these authors

The RCSB Protein Data Bank: views of structural biology for basic and applied research and education. Nucleic Acids Res (2014) 3.57

Open PHACTS: semantic interoperability for drug discovery. Drug Discov Today (2012) 3.15

The value of data. Nat Genet (2011) 2.40

The altmetrics collection. PLoS One (2012) 1.70

The Human Phenotype Ontology: Semantic Unification of Common and Rare Disease. Am J Hum Genet (2015) 1.52

How open science helps researchers succeed. Elife (2016) 1.48

WikiPathways: capturing the full diversity of pathway knowledge. Nucleic Acids Res (2015) 1.35

International Cooperation to Enable the Diagnosis of All Rare Genetic Diseases. Am J Hum Genet (2017) 0.78

Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis. Nat Commun (2016) 0.76

Both Intrinsic Substrate Preference and Network Context Contribute to Substrate Selection of Classical Tyrosine Phosphatases. J Biol Chem (2017) 0.75

Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources. PLoS Comput Biol (2016) 0.75

Computational studies of human class V alcohol dehydrogenase - the odd sibling. BMC Biochem (2016) 0.75

Should biomedical research be like Airbnb? PLoS Biol (2017) 0.75

Increasing phenotypic annotation improves the diagnostic rate of exome sequencing in a rare neuromuscular disorder. Hum Mutat (2019) 0.75

Ten simple rules in considering a career in academia versus government. PLoS Comput Biol (2017) 0.75

Erratum to: The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminform (2017) 0.75

A characterization of cis- and trans-heritability of RNA-Seq-based gene expression. Eur J Hum Genet (2019) 0.75

The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminform (2017) 0.75

Impact of common genetic determinants of Hemoglobin A1c on type 2 diabetes risk and diagnosis in ancestrally diverse populations: A transethnic genome-wide meta-analysis. PLoS Med (2017) 0.75

Ten simple rules to consider regarding preprint submission. PLoS Comput Biol (2017) 0.75

Exhaustive search for epistatic effects on the human methylome. Sci Rep (2017) 0.75

The international MAQC Society launches to enhance reproducibility of high-throughput technologies. Nat Biotechnol (2017) 0.75

Developing international open science collaborations: Funder reflections on the Open Science Prize. PLoS Biol (2017) 0.75

Finding useful data across multiple biomedical data repositories using DataMed. Nat Genet (2017) 0.75