Recent advances in biocuration: meeting report from the Fifth International Biocuration Conference.

PubWeight™: 0.89

Article: PMC 3483532

Published in Database (Oxford) on October 29, 2012

Authors

Pascale Gaudet (1), Cecilia Arighi, Frederic Bastian, Alex Bateman, Judith A Blake, Michael J Cherry, Peter D'Eustachio, Robert Finn, Michelle Giglio, Lynette Hirschman, Renate Kania, William Klimke, Maria Jesus Martin, Ilene Karsch-Mizrachi, Monica Munoz-Torres, Darren Natale, Claire O'Donovan, Francis Ouellette, Kim D Pruitt, Marc Robinson-Rechavi, Susanna-Assunta Sansone, Paul Schofield, Granger Sutton, Kimberly Van Auken, Sona Vasudevan, Cathy Wu, Jasmine Young, Raja Mazumder

Author Affiliations

1: International Society for Biocuration and CALIPHO Group, Swiss Institute of Bioinformatics, 1 Rue Michel Servet, Geneva, Switzerland. pascale.gaudet@isb-sib.ch

Articles cited by this

MicroScope: a platform for microbial genome annotation and comparative genomics. Database (Oxford) (2009) 3.87

Text mining for the biocuration workflow. Database (Oxford) (2012) 2.37

MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database. Database (Oxford) (2012) 2.35

Towards BioDBcore: a community-defined information specification for biological databases. Nucleic Acids Res (2010) 1.97

Tracking and coordinating an international curation effort for the CCDS Project. Database (Oxford) (2012) 1.78

PRIDE: quality control in a proteomics data repository. Database (Oxford) (2012) 1.63

Biocurators and biocuration: surveying the 21st century challenges. Database (Oxford) (2012) 1.46

Using ODIN for a PharmGKB revalidation experiment. Database (Oxford) (2012) 1.29

Towards BioDBcore: a community-defined information specification for biological databases. Database (Oxford) (2011) 1.24

Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees. Database (Oxford) (2012) 1.04

Tetrahymena Genome Database Wiki: a community-maintained model organism database. Database (Oxford) (2012) 0.98

Aptamer Base: a collaborative knowledge base to describe aptamers and SELEX experiments. Database (Oxford) (2012) 0.94

CvManGO, a method for leveraging computational predictions to improve literature-based Gene Ontology annotations. Database (Oxford) (2012) 0.92

The importance of identifying alternative splicing in vertebrate genome annotation. Database (Oxford) (2012) 0.87

Building a biomedical semantic network in Wikipedia with Semantic Wiki Links. Database (Oxford) (2012) 0.83

Articles by these authors

The COG database: an updated version includes eukaryotes. BMC Bioinformatics (2003) 60.98

The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res (2003) 52.80

The diploid genome sequence of an individual human. PLoS Biol (2007) 44.80

miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res (2006) 39.25

The Pfam protein families database. Nucleic Acids Res (2009) 37.98

The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol (2007) 35.41

Pfam: clans, web tools and services. Nucleic Acids Res (2006) 34.83

The Pfam protein families database. Nucleic Acids Res (2011) 33.46

The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol (2008) 31.04

The Pfam protein families database. Nucleic Acids Res (2007) 30.53

UniProt: the Universal Protein knowledgebase. Nucleic Acids Res (2004) 29.05

NCBI Reference Sequences: current status, policy and new initiatives. Nucleic Acids Res (2008) 26.04

GenBank. Nucleic Acids Res (2007) 25.54

Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res (2005) 25.49

InterPro: the integrative protein signature database. Nucleic Acids Res (2008) 25.07

The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res (2003) 24.72

The Universal Protein Resource (UniProt). Nucleic Acids Res (2005) 23.66

The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol (2007) 23.58

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2005) 22.98

Rfam: an RNA family database. Nucleic Acids Res (2003) 22.93

The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res (2006) 22.70

Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res (2005) 22.62

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2007) 22.53

Pfam: the protein families database. Nucleic Acids Res (2013) 22.48

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2008) 21.36

Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res (2006) 20.92

PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Res (2004) 19.62

GenBank. Nucleic Acids Res (2005) 19.25

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.85

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2006) 18.84

Toward an online repository of Standard Operating Procedures (SOPs) for (meta)genomic annotation. OMICS (2008) 18.69

Evolution of genes and genomes on the Drosophila phylogeny. Nature (2007) 18.01

InterPro, progress and status in 2005. Nucleic Acids Res (2005) 17.53

UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics (2007) 17.43

GenBank. Nucleic Acids Res (2002) 17.24

GenBank. Nucleic Acids Res (2007) 16.92

Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res (2008) 15.69

ArrayExpress--a public repository for microarray gene expression data at the EBI. Nucleic Acids Res (2003) 15.48

CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res (2003) 14.38

The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol (2007) 13.99

InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res (2011) 13.45

Reactome: a knowledge base of biologic pathways and processes. Genome Biol (2007) 13.36

The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics. PLoS Biol (2003) 13.32

GenBank. Nucleic Acids Res (2008) 13.29

Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project. Nat Biotechnol (2008) 12.96

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2009) 12.51

New developments in the InterPro database. Nucleic Acids Res (2007) 12.49

ArrayExpress update--from an archive of functional genomics experiments to the atlas of gene expression. Nucleic Acids Res (2008) 12.45

GenBank: update. Nucleic Acids Res (2004) 12.28

Prepublication data sharing. Nature (2009) 12.24

GenBank. Nucleic Acids Res (2006) 12.21

Rfam: updates to the RNA families database. Nucleic Acids Res (2008) 11.61

Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res (2010) 11.23

GenBank. Nucleic Acids Res (2009) 11.11

Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature (2004) 11.03

Aggressive assembly of pyrosequencing reads with mates. Bioinformatics (2008) 11.01

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2010) 10.97

GenBank. Nucleic Acids Res (2012) 10.89

Big data: The future of biocuration. Nature (2008) 10.81

The GOA database in 2009--an integrated Gene Ontology Annotation resource. Nucleic Acids Res (2008) 10.21

The Mouse Genome Database (MGD): mouse biology and model systems. Nucleic Acids Res (2007) 9.75

GenBank. Nucleic Acids Res (2003) 9.60

QuickTree: building huge Neighbour-Joining trees of protein sequences. Bioinformatics (2002) 9.36

The BioPAX community standard for pathway data sharing. Nat Biotechnol (2010) 9.19

Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res (2010) 9.09

The Mouse Genome Database (MGD): integrating biology with the genome. Nucleic Acids Res (2004) 8.91

GenBank. Nucleic Acids Res (2011) 8.85

GenBank. Nucleic Acids Res (2010) 8.63

Database resources of the National Center for Biotechnology Information. Nucleic Acids Res (2011) 8.62

Assembly algorithms for next-generation sequencing data. Genomics (2010) 8.56

The Reactome pathway knowledgebase. Nucleic Acids Res (2013) 8.56

The BioGRID Interaction Database: 2011 update. Nucleic Acids Res (2010) 8.46

The Mouse Genome Database (MGD): from genes to mice--a community resource for mouse biology. Nucleic Acids Res (2005) 8.19

BioCreAtIvE task 1A: gene mention finding evaluation. BMC Bioinformatics (2005) 8.09

The amphioxus genome and the evolution of the chordate karyotype. Nature (2008) 8.03

Shotgun sequence assembly and recent segmental duplications within the human genome. Nature (2004) 7.91

The RCSB Protein Data Bank: redesigned web site and web services. Nucleic Acids Res (2010) 7.68

Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol (2011) 7.53

The MGED Ontology: a resource for semantics-based description of microarray experiments. Bioinformatics (2006) 7.37

The mouse Gene Expression Database (GXD): updates and enhancements. Nucleic Acids Res (2004) 7.26

Integrative annotation of 21,037 human genes validated by full-length cDNA clones. PLoS Biol (2004) 7.17

A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomes. Proc Natl Acad Sci U S A (2006) 7.17

iPfam: visualization of protein-protein interactions in PDB at domain and amino acid resolutions. Bioinformatics (2004) 7.02

WormBase: better software, richer content. Nucleic Acids Res (2006) 6.78

BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res (2011) 6.69

Rfam: Wikipedia, clans and the "decimal" release. Nucleic Acids Res (2010) 6.58

A promoter-level mammalian expression atlas. Nature (2014) 6.25

MMDB: Entrez's 3D-structure database. Nucleic Acids Res (2003) 6.14

Rfam 11.0: 10 years of RNA families. Nucleic Acids Res (2012) 6.14

Enhanced protein domain discovery by using language modeling techniques from speech recognition. Proc Natl Acad Sci U S A (2003) 6.01

Evaluation of text data mining for database curation: lessons learned from the KDD Challenge Cup. Bioinformatics (2003) 5.77