Data integration in the era of omics: current and future challenges.

PubWeight™: 1.82 | Rank: Top 3%

Article: PMC 4101704

Published in BMC Syst Biol on March 13, 2014

Authors

David Gomez-Cabrero, Imad Abugessaisa, Dieter Maier, Andrew Teschendorff, Matthias Merkenschlager, Andreas Gisel, Esteban Ballestar, Erik Bongcam-Rudloff, Ana Conesa, Jesper Tegnér

Articles citing this

NetworkAnalyst for statistical, visual and network-based meta-analysis of gene expression data. Nat Protoc (2015) 1.17

Pathogenesis and immunobiology of brucellosis: review of Brucella-host interactions. Am J Pathol (2015) 1.07

From big data analysis to personalized medicine for all: challenges and opportunities. BMC Med Genomics (2015) 1.04

Small-Magnitude Effect Sizes in Epigenetic End Points are Important in Children's Environmental Health Studies: The Children's Environmental Health and Disease Prevention Research Center's Epigenetics Working Group. Environ Health Perspect (2017) 0.90

Mixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants. Sci Rep (2015) 0.89

The quest for tolerant varieties: the importance of integrating "omics" techniques to phenotyping. Front Plant Sci (2015) 0.88

Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems. Front Cell Dev Biol (2014) 0.86

Data- and knowledge-based modeling of gene regulatory networks: an update. EXCLI J (2015) 0.85

A collaborative approach to develop a multi-omics data analytics platform for translational research. Appl Transl Genom (2014) 0.84

diXa: a data infrastructure for chemical safety assessment. Bioinformatics (2014) 0.84

Bioinformatics Mining and Modeling Methods for the Identification of Disease Mechanisms in Neurodegenerative Disorders. Int J Mol Sci (2015) 0.83

Novel multivariate methods for integration of genomics and proteomics data: applications in a kidney transplant rejection study. OMICS (2014) 0.82

Methods for biological data integration: perspectives and challenges. J R Soc Interface (2015) 0.82

Dynamics in Transcriptomics: Advancements in RNA-seq Time Course and Downstream Analysis. Comput Struct Biotechnol J (2015) 0.82

The female gametophyte: an emerging model for cell type-specific systems biology in plant development. Front Plant Sci (2015) 0.81

Public data and open source tools for multi-assay genomic investigation of disease. Brief Bioinform (2015) 0.80

A perspective on bridging scales and design of models using low-dimensional manifolds and data-driven model inference. Philos Trans A Math Phys Eng Sci (2016) 0.79

Analysis of Reverse Phase Protein Array Data: From Experimental Design towards Targeted Biomarker Discovery. Microarrays (Basel) (2015) 0.78

A causal network analysis in an observational study identifies metabolomics pathways influencing plasma triglyceride levels. Metabolomics (2016) 0.78

Joint analysis of multiple phenotypes: summary of results and discussions from the Genetic Analysis Workshop 19. BMC Genet (2016) 0.78

Mildew-Omics: How Global Analyses Aid the Understanding of Life and Evolution of Powdery Mildews. Front Plant Sci (2016) 0.77

Deployment-Associated Exposure Surveillance With High-Resolution Metabolomics. J Occup Environ Med (2016) 0.77

Longitudinal omics modeling and integration in clinical metabonomics research: challenges in childhood metabolic health research. Front Mol Biosci (2015) 0.76

Analyzing the miRNA-Gene Networks to Mine the Important miRNAs under Skin of Human and Mouse. Biomed Res Int (2016) 0.76

The Third International Genomic Medicine Conference (3rd IGMC, 2015): overall activities and outcome highlights. BMC Genomics (2016) 0.76

ENViz: a Cytoscape App for integrated statistical analysis and visualization of sample-matched data with multiple data types. Bioinformatics (2015) 0.75

Why proteomics is not the new genomics and the future of mass spectrometry in cell biology. J Cell Biol (2016) 0.75

The effects of thawing on the plasma metabolome: evaluating differences between thawed plasma and multi-organ samples. Metabolomics (2017) 0.75

PeptiCKDdb: peptide- and protein-centric database for the investigation of genesis and progression of chronic kidney disease. Database (Oxford) (2016) 0.75

Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework. BMC Bioinformatics (2016) 0.75

Parasite genomics: Time to think bigger. PLoS Negl Trop Dis (2017) 0.75

Interdisciplinary approach towards a systems medicine toolbox using the example of inflammatory diseases. Brief Bioinform (2017) 0.75

Computational challenges in modeling gene regulatory events. Transcription (2016) 0.75

Multidimensional Integrative Genomics Approaches to Dissecting Cardiovascular Disease. Front Cardiovasc Med (2017) 0.75

Integrative miRNA-Gene Expression Analysis Enables Refinement of Associated Biology and Prediction of Response to Cetuximab in Head and Neck Squamous Cell Cancer. Genes (Basel) (2017) 0.75

Understanding Physiology in the Continuum: Integration of Information from Multiple -Omics Levels. Front Pharmacol (2017) 0.75

Pan-cancer subtyping in a 2D-map shows substructures that are driven by specific combinations of molecular characteristics. Sci Rep (2016) 0.75

ONION: Functional Approach for Integration of Lipidomics and Transcriptomics Data. PLoS One (2015) 0.75

Data integration in biological research: an overview. J Biol Res (Thessalon) (2015) 0.75

Systems Medicine as an Emerging Tool for Cardiovascular Genetics. Front Cardiovasc Med (2016) 0.75

Inferring differentially expressed pathways using kernel maximum mean discrepancy-based test. BMC Bioinformatics (2016) 0.75

Sharing and Reuse of Sensitive Data and Samples: Supporting Researchers in Identifying Ethical and Legal Requirements. Biopreserv Biobank (2015) 0.75

Recent Achievements in Characterizing the Histone Code and Approaches to Integrating Epigenomics and Systems Biology. Methods Enzymol (2017) 0.75

Articles cited by this

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet (2000) 336.52

Initial sequencing and analysis of the human genome. Nature (2001) 212.86

Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A (2005) 167.46

A map of human genome variation from population-scale sequencing. Nature (2010) 121.13

The sequence of the human genome. Science (2001) 101.55

Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res (2002) 76.95

Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat Genet (2001) 63.36

An integrated map of genetic variation from 1,092 human genomes. Nature (2012) 59.82

The KEGG resource for deciphering the genome. Nucleic Acids Res (2004) 53.05

Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med (2012) 44.56

Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature (2008) 30.29

Creating a bioinformatics nation. Nature (2002) 25.48

GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res (2012) 19.19

GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol (2010) 16.72

A user's guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol (2011) 16.53

NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res (2012) 15.84

Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet (2012) 15.21

Integrated genomic characterization of endometrial carcinoma. Nature (2013) 14.29

TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res (1996) 13.50

Unlocking the secrets of the genome. Nature (2009) 11.80

A map of the cis-regulatory sequences in the mouse genome. Nature (2012) 8.74

The Reactome pathway knowledgebase. Nucleic Acids Res (2013) 8.56

Epigenome-wide association studies for common human diseases. Nat Rev Genet (2011) 7.96

An expansive human regulatory lexicon encoded in transcription factor footprints. Nature (2012) 7.27

Interactome networks and human disease. Cell (2011) 7.16

Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med (2002) 6.53

The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line. Nat Genet (2009) 6.02

ArrayExpress update--trends in database growth and links to data analysis tools. Nucleic Acids Res (2012) 4.91

Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat Methods (2012) 4.89

MicroRNA profiling: approaches and considerations. Nat Rev Genet (2012) 4.39

An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol (2012) 4.15

State of the nation in data integration for bioinformatics. J Biomed Inform (2008) 3.60

Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms. Proc Natl Acad Sci U S A (2003) 3.14

Dynamic regulatory network controlling TH17 cell differentiation. Nature (2013) 3.07

Genomics: ENCODE explained. Nature (2012) 2.62

Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet (2013) 2.58

Circuitry and dynamics of human transcription factor regulatory networks. Cell (2012) 2.56

Hive plots--rational approach to visualizing networks. Brief Bioinform (2011) 2.46

'Big data', Hadoop and cloud computing in genomics. J Biomed Inform (2013) 2.40

BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA. Nucleic Acids Res (2012) 2.33

Challenges and opportunities in mining neuroscience data. Science (2011) 2.32

The impact of cellular networks on disease comorbidity. Mol Syst Biol (2009) 2.23

Identification of transcriptional regulators in the mouse immune system. Nat Immunol (2013) 1.79

RNAcentral: A vision for an international database of RNA sequences. RNA (2011) 1.73

A multivariate analysis approach to the integration of proteomic and gene expression data. Proteomics (2007) 1.39

Immunological Genome Project and systems immunology. Trends Immunol (2013) 1.34

Data integration in genetics and genomics: methods and challenges. Hum Genomics Proteomics (2009) 1.32

Synthetic non-oxidative glycolysis enables complete carbon conservation. Nature (2013) 1.24

High-energy physics: Down the petabyte highway. Nature (2011) 1.06

Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps. Genome Res (2013) 0.90

Data and knowledge integration in the life sciences. Brief Bioinform (2008) 0.88

Pathway network inference from gene expression data. BMC Syst Biol (2014) 0.88

Bioinformatic analysis of proteomics data. BMC Syst Biol (2014) 0.87

STATegra EMS: an Experiment Management System for complex next-generation omics experiments. BMC Syst Biol (2014) 0.86

Kernel-PCA data integration with enhanced interpretability. BMC Syst Biol (2014) 0.85

ISCB: past-present perspective for the International Society for Computational Biology. Bioinformatics (2014) 0.83

Use of prior knowledge for the analysis of high-throughput transcriptomics and metabolomics data. BMC Syst Biol (2014) 0.81

Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data. BMC Syst Biol (2014) 0.80

A diVIsive Shuffling Approach (VIStA) for gene expression analysis to identify subtypes in Chronic Obstructive Pulmonary Disease. BMC Syst Biol (2014) 0.80

The common ground of genomics and systems biology. BMC Syst Biol (2014) 0.80

Alzheimer's disease: From big data to mechanism. Nature (2013) 0.79

Review of the selected proceedings of the Fifth International Workshop on Data Integration in the Life Sciences 2008. BMC Bioinformatics (2008) 0.77

Articles by these authors

Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci U S A (2005) 15.46

High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res (2008) 13.51

Chromatin signatures of pluripotent cell lines. Nat Cell Biol (2006) 12.88

The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Res (2004) 8.79

Loss of acetylation at Lys16 and trimethylation at Lys20 of histone H4 is a common hallmark of human cancer. Nat Genet (2005) 8.45

Cohesins functionally associate with CTCF on mammalian chromosome arms. Cell (2008) 7.80

Differential expression in RNA-seq: a matter of depth. Genome Res (2011) 7.13

An atlas of combinatorial transcriptional regulation in mouse and man. Cell (2010) 6.24

The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line. Nat Genet (2009) 6.02

T cell receptor signaling controls Foxp3 expression via PI3K, Akt, and mTOR. Proc Natl Acad Sci U S A (2008) 5.97

Cohesins form chromosomal cis-interactions at the developmentally regulated IFNG locus. Nature (2009) 5.96

A role for Dicer in immune regulation. J Exp Med (2006) 5.81

Genetic unmasking of an epigenetically silenced microRNA in human cancer cells. Cancer Res (2007) 5.62

Dicer ablation affects antibody diversity and cell survival in the B lymphocyte lineage. Cell (2008) 5.53

Bone progenitor dysfunction induces myelodysplasia and secondary leukaemia. Nature (2010) 5.43

Reverse engineering gene networks using singular value decomposition and robust regression. Proc Natl Acad Sci U S A (2002) 5.29

T cell lineage choice and differentiation in the absence of the RNase III enzyme Dicer. J Exp Med (2005) 5.26

A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics (2012) 4.95

Snail mediates E-cadherin repression by the recruitment of the Sin3A/histone deacetylase 1 (HDAC1)/HDAC2 complex. Mol Cell Biol (2004) 4.15

Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. Genome Res (2009) 3.81

Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics (2012) 3.62

Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling. Nucleic Acids Res (2010) 3.36

Dicer-dependent pathways regulate chondrocyte proliferation and differentiation. Proc Natl Acad Sci U S A (2008) 3.10

Dicer-dependent endothelial microRNAs are necessary for postnatal angiogenesis. Proc Natl Acad Sci U S A (2008) 3.03

Podocyte-selective deletion of dicer induces proteinuria and glomerulosclerosis. J Am Soc Nephrol (2008) 2.74

Dynamic assembly of silent chromatin during thymocyte maturation. Nat Genet (2004) 2.68

Ikaros DNA-binding proteins as integral components of B cell developmental-stage-specific regulatory circuits. Immunity (2007) 2.65

A role for cohesin in T-cell-receptor rearrangement and thymocyte differentiation. Nature (2011) 2.58

Jarid2 is a PRC2 component in embryonic stem cells required for multi-lineage differentiation and recruitment of PRC1 and RNA Polymerase II to developmental regulators. Nat Cell Biol (2010) 2.55

Notch signaling is essential for ventricular chamber development. Dev Cell (2007) 2.53

MicroRNA miR-125a controls hematopoietic stem cell number. Proc Natl Acad Sci U S A (2010) 2.53

The affinity of different MBD proteins for a specific methylated locus depends on their intrinsic binding properties. Nucleic Acids Res (2003) 2.41

Allele-specific histone lysine methylation marks regulatory regions at imprinted mouse genes. EMBO J (2002) 2.40

Neural induction promotes large-scale chromatin reorganisation of the Mash1 locus. J Cell Sci (2006) 2.37

A truncating mutation of HDAC2 in human cancers confers resistance to histone deacetylase inhibition. Nat Genet (2006) 2.33

An evaluation of analysis pipelines for DNA methylation profiling using the Illumina HumanMethylation450 BeadChip platform. Epigenetics (2013) 2.28

The EMBRACE web service collection. Nucleic Acids Res (2010) 2.26

Heritable gene silencing in lymphocytes delays chromatid resolution without affecting the timing of DNA replication. Nat Cell Biol (2003) 2.21

Genetic analysis of p38 MAP kinases in myogenesis: fundamental role of p38alpha in abrogating myoblast proliferation. EMBO J (2007) 2.21

PDB_REDO: automated re-refinement of X-ray structure models in the PDB. J Appl Crystallogr (2009) 2.19

Phenotypic and functional characterisation of the luminal cell hierarchy of the mammary gland. Breast Cancer Res (2012) 2.18

A DNA methylation fingerprint of 1628 human samples. Genome Res (2011) 2.16

Different roles for Tet1 and Tet2 proteins in reprogramming-mediated erasure of imprints induced by EGC fusion. Mol Cell (2013) 2.10

A robust and highly efficient immune cell reprogramming system. Cell Stem Cell (2009) 2.06

A 1 Mb minimal amplicon at 8p11-12 in breast cancer identifies new candidate oncogenes. Oncogene (2005) 2.05

Cohesin-based chromatin interactions enable regulated gene expression within preexisting architectural compartments. Genome Res (2013) 2.05

Heterokaryon-based reprogramming of human B lymphocytes for pluripotency requires Oct4 but not Sox2. PLoS Genet (2008) 2.01

ESCs require PRC2 to direct the successful reprogramming of differentiated cells toward pluripotency. Cell Stem Cell (2010) 1.98

New insights into the biology and origin of mature aggressive B-cell lymphomas by combined epigenomic, genomic, and transcriptional profiling. Blood (2008) 1.97

Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science (2015) 1.91

GEPAS, a web-based tool for microarray data analysis and interpretation. Nucleic Acids Res (2008) 1.89

The dynamic DNA methylomes of double-stranded DNA viruses associated with human cancer. Genome Res (2009) 1.87

DNA methylation polymorphisms precede any histological sign of atherosclerosis in mice lacking apolipoprotein E. J Biol Chem (2004) 1.82

MicroRNA loss enhances learning and memory in mice. J Neurosci (2010) 1.82

Methyl-DNA immunoprecipitation (MeDIP): hunting down the DNA methylome. Biotechniques (2008) 1.81

Filamentous fungi as cell factories for heterologous protein production. Trends Biotechnol (2002) 1.77

Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance. BMC Genomics (2007) 1.77

E47 phosphorylation by p38 MAPK promotes MyoD/E47 association and muscle-specific gene transcription. EMBO J (2005) 1.76

Nuclear repositioning marks the selective exclusion of lineage-inappropriate transcription factor loci during T helper cell differentiation. Eur J Immunol (2004) 1.76

Discovery of epigenetically silenced genes by methylated DNA immunoprecipitation in colon cancer cells. Cancer Res (2007) 1.71

Genome-wide DNA methylation analysis of archival formalin-fixed paraffin-embedded tissue using the Illumina Infinium HumanMethylation27 BeadChip. Methods (2010) 1.71

Initial genomics of the human nucleolus. PLoS Genet (2010) 1.71

Human DNA methyltransferase 1 is required for maintenance of the histone H3 modification pattern. J Biol Chem (2004) 1.64

Mechanism for top-down control of working memory capacity. Proc Natl Acad Sci U S A (2009) 1.63

Immunomodulatory effect of 5-azacytidine (5-azaC): potential role in the transplantation setting. Blood (2009) 1.62

Recommendations for biomarker identification and qualification in clinical proteomics. Sci Transl Med (2010) 1.59

SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters. Nucleic Acids Res (2009) 1.58

Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol (2015) 1.58

Hairless-mediated repression of notch target genes requires the combined activity of Groucho and CtBP corepressors. Mol Cell Biol (2005) 1.55

B2G-FAR, a species-centered GO annotation repository. Bioinformatics (2011) 1.53

Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments. Nucleic Acids Res (2008) 1.53

Genome-wide identification of Ikaros targets elucidates its contribution to mouse B-cell lineage specification and pre-B-cell differentiation. Blood (2013) 1.51

Cohesin at active genes: a unifying theme for cohesin and gene expression from model organisms to humans. Curr Opin Cell Biol (2013) 1.51

Discovering gene expression patterns in time course microarray experiments by ANOVA-SCA. Bioinformatics (2007) 1.50

Histone hypomethylation is an indicator of epigenetic plasticity in quiescent lymphocytes. EMBO J (2004) 1.49

The impact of chromatin modifiers on the timing of locus replication in mouse embryonic stem cells. Genome Biol (2007) 1.49

A large duplication associated with dominant white color in pigs originated by homologous recombination between LINE elements flanking KIT. Mamm Genome (2002) 1.49

CSL-MAML-dependent Notch1 signaling controls T lineage-specific IL-7Rα gene expression in early human thymopoiesis and leukemia. J Exp Med (2009) 1.47

A novel unstable duplication upstream of HAS2 predisposes to a breed-defining skin phenotype and a periodic fever syndrome in Chinese Shar-Pei dogs. PLoS Genet (2011) 1.45

Best practices in bioinformatics training for life scientists. Brief Bioinform (2013) 1.45

Runx proteins regulate Foxp3 expression. J Exp Med (2009) 1.45

A mouse skin multistage carcinogenesis model reflects the aberrant DNA methylation patterns of human tumors. Cancer Res (2004) 1.44

Tet2 facilitates the derepression of myeloid target genes during CEBPα-induced transdifferentiation of pre-B cells. Mol Cell (2012) 1.43

Epigenetic disruption of ribosomal RNA genes and nucleolar architecture in DNA methyltransferase 1 (Dnmt1) deficient cells. Nucleic Acids Res (2007) 1.42

The origin of the 'Mycoplasma mycoides cluster' coincides with domestication of ruminants. PLoS One (2012) 1.42

Dicer-dependent microRNA pathway controls invariant NKT cell development. J Immunol (2009) 1.41

Paintomics: a web based tool for the joint visualization of transcriptomics and metabolomics data. Bioinformatics (2010) 1.41

Brain activity related to working memory and distraction in children and adults. Cereb Cortex (2006) 1.40

A dynamic switch in the replication timing of key regulator genes in embryonic stem cells upon neural induction. Cell Cycle (2004) 1.39

Fine scale mapping of the breast cancer 16q12 locus. Hum Mol Genet (2010) 1.39

Human C-reactive protein slows atherosclerosis development in a mouse model with human-like hypercholesterolemia. Proc Natl Acad Sci U S A (2007) 1.37

Is REST required for ESC pluripotency? Nature (2009) 1.34

Multi-organ expression profiling uncovers a gene module in coronary artery disease involving transendothelial migration of leukocytes and LIM domain binding 2: the Stockholm Atherosclerosis Gene Expression (STAGE) study. PLoS Genet (2009) 1.34

Transcriptional response of Citrus aurantifolia to infection by Citrus tristeza virus. Virology (2007) 1.32

Dicer regulates Xist promoter methylation in ES cells indirectly through transcriptional control of Dnmt3a. Epigenetics Chromatin (2008) 1.30

A profile of methyl-CpG binding domain protein occupancy of hypermethylated promoter CpG islands of tumor suppressor genes in human cancer. Cancer Res (2006) 1.29

Small RNAs control sodium channel expression, nociceptor excitability, and pain thresholds. J Neurosci (2010) 1.28

Stronger synaptic connectivity as a mechanism behind development of working memory-related brain activity during childhood. J Cogn Neurosci (2007) 1.28