Applying active learning to high-throughput phenotyping algorithms for electronic health records data.

PubWeight™: 1.36‹?› | Rank: Top 10%

🔗 View Article (PMC 3861916)

Published in J Am Med Inform Assoc on July 13, 2013

Authors

Yukun Chen1, Robert J Carroll, Eugenia R McPeek Hinz, Anushi Shah, Anne E Eyler, Joshua C Denny, Hua Xu

Author Affiliations

1: Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, Tennessee, USA.

Articles citing this

Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. J Am Med Inform Assoc (2013) 1.85

Secondary use of clinical data: the Vanderbilt approach. J Biomed Inform (2014) 1.12

Desiderata for computable representations of electronic health records-driven phenotype algorithms. J Am Med Inform Assoc (2015) 1.04

Extracting information from the text of electronic medical records to improve case detection: a systematic review. J Am Med Inform Assoc (2016) 0.99

Identifying and mitigating biases in EHR laboratory tests. J Biomed Inform (2014) 0.95

Text classification for assisting moderators in online health communities. J Biomed Inform (2013) 0.95

Clinical research informatics and electronic health record data. Yearb Med Inform (2014) 0.93

TextHunter--A User Friendly Tool for Extracting Generic Concepts from Free Text in Clinical Research. AMIA Annu Symp Proc (2014) 0.92

Patient Stratification Using Electronic Health Records from a Chronic Disease Management Program. IEEE J Biomed Health Inform (2016) 0.91

A study of active learning methods for named entity recognition in clinical text. J Biomed Inform (2015) 0.89

Supervised machine learning and active learning in classification of radiology reports. J Am Med Inform Assoc (2014) 0.88

A systematic comparison of feature space effects on disease classifier performance for phenotype identification of five diseases. J Biomed Inform (2015) 0.85

Learning statistical models of phenotypes using noisy labeled training data. J Am Med Inform Assoc (2016) 0.83

Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning. Biomed Inform Insights (2016) 0.78

Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach. PLoS One (2015) 0.76

An Empirical Study for Impacts of Measurement Errors on EHR based Association Studies. AMIA Annu Symp Proc (2017) 0.75

Large-Scale Discovery of Disease-Disease and Disease-Gene Associations. Sci Rep (2016) 0.75

Clinical phenotyping in selected national networks: demonstrating the need for high-throughput, portable, and computational methods. Artif Intell Med (2016) 0.75

A Validated Phenotyping Algorithm for Genetic Association Studies in Age-related Macular Degeneration. Sci Rep (2015) 0.75

An active learning-enabled annotation system for clinical named entity recognition. BMC Med Inform Decis Mak (2017) 0.75

Effective Information Extraction Framework for Heterogeneous Clinical Reports Using Online Machine Learning and Controlled Vocabularies. JMIR Med Inform (2017) 0.75

PheKnow-Cloud: A Tool for Evaluating High-Throughput Phenotype Candidates using Online Medical Literature. AMIA Jt Summits Transl Sci Proc (2017) 0.75

Articles cited by this

Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A (2009) 54.68

A general natural-language text processor for clinical radiology. J Am Med Inform Assoc (1995) 29.81

Unlocking clinical data from narrative reports: a study of natural language processing. Ann Intern Med (1995) 14.15

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J Am Med Inform Assoc (2010) 13.18

The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med Genomics (2011) 9.20

Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform (2008) 8.09

Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet (2010) 6.44

MedEx: a medication information extraction system for clinical narratives. J Am Med Inform Assoc (2010) 6.01

Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. Radiology (2002) 5.52

"Understanding" medical school curriculum content using KnowledgeMap. J Am Med Inform Assoc (2003) 5.19

Perspectives for medical informatics. Reusing the electronic medical record for clinical research. Methods Inf Med (2009) 4.25

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study. J Am Med Inform Assoc (2011) 4.20

Electronic medical records for discovery research in rheumatoid arthritis. Arthritis Care Res (Hoboken) (2010) 4.15

Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet (2011) 3.85

Evaluation of a method to identify and categorize section headers in clinical documents. J Am Med Inform Assoc (2009) 3.80

Next-generation phenotyping of electronic health records. J Am Med Inform Assoc (2012) 3.41

Comparing ICD9-encoded diagnoses and NLP-processed discharge summaries for clinical trials pre-screening: a case study. AMIA Annu Symp Proc (2008) 3.00

Portability of an algorithm to identify rheumatoid arthritis in electronic health records. J Am Med Inform Assoc (2012) 2.94

Use of primary care electronic medical record database in drug efficacy research on cardiovascular outcomes: comparison of database and randomised controlled trial findings. BMJ (2009) 2.88

Genetic basis of autoantibody positive and negative rheumatoid arthritis risk in a multi-ethnic cohort derived from electronic health records. Am J Hum Genet (2011) 2.27

Chapter 13: Mining electronic health records in the genomics era. PLoS Comput Biol (2012) 2.00

Discovering peripheral arterial disease cases from radiology notes using natural language processing. AMIA Annu Symp Proc (2010) 1.86

Incidence rates, clinical profile, and outcomes of patients with venous thromboembolism. The Worcester VTE study. J Thromb Thrombolysis (2009) 1.71

Active learning with support vector machine applied to gene expression data for cancer classification. J Chem Inf Comput Sci (2004) 1.63

Naïve Electronic Health Record phenotype identification for Rheumatoid arthritis. AMIA Annu Symp Proc (2011) 1.56

Management of venous thromboembolism: a clinical practice guideline from the American College of Physicians and the American Academy of Family Physicians. Ann Fam Med (2007) 1.54

Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc (2011) 1.29

A randomized controlled trial of the accuracy of clinical record retrieval using SNOMED-RT as compared with ICD9-CM. Proc AMIA Symp (2001) 1.19

A high throughput semantic concept frequency based approach for patient identification: a case study using type 2 diabetes mellitus clinical notes. AMIA Annu Symp Proc (2010) 1.05

Articles by these authors

Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet (2010) 6.44

MedEx: a medication information extraction system for clinical narratives. J Am Med Inform Assoc (2010) 6.01

Electronic medical records for genetic research: results of the eMERGE consortium. Sci Transl Med (2011) 5.82

Inactivating mutations in NPC1L1 and protection from coronary heart disease. N Engl J Med (2014) 4.94

Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat Biotechnol (2013) 4.85

The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future. Genet Med (2013) 4.37

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study. J Am Med Inform Assoc (2011) 4.20

Return of individual research results from genome-wide association studies: experience of the Electronic Medical Records and Genomics (eMERGE) Network. Genet Med (2012) 3.29

Automated acquisition of disease drug knowledge from biomedical and clinical documents: an initial study. J Am Med Inform Assoc (2007) 3.09

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network. J Am Med Inform Assoc (2013) 3.00

Portability of an algorithm to identify rheumatoid arthritis in electronic health records. J Am Med Inform Assoc (2012) 2.94

Computational phenotype discovery using unsupervised feature learning over noisy, sparse, and irregular clinical data. PLoS One (2013) 2.81

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. J Am Med Inform Assoc (2011) 2.70

Data from clinical notes: a perspective on the tension between structure and flexible documentation. J Am Med Inform Assoc (2011) 2.57

Prospective derivation and validation of a clinical prediction rule for recurrent Clostridium difficile infection. Gastroenterology (2008) 2.50

The disclosure of diagnosis codes can breach research participants' privacy. J Am Med Inform Assoc (2010) 2.45

Bacteria penetrate the normally impenetrable inner colon mucus layer in both murine colitis models and patients with ulcerative colitis. Gut (2013) 2.32

An evaluation of the NQF Quality Data Model for representing Electronic Health Record driven phenotyping algorithms. AMIA Annu Symp Proc (2012) 2.29

Assessing the accuracy of observer-reported ancestry in a biorepository linked to electronic medical records. Genet Med (2010) 2.09

Genome- and phenome-wide analyses of cardiac conduction identifies markers of arrhythmia risk. Circulation (2013) 2.06

Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms. AMIA Annu Symp Proc (2011) 2.03

Extracting timing and status descriptors for colonoscopy testing from electronic medical records. J Am Med Inform Assoc (2010) 1.94

Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. J Am Med Inform Assoc (2013) 1.85

Importance of multi-modal approaches to effectively identify cataract cases from electronic health records. J Am Med Inform Assoc (2012) 1.79

Can graphene be used as a substrate for Raman enhancement? Nano Lett (2010) 1.78

Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin. J Am Med Inform Assoc (2011) 1.67

Predicting warfarin dosage in European-Americans and African-Americans using DNA samples linked to an electronic health record. Pharmacogenomics (2012) 1.66

Complement receptor 1 gene variants are associated with erythrocyte sedimentation rate. Am J Hum Genet (2011) 1.66

A hybrid system for temporal information extraction from clinical text. J Am Med Inform Assoc (2013) 1.63

LncRNAs expression signatures of renal clear cell carcinoma revealed by microarray. PLoS One (2012) 1.59

Naïve Electronic Health Record phenotype identification for Rheumatoid arthritis. AMIA Annu Symp Proc (2011) 1.56

A genome- and phenome-wide association study to identify genetic variants influencing platelet count and volume and their pleiotropic effects. Hum Genet (2013) 1.47

Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features. BMC Med Inform Decis Mak (2013) 1.46

Associations of autoantibodies, autoimmune risk alleles, and clinical diagnoses from the electronic medical records in rheumatoid arthritis cases and non-rheumatoid arthritis controls. Arthritis Rheum (2013) 1.43

Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs. J Am Med Inform Assoc (2012) 1.43

Knowledge-driven multi-locus analysis reveals gene-gene interactions influencing HDL cholesterol level in two independent EMR-linked biobanks. PLoS One (2011) 1.42

An analytical approach to characterize morbidity profile dissimilarity between distinct cohorts using electronic medical records. J Biomed Inform (2010) 1.42

Integrating existing natural language processing tools for medication extraction from discharge summaries. J Am Med Inform Assoc (2010) 1.41

Opioid use after cardiac surgery in children with Down syndrome. Pediatr Crit Care Med (2013) 1.40

A1 adenosine receptor knockout mice exhibit increased renal injury following ischemia and reperfusion. Am J Physiol Renal Physiol (2003) 1.38

Influence of source and drain contacts on the properties of indium-gallium-zinc-oxide thin-film transistors based on amorphous carbon nanofilm as barrier layer. ACS Appl Mater Interfaces (2015) 1.38

A flexible approach for highly multiplexed candidate gene targeted resequencing. PLoS One (2011) 1.35

Modulators of normal electrocardiographic intervals identified in a large electronic medical record. Heart Rhythm (2010) 1.35

Biobanks and electronic medical records: enabling cost-effective research. Sci Transl Med (2014) 1.32

Opportunities for genomic clinical decision support interventions. Genet Med (2013) 1.31

A study of transportability of an existing smoking status detection module across institutions. AMIA Annu Symp Proc (2012) 1.31

Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc (2011) 1.29

Development and evaluation of an ensemble resource linking medications to their indications. J Am Med Inform Assoc (2013) 1.28

Combined use of fasting plasma glucose and glycated hemoglobin A1c in the screening of diabetes and impaired glucose tolerance. Acta Diabetol (2009) 1.26

Intronic miR-301 feedback regulates its host gene, ska2, in A549 cells by targeting MEOX2 to affect ERK/CREB pathways. Biochem Biophys Res Commun (2010) 1.23

A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries. AMIA Annu Symp Proc (2012) 1.22

ICD-9 tobacco use codes are effective identifiers of smoking status. J Am Med Inform Assoc (2013) 1.22

The use of a DNA biobank linked to electronic medical records to characterize pharmacogenomic predictors of tacrolimus dose requirement in kidney transplant recipients. Pharmacogenet Genomics (2012) 1.21

Generating Clinical Notes for Electronic Health Record Systems. Appl Clin Inform (2010) 1.19

DNA methylation-regulated miR-193a-3p dictates resistance of hepatocellular carcinoma to 5-fluorouracil via repression of SRSF2 expression. J Biol Chem (2011) 1.19

[Study on 1389 men who have sex with men regarding their HIV high-risk behaviors and associated factors in mainland China in 2004]. Zhonghua Liu Xing Bing Xue Za Zhi (2007) 1.19

Applying active learning to assertion classification of concepts in clinical text. J Biomed Inform (2011) 1.18

Mapping the incidentalome: estimating incidental findings generated through clinical pharmacogenomics testing. Genet Med (2012) 1.17

The development of a naturalistic data collection system to perform critical incident analysis: an investigation of safety and fatigue issues in long-haul trucking. Accid Anal Prev (2006) 1.17

Applying active learning to supervised word sense disambiguation in MEDLINE. J Am Med Inform Assoc (2013) 1.17

Polymerization of MIP-1 chemokine (CCL3 and CCL4) and clearance of MIP-1 by insulin-degrading enzyme. EMBO J (2010) 1.16

Synthesis and in vitro antimycobacterial activity of B-ring modified diaryl ether InhA inhibitors. Bioorg Med Chem Lett (2008) 1.15

Combining use of a panel of ssDNA aptamers in the detection of Staphylococcus aureus. Nucleic Acids Res (2009) 1.14

A3 adenosine receptor knockout mice are protected against ischemia- and myoglobinuria-induced renal failure. Am J Physiol Renal Physiol (2002) 1.13

High density GWAS for LDL cholesterol in African Americans using electronic medical records reveals a strong protective variant in APOE. Clin Transl Sci (2012) 1.13

Interdisciplinary dialogue for education, collaboration, and innovation: intelligent Biology and Medicine in and beyond 2013. BMC Genomics (2013) 1.11

Recognition of medication information from discharge summaries using ensembles of classifiers. BMC Med Inform Decis Mak (2012) 1.10

Comparative analysis of pharmacovigilance methods in the detection of adverse drug reactions using electronic medical records. J Am Med Inform Assoc (2012) 1.10

Comparative study of SBP-box gene family in Arabidopsis and rice. Gene (2007) 1.09

Cathelicidin signaling via the Toll-like receptor protects against colitis in mice. Gastroenterology (2011) 1.09

Preterm and term cervical ripening in CD1 Mice (Mus musculus): similar or divergent molecular mechanisms? Biol Reprod (2009) 1.09

Development of inpatient risk stratification models of acute kidney injury for use in electronic health records. Med Decis Making (2010) 1.09

Teaching evidence-based medicine: Impact on students' literature use and inpatient clinical documentation. Med Teach (2011) 1.08

Using contextual and lexical features to restructure and validate the classification of biomedical concepts. BMC Bioinformatics (2007) 1.07

NOTCH2 in breast cancer: association of SNP rs11249433 with gene expression in ER-positive breast tumors without TP53 mutations. Mol Cancer (2010) 1.06

Metastatic tumor evolution and organoid modeling implicate TGFBR2 as a cancer driver in diffuse gastric cancer. Genome Biol (2014) 1.05

Substance P modulates colitis-associated fibrosis. Am J Pathol (2010) 1.05

Identification of an aptamer targeting hnRNP A1 by tissue slide-based SELEX. J Pathol (2009) 1.04

Detecting temporal expressions in medical narratives. Int J Med Inform (2012) 1.04

Development of a natural language processing system to identify timing and status of colonoscopy testing in electronic medical records. AMIA Annu Symp Proc (2009) 1.03

The role of tumor necrosis factor alpha in down-regulation of osteoblast Phex gene expression in experimental murine colitis. Gastroenterology (2006) 1.02

The tumor suppressive role of CAMK2N1 in castration-resistant prostate cancer. Oncotarget (2014) 1.01

Genomics in 2012: challenges and opportunities in the next generation sequencing era. BMC Genomics (2012) 1.01