Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: a novel informatics approach.

PubWeight™: 1.71‹?› | Rank: Top 3%

🔗 View Article (PMC 3665760)

Published in Inflamm Bowel Dis on June 01, 2013

Authors

Ashwin N Ananthakrishnan1, Tianxi Cai, Guergana Savova, Su-Chun Cheng, Pei Chen, Raul Guzman Perez, Vivian S Gainer, Shawn N Murphy, Peter Szolovits, Zongqi Xia, Stanley Shaw, Susanne Churchill, Elizabeth W Karlson, Isaac Kohane, Robert M Plenge, Katherine P Liao

Author Affiliations

1: Gastrointestinal Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA. aananthakrishnan@partners.org

Articles citing this

Association between reduced plasma 25-hydroxy vitamin D and increased risk of cancer in patients with inflammatory bowel diseases. Clin Gastroenterol Hepatol (2013) 3.03

Colonoscopy is associated with a reduced risk for colon cancer and mortality in patients with inflammatory bowel diseases. Clin Gastroenterol Hepatol (2014) 2.27

Diabetes and the risk of infections with immunomodulator therapy in inflammatory bowel diseases. Aliment Pharmacol Ther (2015) 2.02

Psychiatric co-morbidity is associated with increased risk of surgery in Crohn's disease. Aliment Pharmacol Ther (2013) 1.64

Normalization of plasma 25-hydroxy vitamin D is associated with reduced risk of surgery in Crohn's disease. Inflamm Bowel Dis (2013) 1.60

Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources. J Am Med Inform Assoc (2015) 1.19

Modeling disease severity in multiple sclerosis using electronic health records. PLoS One (2013) 1.06

Development of phenotype algorithms using electronic medical records and incorporating natural language processing. BMJ (2015) 1.03

Extracting information from the text of electronic medical records to improve case detection: a systematic review. J Am Med Inform Assoc (2016) 0.99

Serum inflammatory markers and risk of colorectal cancer in patients with inflammatory bowel diseases. Clin Gastroenterol Hepatol (2014) 0.96

Higher plasma vitamin D is associated with reduced risk of Clostridium difficile infection in patients with inflammatory bowel diseases. Aliment Pharmacol Ther (2014) 0.95

Automated extraction of clinical traits of multiple sclerosis in electronic medical records. J Am Med Inform Assoc (2013) 0.92

Mortality and extraintestinal cancers in patients with primary sclerosing cholangitis and inflammatory bowel disease. J Crohns Colitis (2014) 0.91

Methods to Develop an Electronic Medical Record Phenotype Algorithm to Compare the Risk of Coronary Artery Disease across 3 Chronic Disease Cohorts. PLoS One (2015) 0.89

Identification of Nonresponse to Treatment Using Narrative Data in an Electronic Health Record Inflammatory Bowel Disease Cohort. Inflamm Bowel Dis (2016) 0.89

Improving the power of genetic association tests with imperfect phenotype derived from electronic medical records. Hum Genet (2014) 0.89

Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis. J Am Med Inform Assoc (2015) 0.87

Similar risk of depression and anxiety following surgery or hospitalization for Crohn's disease and ulcerative colitis. Am J Gastroenterol (2013) 0.87

Evaluation of matched control algorithms in EHR-based phenotyping studies: a case study of inflammatory bowel disease comorbidities. J Biomed Inform (2014) 0.87

Electronic Health Record Based Algorithm to Identify Patients with Autism Spectrum Disorder. PLoS One (2016) 0.85

Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States. Ann N Y Acad Sci (2016) 0.83

Deeper, longer phenotyping to accelerate the discovery of the genetic architectures of diseases. Genome Biol (2014) 0.82

Thromboprophylaxis is associated with reduced post-hospitalization venous thromboembolic events in patients with inflammatory bowel diseases. Clin Gastroenterol Hepatol (2014) 0.82

Common Genetic Variants Influence Circulating Vitamin D Levels in Inflammatory Bowel Diseases. Inflamm Bowel Dis (2015) 0.81

Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease in the Electronic Medical Record. Dig Dis Sci (2015) 0.80

Identification of subjects with polycystic ovary syndrome using electronic health records. Reprod Biol Endocrinol (2015) 0.80

Patient Electronic Health Records as a Means to Approach Genetic Research in Gastroenterology. Gastroenterology (2015) 0.78

An autism case history to review the systematic analysis of large-scale data to refine the diagnosis and treatment of neuropsychiatric disorders. Biol Psychiatry (2014) 0.78

Mode of childbirth and long-term outcomes in women with inflammatory bowel diseases. Dig Dis Sci (2014) 0.78

Statin Use Is Associated With Reduced Risk of Colorectal Cancer in Patients With Inflammatory Bowel Diseases. Clin Gastroenterol Hepatol (2016) 0.77

Comparative Effectiveness of Infliximab and Adalimumab in Crohn's Disease and Ulcerative Colitis. Inflamm Bowel Dis (2016) 0.77

Using Electronic Medical Record to Identify Patients With Dyslipidemia in Primary Care Settings: International Classification of Disease Code Matters From One Region to a National Database. Biomed Inform Insights (2017) 0.75

Open Source Clinical NLP - More than Any Single System. AMIA Jt Summits Transl Sci Proc (2014) 0.75

Semi-supervised Learning for Phenotyping Tasks. AMIA Annu Symp Proc (2015) 0.75

Accounting for misclassification in electronic health records-derived exposures using generalized linear finite mixture models. Health Serv Outcomes Res Methodol (2016) 0.75

Articles cited by this

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J Am Med Inform Assoc (2010) 13.18

Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system. BMC Med Inform Decis Mak (2006) 9.98

MedEx: a medication information extraction system for clinical narratives. J Am Med Inform Assoc (2010) 6.01

Electronic medical records for genetic research: results of the eMERGE consortium. Sci Transl Med (2011) 5.82

Automated identification of postoperative complications within an electronic medical record using natural language processing. JAMA (2011) 5.40

Crohn's disease in Olmsted County, Minnesota, 1940-1993: incidence, prevalence, and survival. Gastroenterology (1998) 4.52

Electronic medical records for discovery research in rheumatoid arthritis. Arthritis Care Res (Hoboken) (2010) 4.15

Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet (2011) 3.85

Using electronic health records to drive discovery in disease genomics. Nat Rev Genet (2011) 3.80

Automated detection of adverse events using natural language processing of discharge summaries. J Am Med Inform Assoc (2005) 3.77

Extracting findings from narrative reports: software transferability and sources of physician disagreement. Methods Inf Med (1998) 3.67

Epidemiology of Crohn's disease and ulcerative colitis in a central Canadian province: a population-based study. Am J Epidemiol (1999) 3.34

Portability of an algorithm to identify rheumatoid arthritis in electronic health records. J Am Med Inform Assoc (2012) 2.94

Ulcerative colitis in Olmsted County, Minnesota, 1940-1993: incidence, prevalence, and survival. Gut (2000) 2.86

Increasing incidence of paediatric inflammatory bowel disease in Ontario, Canada: evidence from health administrative data. Gut (2009) 2.62

Microscopic activity in ulcerative colitis: what does it mean? Gut (1991) 2.43

Genetic basis of autoantibody positive and negative rheumatoid arthritis risk in a multi-ethnic cohort derived from electronic health records. Am J Hum Genet (2011) 2.27

Estimation of the period prevalence of inflammatory bowel disease among nine health plans using computerized diagnoses and outpatient pharmacy dispensings. Inflamm Bowel Dis (2007) 2.03

Assessment of the diagnoses of Crohn's disease and ulcerative colitis in a Danish hospital information system. Scand J Gastroenterol (1996) 1.98

The promise of electronic records: around the corner or down the road? JAMA (2011) 1.58

Drug side effect extraction from clinical narratives of psychiatry and psychology patients. J Am Med Inform Assoc (2011) 1.56

Evaluation of Medical Problem Extraction from Electronic Clinical Documents Using MetaMap Transfer (MMTx). Stud Health Technol Inform (2005) 1.45

An analytical approach to characterize morbidity profile dissimilarity between distinct cohorts using electronic medical records. J Biomed Inform (2010) 1.42

Hospitalization, surgery, and readmission rates of IBD in Canada: a population-based study. Am J Gastroenterol (2006) 1.29

Validation of psoriatic arthritis diagnoses in electronic medical records using natural language processing. Semin Arthritis Rheum (2010) 1.14

Automated discovery of drug treatment patterns for endocrine therapy of breast cancer within an electronic medical record. J Am Med Inform Assoc (2011) 1.09

A nationwide analysis of changes in severity and outcomes of inflammatory bowel disease hospitalizations. J Gastrointest Surg (2010) 1.00

Validity of computerized diagnoses, procedures, and drugs for inflammatory bowel disease in a northern California managed care organization. Pharmacoepidemiol Drug Saf (2009) 0.97

Validity of administrative data for the diagnosis of primary sclerosing cholangitis: a population-based study. Liver Int (2011) 0.96

Modeling interventions to improve access to public health information. AMIA Annu Symp Proc (2003) 0.84

Articles by these authors

Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet (2006) 115.71

Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet (2011) 18.88

Coordinated reduction of genes of oxidative metabolism in humans with insulin resistance and diabetes: Potential role of PGC1 and NRF1. Proc Natl Acad Sci U S A (2003) 17.10

Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). J Am Med Inform Assoc (2010) 10.51

Identifying patient smoking status from medical discharge records. J Am Med Inform Assoc (2007) 10.35

Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system. BMC Med Inform Decis Mak (2006) 9.98

Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet (2010) 9.90

STAT4 and the risk of rheumatoid arthritis and systemic lupus erythematosus. N Engl J Med (2007) 9.80

The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science (2010) 9.61

Evaluating the state-of-the-art in automatic de-identification. J Am Med Inform Assoc (2007) 9.28

Gene regulation and DNA damage in the ageing human brain. Nature (2004) 9.01

Two independent alleles at 6q23 associated with risk of rheumatoid arthritis. Nat Genet (2007) 8.74

Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet (2009) 8.39

Common variants at CD40 and other loci confer risk of rheumatoid arthritis. Nat Genet (2008) 7.07

A common haplotype of interferon regulatory factor 5 (IRF5) regulates splicing and expression and is associated with increased risk of systemic lupus erythematosus. Nat Genet (2006) 6.98

Cardiovascular morbidity and mortality in women diagnosed with rheumatoid arthritis. Circulation (2003) 6.62

Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4. Am J Hum Genet (2005) 6.61

Purine-rich foods, dairy and protein intake, and the risk of gout in men. N Engl J Med (2004) 6.53

The Shared Health Research Information Network (SHRINE): a prototype federated query tool for clinical data repositories. J Am Med Inform Assoc (2009) 6.49

Genetic variants near TNFAIP3 on 6q23 are associated with systemic lupus erythematosus. Nat Genet (2008) 6.42

Analysis and application of European genetic substructure using 300 K SNP information. PLoS Genet (2008) 6.42

A mega-analysis of genome-wide association studies for major depressive disorder. Mol Psychiatry (2012) 6.34

Architecture of the open-source clinical research chart from Informatics for Integrating Biology and the Bedside. AMIA Annu Symp Proc (2007) 5.89

Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat Genet (2012) 5.11

Genetic analysis of human traits in vitro: drug response and gene expression in lymphoblastoid cell lines. PLoS Genet (2008) 5.08

Human disease classification in the postgenomic era: a complex systems approach to human pathobiology. Mol Syst Biol (2007) 5.02

Three functional variants of IFN regulatory factor 5 (IRF5) define risk and protective haplotypes for human lupus. Proc Natl Acad Sci U S A (2007) 4.71

High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat Genet (2012) 4.46

Initial evaluation of coronary images from 320-detector row computed tomography. Int J Cardiovasc Imaging (2008) 4.44

Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nat Genet (2012) 4.41

The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future. Genet Med (2013) 4.37

On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat Med (2011) 4.34

Electronic medical records for discovery research in rheumatoid arthritis. Arthritis Care Res (Hoboken) (2010) 4.15

Instrumenting the health care enterprise for discovery research in the genomic era. Genome Res (2009) 4.13

Alcohol intake and risk of incident gout in men: a prospective study. Lancet (2004) 4.12

Automated de-identification of free-text medical records. BMC Med Inform Decis Mak (2008) 3.99

Implementing electronic medical record systems in developing countries. Inform Prim Care (2005) 3.94

De novo copy number variants identify new genes and loci in isolated sporadic tetralogy of Fallot. Nat Genet (2009) 3.86

Joint effects of common genetic variants on the risk for type 2 diabetes in U.S. men and women of European ancestry. Ann Intern Med (2009) 3.68

Meta-analysis of genome-wide association studies in celiac disease and rheumatoid arthritis identifies fourteen non-HLA shared loci. PLoS Genet (2011) 3.68

Gene-gene and gene-environment interactions involving HLA-DRB1, PTPN22, and smoking in two subsets of rheumatoid arthritis. Am J Hum Genet (2007) 3.62

Defining the role of the MHC in autoimmunity: a review and pooled analysis. PLoS Genet (2008) 3.54

Genetic variants at CD28, PRDM1 and CD2/CD58 are associated with rheumatoid arthritis risk. Nat Genet (2009) 3.52

REL, encoding a member of the NF-kappaB family of transcription factors, is a newly defined risk locus for rheumatoid arthritis. Nat Genet (2009) 3.36

Investigation of candidate polymorphisms and disease activity in rheumatoid arthritis patients on methotrexate. Rheumatology (Oxford) (2009) 3.33

A genome-wide association study identifies susceptibility variants for type 2 diabetes in Han Chinese. PLoS Genet (2010) 3.25

Combining predictors for classification using the area under the receiver operating characteristic curve. Biometrics (2006) 3.12

Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model. J Biomed Inform (2008) 3.10

Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat Genet (2012) 3.04

Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: the SHARPn project. J Biomed Inform (2012) 2.97

Integration of genetic risk factors into a clinical algorithm for multiple sclerosis susceptibility: a weighted genetic risk score. Lancet Neurol (2009) 2.95

Portability of an algorithm to identify rheumatoid arthritis in electronic health records. J Am Med Inform Assoc (2012) 2.94

A de-identifier for medical discharge summaries. Artif Intell Med (2007) 2.92

Cigarette smoking and the risk of systemic lupus erythematosus: a meta-analysis. Arthritis Rheum (2004) 2.88

Obesity, weight change, hypertension, diuretic use, and risk of gout in men: the health professionals follow-up study. Arch Intern Med (2005) 2.80

Calculating the benefits of a Research Patient Data Repository. AMIA Annu Symp Proc (2006) 2.78

Integration of clinical and genetic data in the i2b2 architecture. AMIA Annu Symp Proc (2006) 2.74

Comparison of threshold cutpoints and continuous measures of anti-cyclic citrullinated peptide antibodies in predicting future rheumatoid arthritis. J Rheumatol (2009) 2.61

Smoking intensity, duration, and cessation, and the risk of rheumatoid arthritis in women. Am J Med (2006) 2.60

A translational engine at the national scale: informatics for integrating biology and the bedside. J Am Med Inform Assoc (2011) 2.55

The SMART Platform: early experience enabling substitutable applications for electronic health records. J Am Med Inform Assoc (2012) 2.48

Reproductive and menopausal factors and risk of systemic lupus erythematosus in women. Arthritis Rheum (2007) 2.44

CNTRO: A Semantic Web Ontology for Temporal Relation Inferencing in Clinical Narratives. AMIA Annu Symp Proc (2010) 2.44

X chromosome-inactivation patterns of 1,005 phenotypically unaffected females. Am J Hum Genet (2006) 2.33

Experience with etanercept in an academic medical center: are infection rates increased? Arthritis Rheum (2002) 2.32

Meta-analysis identifies nine new loci associated with rheumatoid arthritis in the Japanese population. Nat Genet (2012) 2.31

Genetic basis of autoantibody positive and negative rheumatoid arthritis risk in a multi-ethnic cohort derived from electronic health records. Am J Hum Genet (2011) 2.27

Systemic lupus erythematosus and the risk of cardiovascular disease: results from the nurses' health study. Arthritis Rheum (2009) 2.26

Comparison of patient- and clinician-collected anal cytology samples to screen for human papillomavirus-associated anal intraepithelial neoplasia in men who have sex with men. Ann Intern Med (2008) 2.22

C-reactive protein in the prediction of rheumatoid arthritis in women. Arch Intern Med (2006) 2.22

iTools: a framework for classification, categorization and integration of computational biology resources. PLoS One (2008) 2.18

Dietary intake of vitamin D during adolescence and risk of adult-onset systemic lupus erythematosus and rheumatoid arthritis. Arthritis Care Res (Hoboken) (2012) 2.13

Data for Genetic Analysis Workshop 16 Problem 1, association analysis of rheumatoid arthritis data. BMC Proc (2009) 2.12

Narrowing the phase window width in prospectively ECG-gated single heart beat 320-detector row coronary CT angiography. Int J Cardiovasc Imaging (2008) 2.09

Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence. Am J Epidemiol (2014) 2.09

Rheumatoid arthritis risk allele PTPRC is also associated with response to anti-tumor necrosis factor alpha therapy. Arthritis Rheum (2010) 2.08

Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia. Nat Genet (2006) 2.07