conclusions from observational clinical data George Hripcsak, MD, - PowerPoint PPT Presentation

Biomedical Informatics discovery and impact OHDSI: Drawing reproducible conclusions from observational clinical data George Hripcsak, MD, MS Biomedical Informatics, Columbia University Medical Informatics Services, NewYork-Presbyterian

Drawing reproducible conclusions August2010: “Among patients in the UK General Practice Research Database , the use of oral bisphosphonates was not significantly Sept2010: “In this large nested case -control associated with incident esophageal or gastric study within a UK cohort [ General Practice cancer” Research Database ], we found a significantly increased risk of oesophageal cancer in people with previous prescriptions for oral bisphosphonates ”

Observational Health Data Sciences and Informatics (OHDSI, as “Odyssey”) Mission: To improve health by empowering a community to collaboratively generate the evidence that promotes better health decisions and better care A multi-stakeholder, interdisciplinary, international collaborative with a coordinating center at Columbia University Aiming for 1,000,000,000 patient data network http://ohdsi.org

OHDSI’s global research community • >200 collaborators from 25 different countries • Experts in informatics, statistics, epidemiology, clinical sciences • Active participation from academia, government, industry, providers • Over a billion records on >400 million patients in 80 databases http://ohdsi.org/who-we-are/collaborators/

Why large-scale analysis is needed in healthcare All health outcomes of interest All drugs

Patient-level predictions for personalized evidence requires big data 2 million patients seem excessive or unnecessary? • Imagine a provider wants to compare her patient with other patients with the same gender (50%), in the same 10-year age group (10%), and with the same comorbidity of Type 2 diabetes (5%) • Imagine the patient is concerned about the risk of ketoacidosis (0.5%) associated with two alternative treatments they are considering • With 2 million patients, you’d only expect to observe 25 similar patients with the event, and would only be powered to observe a relative risk > 2.0 Aggregated data across a health system of 1,000 providers may contain 2,000,000 patients

OHDSI’s approach to open science Data + Analytics + Domain expertise Open Generate science evidence Enable users Open to do source something software • Open science is about sharing the journey to evidence generation • Open- source software can be part of the journey, but it’s not a final destination • Open processes can enhance the journey through improved reproducibility of research and expanded adoption of scientific best practices

Evidence OHDSI seeks to generate from observational data • Clinical characterization – Natural history: Who has diabetes, and who takes metformin? – Quality improvement: What proportion of patients with diabetes experience complications? • Population-level estimation – Safety surveillance: Does metformin cause lactic acidosis? – Comparative effectiveness: Does metformin cause lactic acidosis more than glyburide? • Patient-level prediction – Precision medicine: Given everything you know about me, if I take metformin, what is the chance I will get lactic acidosis? – Disease interception: Given everything you know about me, what is the chance I will develop diabetes?

How OHDSI Works OHDSI Coordinating Center Source data Standardized, de- Data Analytics Research and warehouse, with identified patient- ETL network development education identifiable level database support and testing patient-level data (OMOP CDM v5) Standardized large-scale OHDSI.org analytics Summary Experiment Analogy Strength Plausibility statistics results Analysis Temporality Consistency Coherence repository Biological gradient Specificity results Comparative effectiveness Predictive modeling OHDSI Data Partners

Deep information model OMOP CDM v5 Person Standardized health system data Standardized meta-data Observation_period Location Care_site CDM_source Specimen Provider Concept Death Payer_plan_period health economics Vocabulary Standardized clinical data Standardized Visit_occurrence Domain Standardized vocabularies Procedure _ occurrence Concept_class Cost Concept_relationship Drug_exposure Relationship Device_exposure Concept_synonym Cohort derived elements Concept_ancestor Condition_occurrence Standardized Cohort_attribut Source_to_concept_ma Measurement Condition_era Drug_strength Note Drug_era Cohort_definition Observation Dose_era Attribute_definition Fact_relationship

Extensive vocabularies

Preparing your data for analysis Patient-level Patient-level ETL ETL ETL test data in source data in design implement system/ schema OMOP CDM WhiteRabbit : ATHENA : CDM : ACHILLES : profile your standardized DDL, index, profile your OHDSI tools built to help source data vocabularies constraints for CDM data; for all CDM Oracle, SQL review data domains Server, quality RabbitInAHat : PostgresQL; assessment; map your source Usagi : Vocabulary tables explore structure to map your with loading population- CDM tables and source codes scripts level summaries fields to CDM vocabulary OHDSI Forums : Public discussions for OMOP CDM Implementers/developers http://github.com/OHDSI

ACHILLES Heel Data Validation

ATLAS to build, visualize, and analyze cohorts

Characterize the cohorts of interest

OHDSI in Action

Treatment Pathways Global stakeholders Conduits Local stakeholders Public Social media Evidence Family Lay press RCT, Obs Academics Literature Patient Guidelines Industry Clinician Advertising Regulator Formulary Consultant Labels Inputs Indication Feasibility Cost Preference

OHDSI participating data partners Abbre- Name Description Population, viation millions AUSOM Ajou University School of Medicine South Korea; inpatient hospital 2 EHR CCAE MarketScan Commercial Claims and Encounters US private-payer claims 119 CPRD UK Clinical Practice Research Datalink UK; EHR from general practice 11 CUMC Columbia University Medical Center US; inpatient EHR 4 GE GE Centricity US; outpatient EHR 33 INPC Regenstrief Institute, Indiana Network for US; integrated health exchange 15 Patient Care JMDC Japan Medical Data Center Japan; private-payer claims 3 MDCD MarketScan Medicaid Multi-State US; public-payer claims 17 MarketScan Medicare Supplemental and US; private and public-payer MDCR 9 Coordination of Benefits claims OPTUM Optum ClinFormatics US; private-payer claims 40 STRIDE Stanford Translational Research Integrated US; inpatient EHR 2 Database Environment HKU Hong Kong University Hong Kong; EHR 1

Treatment pathway event flow

Proceedings of the National Academy of Sciences, 2016

Treatment pathways for diabetes T2DM : All databases Only drug First drug Second drug

Population-level heterogeneity across systems, and patient-level heterogeneity within systems MDCD CUMC Depression CCAE Hypertension Type 2 Diabetes Mellitus INPC GE CPRD MDCR OPTUM JMDC

Patient-level heterogeneity HTN: All databases 25% of HTN patients (10% of others) have a unique path despite 250M pop

Monotherapy – diabetes 1 0.9 0.8 General 0.7 upward trend 0.6 in 0.5 0.4 monotherapy 0.3 0.2 0.1 0 1989 1994 1999 2004 2009 AUSOM (SKorea*) CCAE (US#) CPRD (UK*) CUMC (US*) GE (US*) INPC (US*#) JMDC (Japan#) MDCD (US#) MDCR (US#) OPTUM (US#) STRIDE (US*)

Monotherapy – HTN 1 0.9 Academic 0.8 medical 0.7 centers 0.6 differ from 0.5 0.4 general 0.3 practices 0.2 0.1 0 1989 1994 1999 2004 2009 AUSOM (SKorea*) CCAE (US#) CPRD (UK*) CUMC (US*) GE (US*) INPC (US*#) JMDC (Japan#) MDCD (US#) MDCR (US#) OPTUM (US#) STRIDE (US*)

Monotherapy – diabetes 1 0.9 General 0.8 practices, 0.7 whether 0.6 EHR or 0.5 0.4 claims, have 0.3 similar 0.2 profiles 0.1 0 1989 1994 1999 2004 2009 AUSOM (SKorea*) CCAE (US#) CPRD (UK*) CUMC (US*) GE (US*) INPC (US*#) JMDC (Japan#) MDCD (US#) MDCR (US#) OPTUM (US#) STRIDE (US*)

Conclusions: Network research • It is feasible to encode the world population in a single data model – Over 1,000,000,000 records by voluntary effort • Generating evidence is feasible • Stakeholders willing to share results • Able to accommodate vast differences in privacy and research regulation

Open science • Admit that there is a problem • Study it scientifically – Define that surface and differentiate true variation from confounding … • Total description of every study • Research into new methods

conclusions from observational clinical data George Hripcsak, MD, - PowerPoint PPT Presentation

Biomedical Informatics discovery and impact OHDSI: Drawing reproducible conclusions from observational clinical data George Hripcsak, MD, MS Biomedical Informatics, Columbia University Medical Informatics Services, NewYork-Presbyterian

Observational Methods and NATM NATM System for Observational approach to tunnel design Eurocode

Lecture 6/Chapters 5&6 Observational Studies & Review Advantages of Observational

Observational Constraints of Observational Constraints of the Epoch of the Epoch of

Observational and Numerical Observational and Numerical Study of Ocean Dynamics over Study of

Chapter 2: Observational Studies In an observational study the subjects determine whether

Observational studies and experiments Introduction to Data Types of studies Observational

Lecture 6/Chapters 5&6 backward in time, about the past. Observational Studies & Review

Observational Signatures of Strange Stars Curran D. Muhlberger Department of Physics, Cornell

to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018 Institute for

Operational criteria of causality for observational road safety evaluation studies TRB-paper

Challenges Facing the Programmer in Observational Research Laurence Carpenter, Amgen PhUSE,

Introduction to Observational Studies Deborah Friedman, MD, MPH University of Texas Southwestern

Abstract Claims coming from human medical observational studies, when tested rigorously, most

CoLAboraTive Observational Research ( TRANSLATOR ) CTSA OneHealth Alliance (COHA) standards based

Observational Cosmology (C. Porciani / K. Basu) Lectures 2 + 4 The Cosmic Microwave Background

Statin Use and Risk of Breast Cancer: A Meta-analysis of Observational Studies Krishna Undela

Clinical Study Design: From Pilot to Randomized Controlled Trials Thomas C. Hulsey, MSPH, Sc.D.

DESIGN IN PRACTICE An experience in estimating the relationship between smoking and cancer

Is proton pump inhibitor use associated with risk of Alzheimers disease? Sirpa Hartikainen

INTRODUCTION TO GENETIC EPIDEMIOLOGY (EPID0754) Prof. Dr. Dr. K. Van Steen Introduction to

Penn MEASURE M esothelioma and the E nvironment in A mbler S tudy: U nderstanding R isk from E

Reviewing Drowning Deaths: Key Questions and Prevention Resources Tuesday, May 19, 2020 2:00 PM

Collaboration (IFSAC): Moving Forward Together IFSAC Webinar Presented by, Christopher Braden,

S01 - Logistic Regression STAT 401 (Engineering) - Iowa State University April 23, 2018

conclusions from observational clinical data George Hripcsak, MD, - PowerPoint PPT Presentation

Biomedical Informatics discovery and impact OHDSI: Drawing reproducible conclusions from observational clinical data George Hripcsak, MD, MS Biomedical Informatics, Columbia University Medical Informatics Services, NewYork-Presbyterian

Observational Methods and NATM NATM System for Observational approach to tunnel design Eurocode

Lecture 6/Chapters 5&amp;6 Observational Studies &amp; Review Advantages of Observational

Observational Constraints of Observational Constraints of the Epoch of the Epoch of

Observational and Numerical Observational and Numerical Study of Ocean Dynamics over Study of

Chapter 2: Observational Studies In an observational study the subjects determine whether

Observational studies and experiments Introduction to Data Types of studies Observational

Lecture 6/Chapters 5&amp;6 backward in time, about the past. Observational Studies &amp; Review

Observational Signatures of Strange Stars Curran D. Muhlberger Department of Physics, Cornell

to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018 Institute for

Operational criteria of causality for observational road safety evaluation studies TRB-paper

Challenges Facing the Programmer in Observational Research Laurence Carpenter, Amgen PhUSE,

Introduction to Observational Studies Deborah Friedman, MD, MPH University of Texas Southwestern

Abstract Claims coming from human medical observational studies, when tested rigorously, most

CoLAboraTive Observational Research ( TRANSLATOR ) CTSA OneHealth Alliance (COHA) standards based

Observational Cosmology (C. Porciani / K. Basu) Lectures 2 + 4 The Cosmic Microwave Background

Statin Use and Risk of Breast Cancer: A Meta-analysis of Observational Studies Krishna Undela

Clinical Study Design: From Pilot to Randomized Controlled Trials Thomas C. Hulsey, MSPH, Sc.D.

DESIGN IN PRACTICE An experience in estimating the relationship between smoking and cancer

Is proton pump inhibitor use associated with risk of Alzheimers disease? Sirpa Hartikainen

INTRODUCTION TO GENETIC EPIDEMIOLOGY (EPID0754) Prof. Dr. Dr. K. Van Steen Introduction to

Penn MEASURE M esothelioma and the E nvironment in A mbler S tudy: U nderstanding R isk from E

Reviewing Drowning Deaths: Key Questions and Prevention Resources Tuesday, May 19, 2020 2:00 PM

Collaboration (IFSAC): Moving Forward Together IFSAC Webinar Presented by, Christopher Braden,

S01 - Logistic Regression STAT 401 (Engineering) - Iowa State University April 23, 2018

Lecture 6/Chapters 5&6 Observational Studies & Review Advantages of Observational

Lecture 6/Chapters 5&6 backward in time, about the past. Observational Studies & Review