Big Data Phenomics in the VA Mary Whooley MD Director, VA - - PowerPoint PPT Presentation

big data phenomics in the va
SMART_READER_LITE
LIVE PREVIEW

Big Data Phenomics in the VA Mary Whooley MD Director, VA - - PowerPoint PPT Presentation

Big Data Phenomics in the VA Mary Whooley MD Director, VA Measurement Science QUERI San Francisco VA Health Care System University of California, San Francisco Kelly Cho PhD MPH Phenomics Lead, Million Veteran Program VA Boston Health Care


slide-1
SLIDE 1

Mary Whooley MD Director, VA Measurement Science QUERI San Francisco VA Health Care System University of California, San Francisco Kelly Cho PhD MPH Phenomics Lead, Million Veteran Program VA Boston Health Care System Harvard Medical School Academy Health Annual Research Meeting June 27, 2017

Big Data Phenomics in the VA

slide-2
SLIDE 2

Outline

  • Importance of data standardization and interoperability
  • PCORnet and the Observational Medical Outcomes

Partnership (OMOP) Common Data Model

  • Million Veteran Program (use case)
  • Coding algorithms for computable phenotypes

2

slide-3
SLIDE 3

3

slide-4
SLIDE 4

4

Data entry Data coding Data analysis Data harmonization Data

  • rganization

Big Data are Messy

slide-5
SLIDE 5

VA Information Systems Technology Architecture (VistA)

5

VA hospitals and clinics

slide-6
SLIDE 6

Example: How can we identify uncontrolled diabetics?

slide-7
SLIDE 7

Logical Observation Identifiers Names and Codes

http://loinc.org

slide-8
SLIDE 8

Example: How can we identify uncontrolled diabetics?

slide-9
SLIDE 9

VA Corporate Data Warehouse Data Tables

9

slide-10
SLIDE 10

10

Data entry Data coding Data analysis Data harmonization Data

  • rganization

Big Data are Messy

slide-11
SLIDE 11

Outline

  • Importance of data standardization and interoperability
  • PCORnet and the Observational Medical Outcomes

Partnership (OMOP) Common Data Model

  • Million Veteran Program (use case)
  • Coding algorithms for computable phenotypes

11

slide-12
SLIDE 12

http://www.pcornet.org/

slide-13
SLIDE 13

13

slide-14
SLIDE 14

14

http://pscanner.ucsd.edu/

slide-15
SLIDE 15

15

http://pscanner.ucsd.edu/

slide-16
SLIDE 16

Abstract presented Nov 2015 Am Medical Informatics Assoc

2000 to present

  • 16 million unique patients
  • 11 million w/ at least one encounter
  • 5 million deaths
  • 3 billion procedures
  • 2.5 billion conditions
  • 973,000 providers
slide-17
SLIDE 17

Mapping to Observational Medical Outcomes Partnership (OMOP) Common Data Model Query using the same SQL code

SQL = Structured Query Language

slide-18
SLIDE 18

Observational Outcomes Partnership (OMOP) Common Data Model Implementations

18

> 600 million patients worldwide

slide-19
SLIDE 19

Outline

  • Importance of data standardization and interoperability
  • PCORnet and the Observational Medical Outcomes

Partnership (OMOP) Common Data Model

  • Million Veteran Program (use case)
  • Coding algorithms for computable phenotypes

19

slide-20
SLIDE 20

20

slide-21
SLIDE 21

Million Veteran Program (MVP)

  • National VA research initiative aiming to enroll one

million users of the VHA in an observational cohort

  • Over 500,000 patients already enrolled
  • Blood collection for genotyping and storage
  • Access to electronic medical record
  • Goal is to create database of genomic, military

exposure, lifestyle and electronic health information

slide-22
SLIDE 22

Currently enrolling at >50 VHA Facilities

22

Principal Investigators: John Concato MD MS MPH

  • J. Michael Gaziano MD MPH
slide-23
SLIDE 23

Genome-wide association study (GWAS): identify genotype(s) associated with specified phenotype

1 2 3 4 5 6 7 8 9 10 . . . . . . . . . . . . . . . . . . 22 23

Chromosome (genotype)

Strength of association with computable phenotype

slide-24
SLIDE 24

Genome-wide association study (GWAS): identify genotype(s) associated with specified phenotype

gene (on chromosome 6) linked with specified phenotype

1 2 3 4 5 6 7 8 9 10 . . . . . . . . . . . . . . . . . . 22 23

Chromosome (genotype)

Strength of association with computable phenotype

slide-25
SLIDE 25

Outline

  • Importance of data standardization and interoperability
  • PCORnet and the Observational Medical Outcomes

Partnership (OMOP) Common Data Model

  • Million Veteran Program (use case)
  • Coding algorithms for computable phenotypes

25

slide-26
SLIDE 26

What is a computable phenotype?

26

Unstructured data

  • Visit notes
  • Signs/symptoms
  • Smoking/alcohol
  • Employment
  • Radiology reports
  • Discharge summary
  • Pathology reports

Computable Phenotype

Structured data

  • ICD9/10 codes
  • CPT codes
  • Prescriptions
  • Lab results
  • Vital signs

Electronic Health Record

= +

slide-27
SLIDE 27

Phenotype Algorithms – https://phekb.org/phenotypes

Phenotype Methods Owner

Atrial Fibrillation

CPT Codes, ICD 9 Codes, Natural Language Processing Vanderbilt

Dementia

ICD 9 Codes, Medications eMERGE Univ Washington

Heart Failure

CPT, ICD 9 Codes, Labs, Meds, Natural Language Processing eMERGE Mayo

Coronary Disease

CPT Codes, ICD 9 Codes PCORI MidSouth CDRN

Sleep Apnea

CPT Codes, ICD 9 Codes Beth Israel Deaconess

Type 2 Diabetes

ICD 9 Codes, Labs, Medications eMERGE Northwestern

Venous Thromboembolism

CPT, ICD 9 Codes, Vital Signs Natural Language Processing eMERGE Mayo

slide-28
SLIDE 28

28

Electronic Health Record Training Set Data Mart Predicted Cases + Non-cases

  • 2. Iteratively

refine & test classification algorithm Validation Set

  • 1. Identify cases

and non-cases (often requires chart review)

  • 3. Validate

final algorithm (probabilistic approach)

slide-29
SLIDE 29

29 J Am Med Inform Assoc 2013 Genome Medicine 2015

slide-30
SLIDE 30

MVP Phenomics Group

Mission: 1) to provide a phenotyping framework for MVP Phenomics Science 2) to manage and coordinate resources for MVP phenotyping projects 3) to play a leading role towards “Mapping the Human Phenome” Organization: Kelly Cho PhD MPH Lead, MVP Phenotyping Scott DuVall PhD Lead, MVP-VINCI Collaboration Jackie Honerlaw RN MPH Manager, Phenomics Core Kevin Malohi BS Manager, VINCI Data Services Mai Nguyen PhD Manager, MVP Data Analytics Anne Ho MPH Lead, MVP Data Management David Gagnon MD PhD Lead, Biostatistics and Data Science

30

slide-31
SLIDE 31

Summary – Big Data Phenomics in the VA

  • Big data are messy
  • VA EHR data have been mapped to national VA

Corporate Data Warehouse (CDW)

  • CDW data have been transformed to OMOP Common

Data Model

  • Million Veteran Program actively using these data
  • Phenotype algorithms can be shared at PheKB.org

31

slide-32
SLIDE 32

32