[PPT] - Genomics & Personalized Medicine: Analysis & Clinical PowerPoint Presentation

SLIDE 1

Genomics & Personalized Medicine: Analysis & Clinical Implementation

SLIDE 2

Our vision

To create a borderless, complementary and synergistic research environment in southeast Wisconsin to translate discoveries into better health for our citizens; while simultaneously providing comprehensive educational and training programs to develop the next generation of clinical and translational researchers.

SLIDE 3

Our Principles of Operation

Creating a culture that fosters interdisciplinary innovation within and among institutions and contributes to the new discipline of clinical and translational science

Transformation

Providing core resources to assist investigators in the development, implementation, analysis, and dissemination

f clinical and translational research

Facilitation

Training clinicians and basic scientists in the emerging discipline of Clinical and Translational Science

Education

Leveraging institutional strengths and resources to enhance trans-disciplinary clinical and translational research

Collaboration

SLIDE 4

CTSI-Cancer Center Workshop

Future Directions & Opportunities for Cancer-related Research Collaboration

Friday, December 16, 2011

Rehabilitation Research Workshop Wednesday, May 26, 2010 12:00 - 4:00 pm Marquette University

Nucleating Workshops & Conferences

Exploring the Metabolic Syndrome:

Basic Mechanisms, Clinical Implications and Community Impact

Wednesday, February 9, 2011

Brain Imaging Workshop University of Wisconsin - Milwaukee Wednesday, May 20, 2009

Drug Discovery and Repurposing: Form ing Partnerships May 31, 20 12 – Medical College of Wisconsin

SLIDE 5

Convergence of Disciplines 2012 Pilot Awards

Medicine
Biomedical Informatics
Rehabilitation
Psychology
Economics
Nursing
Dentistry
Public Health
Computer Science
Business
Physical Therapy
Exercise Science
Biomedical Engineering
Genetics
Physics
Chemistry
Mechanical Engineering
Psychiatry

SLIDE 6

Howard Jacob, PhD

Dr. Jacob led a team of researchers at the

Medical College of Wisconsin who used NextGen sequencing to help diagnose the medical condition of Nicholas Volker.

Dr. Jacob and his team have also built the

world’s first integrated Genomics Medicine Clinic.

He earned his PhD in Pharmacology from

the University of Iowa followed by fellowships at MIT, Harvard and Stanford.

He is the founding director of the Personalized Medicine Program and

Human and Molecular Genetics Center, and Professor in Physiology and Pediatrics at the Medical College of Wisconsin and Children’s Hospital of Wisconsin.

Dr. Jacob currently serves as the Director of Biomedical Informatics for

CTSI

SLIDE 7

Who is Personalized Medicine?

Howard J. Jacob, Ph.D. http://www.mcw.edu/HMGC Jacob@mcw.edu

SLIDE 8

Personalized Medicine

“products and services that leverage the

science of genomics and proteomics (directly

r indirectly) and capitalize on the trends

toward wellness and consumerism to enable tailored approaches to prevention and care.”

PricewaterhouseCoopers The new science of personalized medicine

SLIDE 9

Goals for today

Discuss six aspects of personalized medicine

– Data generation Implementation – Data analysis Ethics – Data knowledge Education

Break out groups

– Meet, discuss, create opportunities for collaboration

Setting the stage

SLIDE 10

SLIDE 11

SLIDE 12

SLIDE 13

SLIDE 14

SLIDE 15

How long is all the DNA in your Body?

2m X 100,000,000,000,000 cells = 2 x1014m

The earth to the sun is 150,000,000,000m (1.5x1011)

2x1014/1.5x1011 = 1333 trips Or 666.5 round trips

SLIDE 16

SLIDE 17

IS FAMILY HISTORY IMPORTANT?

SLIDE 18

SLIDE 19

Personalized Medicine

Adults: Take 2 aspirin

Family history is how we practice genetics in the clinic every day in 2012

SLIDE 20

SLIDE 21

SLIDE 22

370 Drugs

SLIDE 23

Nic Volker, gene found, treatment, doing well One in a Billion by Mark Johnson and Kathleen Gallagher

SLIDE 24

Clinical Care Team Sequencing Team Clinical Molecular Testing Data Assessment Team EthicsTeam Clinical Phenotypes & Family History BioInformatics & Electronic Medical Records Patient

SLIDE 25

Additional Reference Data Clinical Genomic Data Clinical Reference Data Diagnostic Test Results Molecular Reference Data

Expression profiling Proteomics profiling Genomic Medicine Genetics CNV profiling Epigenetic profiling Drug interactions Ref genomes Ref pathways Genotype to phenotype Family data Sequence data Ref genotypes Outcomes Clinical attributes Treatments Environmental exposures Radiology Phenotype data Molecular interactions Immunohistochemistry FACS Hematology Pathology Patent info Ref expression data Biomarkers Pharmacogenomics Biochemical Disease registry Diagnoses SNP typing Risk factors G x E associations

SLIDE 26

Who will pay for all this?

SLIDE 27

Letter from Insurance Group

6 out of 12 pre-authorizations

SLIDE 28

SLIDE 29

Personalized Medicine

Repeat for hundreds of diseases and treatments Repeat for thousands of patients Personalized Treatment

Individual Patient G + C + E

Predictive Model for Disease Susceptibility & Treatment Response State-of-the-Art Machine Learning Genetic, Clinical, & Environmental Data

SLIDE 30

Would you like your genome sequenced?

SLIDE 31

Categorical Model of Choice

ptional

disclosures mandatory disclosures

1° diagnostic 2° none 2° not actionable childhood 2° treatable childhood 2° actionable adulthood 2° not actionable adulthood

SLIDE 32

Parental Decisions Utilizing Categorical Model

Proceeded with WGS None* Not Actionable Childhood Actionable Adult Not Actionable Adult Yes 12 1 10 10 7 No 1 11 2 2 5

*Mandatory disclosures include pathogenic and actionable childhood onset diseases.

SLIDE 33

Consumer Driven

Individual Health & Wellness Plan

Data and Knowledge Changes Age and Health Status Changes

SLIDE 34

1. Generate whole genome sequence 2. Analyze (de novo assembly) 3. Annotate the genome for all structures 4. Link to Electronic Medical Record 5. Link to all biomedical literature

6. Provide data to physician for clinical management

ALL IN ONE DAY Then provide prospective follow-up as new discoveries come “online”

Goal

SLIDE 35

Challenges

Managing Data
Analyzing Data

– Data Visualization

Diagnosis (genomics and clinical presentation)
Implementation
Storing data
Staying current as medical knowledge changes and

the patient changes

SLIDE 36

SLIDE 37

SLIDE 38

SLIDE 39

SLIDE 40

SLIDE 41

How do we help patients today?

Our CTSI is personalized medicine

SLIDE 42

Nic Volker, gene found, treatment, doing well One in a Billion by Mark Johnson and Kathleen Gallagher Andrew, gene(s) have not been found, no diagnosis Cracking your genetic code; NOVA Avery What if her DNA was sequenced at birth?

SLIDE 43

Current and Future Sequencing Technologies

Michael Tschannen

Molecular Genetics Technical Specialist Sequencing Core Manager Human and Molecular Genetics Center Medical College of Wisconsin

SLIDE 44

Many Choices of Technology

SLIDE 45

Sanger Sequencing – ABI3730xl

Specs: “Gold Standard” of Sequencing 800-1000bp read length 1.2Mb per 4 hour run Typical Applications: Clinical Variant Verification Amplicon Sequencing Gene Panel Sequencing Region Specific Sequencing

SLIDE 46

Roche GS-FLX+ (454)

Specs: 2 million reads per 2 day run 450-700 bp reads 600-800 Mb per run Typical Uses: Metagenomics (16S) Amplicon Sequencing Bacterial WGS Viral WGS Sequence Capture Gene Panel Sequencing

SLIDE 47

Ion Torrent – Life Technologies

Ion Personal Genome Machine Specs: Up to 200bp reads Up to 1Gb in 5 hour run Typical Uses: Amplicon Sequencing Sequence Capture Ion Proton Specs: Up to 200bp reads Up to 10Gb per 4 hour run Typical Uses: Whole Exome Sequencing Whole Genome Sequencing (future)

SLIDE 48

Illumina HiSeq 2000

Specs: 2.5 – 3 Billion reads per 12 day run 2x100 Paired End Sequencing 2 Flowcells per run 2 Human Genomes per flowcell 24 Exomes per flowcell Typical Uses: Human/Rat/Mouse Whole Genome Sequencing Exome Sequencing RNAseq – Gene Expression / Transcript Identification

SLIDE 49

Illumina MiSeq

Specs: 35 million reads per flowcell 1/5 of a single HiSeq Lane 2x250 PE reads 7-8Gb output per 36 hour run Upgrading to 15Gb output in 2013 Typical Uses: Whole Exome Sequencing HiSeq Library/Pool QC Amplicon Sequencing Gene Panel Sequencing

SLIDE 50

Pacific Biosciences PacBio RS

Specs: 120,000 Reads per SMRT Cell 150-250 Mb per 3 hour cell run 16 SMRT cells per day 3,000bp average read length New XL chemistry – up to 5kB Typical Uses: Bacterial Whole Genome Viral Whole Genome Amplicon Sequencing Sequencing of repeats Genomic gap filling

SLIDE 51

Oxford Nanopore

Specs: Protein Nanopore GridION – 2,000/8,000 pores – 150Gb/Day MinION – 500 pores – 25Gb/Day No set run time/read length Whole Human Genome -15 min - $10/Gb Uses: Still Unreleased Whole Genome Sequencing Metagenomics “Destination” Sequencing

SLIDE 52

The Future

Higher Throughput Faster Cheaper Longer Read Length More Accurate Easier Assembly Efficient Data Storage

SLIDE 53

Acknowledgements

Contact Info: Mike Tschannen mtschann@mcw.edu 414-456-8890

SLIDE 54

Genomic Medicine Big Data: Production, Analysis and Management

Liz Worthey Ph.D

Assistant Professor Clinical and Translational Genomic Analysis Lab Human and Molecular Genetics Center Pediatric Genomics, Department of Pediatrics Medical College of Wisconsin

SLIDE 55

Human Genome Sequencing

First Human Genome Sequence

10 years to complete
$2.7 billion
100s ABI 3730 sequencing

machines

3 Gbases data total

Single Human Genome Sequence now

8 days to complete
$5,000
1 Illumina HiSeq2500 machine
100 Gbases per day

The cost to sequence one base has dropped 100 million fold

SLIDE 56

Many technologies exist

SLIDE 57

Analysis phases

NGS RUN BASE CALLING Lane-level Q/C ALIGNMENT PRIMARY ALIGNMENT GENOTYPE CALLING DE-MULTIPLEX

SECONDARY TERTIARY

QC metrics Thresholds fastq

bam
other
Vcf
other

VARIANTS LIST NOVEL KNOWN ANNOTATE PRIORITIZE CLASSIFY CLINICAL REPORT

PRIMARY

ASSAY DESIGN SUB IND CNV SV

Derived from figure from Birgit Funke

SLIDE 58

Challenge: A variety of events lead to production of many sequencing errors

Polymerase errors (~300,000 per genome)
De-Phasing

– Occurs with step-wise addition methods when growing primers move out of synchronicity for any given cycle

Dark Nucleotides causing false deletion errors

– The nucleotide does not contain a fluorescent label – Breakdown of dye-label nucleotides – Contamination with unlabeled nucleotides

SLIDE 59

Sequencing the same sample with known variants on different platforms produces different sequencing errors

Courtesy of Chris Mason and SeQC Consortium

Illumina more likely to have error after ‘G’
PCR-based methods miss GC- and AT-rich regions
PolyA miscount errors for pyrosequencing methods

SLIDE 60

Secondary analysis – read mapping

p.L80P p.Q98X p.Q86X p.V146G p.R123W p.L153R p.R173G p.R175P p.Q183H p.R217G p.E240X p.Q239X p.Q236X p.L227P p.A288G p.D311N p.E315A p.R319Q p.A323P p.D406V p.X420W p.C417F p.E391X p.Q384X p.R359W p.Q290X p.R256X p.R62X p.Q45X p.E57K

NEMO MECP2 Many methods are available: Which to use?

Speed
Accuracy
Computational

requirements

Robustness
Support

SLIDE 61

Some of these regions are of significant interest

RPS17 HEXB DAZ1 SFTPA2 C4A C4B SSX4 INPP5E F7 CES1 PMS2 SOX18 CCL3L1 CHST14 SLC7A9 ATP8B1 CYP2R1 ABCC2 MME LPA CSH1 OCLN HBA2 KRT81 CFHR3 HRAS DSG1 FCGR2C SMN2 F13A1 CYP2C19 TFE3 CR1 NCF1 STRC CFC1 D2HGDH EVC FKTN FCGR2B ANTXR2 OTOA ENAM

52 2,925 829 Actionable Poorly covered Gaps shared in 25 WGS

SFTPA1 SMN1 OPN1MW OPN1LW IKBKG NEB SLC6A8 CHRNA3 CFD

Amelogenesis imperfecta Systemic lupus erythematosus

>90 diseases

Ehlers-Danlos syndrome, musculocontractural type Ellis-van Creveld syndrome C4a deficiency

881 1117 1551 309 185 932 362 663 526 309

Clinically actionable Genes

SLIDE 62

Strategies for spanning gaps

Some gaps are caused by overrepresentation of certain

genomic regions in the sequence output, with underrepresentation of others

Sanger fill in – too expensive
WGS plus WES strategy?
Some gaps are caused by repeats that can’t be sequenced

through or cause issues with mapping – add longer read data from a PacBio or ?

De Novo assembly?

SLIDE 63

Variant callers give different results on the same data

Differences between sequencing/analysis pipelines are between 4 and 14% of variants per sample Various mapping and variant calling algorithms used in each analysis

Rosenfeld, Mason, and Smith. PLoS One. 2012.

Concordance good for latest renditions of SNV callers, ok for indel callers, and poor for SVs Comparison of variants called from 32 HapMap genomes by Complete Genomics and 1K Genomes Project

Blue=CG Red=1KGP

SLIDE 64

Even calling of SNVs has issues

Patient 2F Patient 2C2

Concordance between Illumina pipeline and BWA/GATK SNV calls

Rosenfeld, Mason, and Smith. PLoS One. 2012.

SLIDE 65

But cannot simply exclude variants called by a single method

Transition mutations occur more frequently than transversions thus: (Ti/Tv >1) Sequencing errors tend to be more transversions thus: (Ti/Tv < 1)

Elliott Margulies et. al. at Illumina have shown that only 95% of high

quality SNVs are called the same in 14 stringently performed replicates

Solutions:
Ongoing algorithm development
Hold of until methods mature – we don’t yet “do” clinical SVs

SLIDE 66

Tertiary analysis

~250,000 ~1,800 ~1,200 ~50 ~10 0-5 ~4,800,000

Variant storage, analysis, prioritization, and reporting

SLIDE 67

Analytical considerations/challenges

n the variant impact axis
Existing WGS technologies produce many

sequencing errors

Existing mapping algorithms in combination with

short read technology give rise to many mapping errors

Bioinformatics limitations with variant calling

(especially indels and SV)

Data is incorrect/outdated - Allele Frequencies
Data must be used appropriately - nucleotide or

amino acid conservation scores

Data must be understood - SIFT, PolyPhen, Condel

SLIDE 68

e.g. Allele frequency data cannot be relied upon without review

The allele frequency reported for a particular variant can vary widely amongst commonly used data sources These variants were not randomly selected – these variants are all associated with disease in HGMD Solution: compare multiple data sources and consider the sources in terms of possible source disease status as well as technological aspects etc.

SLIDE 69

Many of the datasets being used were never intended to be

used in the way we are currently applying them

We estimate that up to ~8% of genotype to phenotype data

in larger mutation databases is incorrect:

– Typos/human entry errors are common

E.g. term breast cancer entered rather than cervical

– Old/Outdated

<70% in PubMed is captured in DBs

– Many entries subsequently disproven or even retracted – These are not seen as part of DBs “job”

It is important to understand the quality of the datasets and

to curate/track corrections or in house annotations

Data sharing is hugely important
Identifying a variant with a functional impact on a gene is not

the same as identifying the cause of the disease

Analytical considerations/challenges

n the functional classification axis

SLIDE 70

e.g. Many reported “rare mutations” are not so rare polymorphisms

HUGE curation task to ensure data is correct and up to date.
Currently – never believe anything – many hours of

verification required

Data sharing will be critical to our cumulative success!
HEXB
NPHS1
LAMA2
ADA
MTHFR
GALC
MEFV
CYP21A2
ARSB
CPT1A
HESX1
HGSNAT
ACADM
FKTN
WNT10A
PMM2
MPL
POLG
BTD
NTRK1
ALG6
DPYD
HADHA
GAA
AHI1
AMPD1
ATP7B
CDH23
SBDS
NEFL
GLB1
ATP7B
ETFB
NHLRC1
IGHMBP2
SERPINA1
NPHS1
SLC26A2
MYO5A
DPYD

SLIDE 71

Challenge: Data storage

Data storage costs currently ~$1,200 per year per patient per WGS (on high performance disks; without compression)

SLIDE 72

At 1 Tb per genome

2013: 100,000 genomes 1 Tb each = 100 PetaBytes WGS for all new U.S. babies/year: 4,000,000 genomes = 4 ExaBytes WGS for all U.S. citizens over 50: 310,000,000 genomes = 310 ExaBytes WGS for all: 7,000,000,000 genomes = 7 ZettaBytes

SLIDE 73

At 400 Gb per genome

2013: 100,000 genomes 100 Gb each = 40 TeraBytes WGS for all new U.S. babies/year: 4,000,000 genomes = 1.6 PetaBytes WGS for all U.S. citizens over 50: 310,000,000 genomes = 124 PetaBytes WGS for all: 7,000,000,000 genomes = 2.8 ExaBytes

SLIDE 74

http://www.allthingstechnology.net/2011/07/how-much-byte-make-yottabyte.html

How much data is this?

@1Tb/genome All = 7 Zb @400 Gb/genome All = 2.8 Eb

SLIDE 75

Solutions

Development of and agreement on

sustainable retention guidelines

Compression?

– By retaining only the data related to variant calls we can compress 50-100 fold. – But we need to keep in mind clinical use cases.

Transparency of data used to determine that no variant

existed (no call)?

Transparency of data that shows that a particular

region was not covered sufficiently to be sure of accuracy of the call – gap calls?

SLIDE 76

Performing WGS in a clinically appropriate timeline

Data generation/analysis/reporting steps HiSeq2000 (Current) HiSeq2500 (RapidRun) Barcode and Accessioning of samples 0.5 0.5 DNA extraction and Sequencing 289* 33.5 DNA Extraction 2 2 Sequencing Lib Prep and Quant 77* 4.5 Verification and Sequencing 210* 27 Secondary analysis 19 19 Tertiary analysis 5 5 Variant Annotation 3 3 Production of Clinical Report, Delivery 2 2 Interpretation and reporting 11 11 Interpretation 9 9 Preparation and review of final report 2 2 Total (hours) 324.5 69

* 3 samples processed simultaneously in these steps

SLIDE 77

Currently a $5,000 sequence, a $200 analysis, and a $1,250 interpretation +

Analysis/Interpretation:
Bioinformatician: loading, processing, report generation - $200
Clinical Geneticist: interpretation - $1000
Follow up:
Technician: Sanger confirmation/analysis - $150
Reporting:
Analyst / Clinical geneticist: Report finalization - $100

+

Getting better but not good enough

SLIDE 78

Capabilities – We are a diverse group of researchers with varied

backgrounds and expertise in genetics, genomics, physiology, computer science, clinical informatics, statistics, clinical research, clinical diagnostics, bioinformatics etc. etc.

Needs – Progress is being made but many challenges remain in

the continued development of Genomic Medicine. In Informatics realm we have significant challenges in:

– Data storage and management, transfer and sharing – Secondary analysis – reference genome, mapping, variant calling – Tertiary analysis – annotation, prioritization, visualization – Clinical Interpretation – data mining, phenotype data extraction and analysis, genotype – phenotype correlation curation – Education – across the board

Goals – Collaboration to further our joint capabilities:

– Identification of disease cohorts for collaborations in translational research – Collaborations to develop novel or reuse existing methodologies/algorithms addressing the challenges outlined – Identification of areas of expertise for submission of joint grants

Summary

SLIDE 79

Families and Patients
Referring Physicians
MCW- HMGC
Howard Jacob
David Bick
David Dimmock
Mary Shimoyama
Jill Northup
Jenny Guerts
Brad Taylor
CHW Genetics Center
Regan Veith
Angela Pickart
William Rhead
AGEN-Seq Technicians
Mike Tschannen
Daniel Helbling
Brett Chirempes
Jayme Wittke
Jamie Wendt-Andrae
CHW
Juliet Kersten
Paula North
Tara Schmit
Jack Routes
Altheia

Roquemore-Goins

Gail Bernadi
Michael Gutzeit
Steven Leuthner
Rodney Willoughby
Thomas May
Robert Kliegman
Funding/Support:
MCW Children’s

Research Institute

Jeffrey Modell

Foundation

Private Donors
Bioinformatics/Curation

/Systems support

Brandon Wilk
Jeremy Harris
Wendy Demos
Arthur Weborg
George Kowalski
Weihong Jin
Weisong Liu
Jeff DePons
Sharon Tsaih
Oliver Hummel
Stacy Zacher
Marek Tutaj
Greg McQuestion
Kent Brodie
Stan Laulederkind
Victoria Petri
Jennifer Smith
Alex Stoddard
Pushkala Jayaraman

Acknowledgements

SLIDE 80

Genomics & Personalized Medicine: Analysis & Clinical Implementation Breakout Sessions 1 & 2

SLIDE 81

Genomics and Science Education

Dr. Neil Lamb

HudsonAlpha Institute for Biotechnology

Dr. Tim Herman

Milwaukee School of Engineering

SLIDE 82

Teachers FIRST

From Interesting Research to Scientific Teaching

Tim Herman Center for BioMolecular Modeling Milwaukee School of Engineering

An NIH Science Education Partnership Award (SEPA) project

SLIDE 83

The MSOE Center for BioMolecular Modeling

…an instructional materials development laboratory, …with a science education outreach mission.

SLIDE 84

… recent advances in the technologies that have delivered the promise of genomic and proteomic medicine to clinics have completely outstripped the education of the public who stand to benefit from this new science ….. … as well as the very health care professionals who suddenly find themselves in a position to make use of this new technology.

The Problem…..

SLIDE 85

In the old days …..

SLIDE 86

SLIDE 87

Zinc Finger Nucleases and Genome Editing
The CCR5 Gene and Resistance to HIV
Cytochrome p450s and Pharmacogenomics
Beery Twins Story – Sepiapterin Reductase
Nic Volker Story –XIAP.

Genes, Genomes and Personalized Medicine

…. Molecular Stories ….

Nic Volker Story –XIAP.

SLIDE 88

http://cbm.msoe.edu/stupro/so/module2012/xiapHome.html

SLIDE 89

Neil E. Lamb, Ph.D. Director of Educational Outreach HudsonAlpha Institute for Biotechnology

an online game about complex disease, risk assessment and prevention/treatment

funded through a 5-year NIH Science and Education Partnership Award (SEPA)

SLIDE 90

goal: assess disease risks for the six-person crew and

pack the ship with supplies that reduce risk and/or provide treatment

game can be played in single person or classroom format
Students’ activities and decisions are available to the

teacher

setting: preparing to launch the Argos1 - a 20-year

exploratory mission to Triton, a moon of Neptune

accompanying website with background

information

What is Touching Triton?

SLIDE 91

Touching Triton Gameplay

select mission identify ‘combined disease risks’ choose crew member launch mission determine risks make packing recommendations

bserve outcomes

family history genomic findings medical records

SLIDE 92

Crew Selection

SLIDE 93

Crew Member Dashboard

SLIDE 94

Medical Records

SLIDE 95

Genomic Data

MACULAR DEGENERATION

SLIDE 96

Family History

SLIDE 97

Final Risk Assessment

SLIDE 98

Packing

SLIDE 99

Advisor support

SLIDE 100

grounded in research
risk from genomic data - multiply odds ratios from each

SNP, compare to population risk

risks from medical records and family history selected

using a slider bar

verall risk assessment - slider bar
bjective: understand how students perceive risks in the

context of complex disorders

‘consensus lifetime risk’ - determined from a survey of

medical geneticists, genetic counselors and clinical researchers - teachers can compare this to student risk estimates

About Risks

SLIDE 101

Project Timeline

Y1 Y2 Y3 Y4 Y5

discovery & planning phase prototype generation usability testing to assess functionality repeated testing & revision; generate teacher content & background in-class testing; focus

n content

final beta testing; summer training for 30 early adopters final revisions & deployment; website complete embedded training in classrooms of 75 Alabama teachers

nline

training module crafted & deployed assessment & broad dissemination

Dec. 2012

SLIDE 102

Filming of Crew Scenes

SLIDE 103

Acknowledgements

Adam Hott
Kelly East
Jennifer Carden
Madelene Loftin
JD Frey

HudsonAlpha

Paul Chang
Bryan Powell
Nevin Langdon

Metabahn/PCD/HyCreative

Camellia Sanford

Rockman et al.

This project supported by the National Center for Research Resources and the Division of Program Coordination, Planning and Strategic Initiatives of the National Institutes of Health through Grant Number R25OD010981

The U.S. Space and Rocket Center

SLIDE 104

Ethical, Legal, Social Implications (ELSI) related to Genomics (the brief version)

MCW Program in Genomics and Ethics and MCW Ctr. for Patient Care and Outcomes Research (nothing to disclose)

Mike Farrell, MD

SLIDE 105

“Safety” ensuring more good than harm
ELSI: techniques to ensure that genomics is

safe for patients, relatives, and society.

Key: effective collaboration between

clinicians, lab, informatics & ELSI scholars

A “Safety” Paradigm for ELSI

Patient care TODAY Safer, wider practice TOMORROW ELSI Technological Advances

“What can we do?” “What should we do?” “How should we do it?”

SLIDE 106

Distributive justice

– Socially just allocation of goods in society – Example question: should we spend money

n genomics when people need access to

care, immunizations, mental health care, food, etc…

This ELSI is commonly raised, but ignored

in this talk because of the assumption that genomic sequencing is here to stay.

Not included in this talk…

SLIDE 107

When Genomics is implemented, what will

happen to…

ELSI example #1: Harm vs. Benefit

Patients? • Problems with understanding, anxiety, other emotions,

decision-making, self-image, stigmatization, etc…

Relatives? • Genomic information disseminates from the patient, exposing

relatives to risks for which they did not consent.

Children? • Especially vulnerable because identities are still forming.

No right of refusal.
The mis-paternity problem (“dad” is not biological father).
Disrupted bonding between parents and children.

Healthcare? • Providers will be confused, and may require re-training to deal

with a huge volume of information.

How will genomic data be treated by payors, employers?

Communities? • Will there be cultural disparities, stigmatization, or unique sorts

f problems with misconceptions?

Society? • Will genetic risk factors change how we appraise each others’

value as individuals?

Will genomics data worsen health disparities?

SLIDE 108

Many ELSIs are phrased in the form of a

“lingering question.” Why is this so?

– Likelihood of the ELSI is unclear – It is hard to study a hypothetical innovation, or generalize from dissimilar situations – Some ELSIs are best described in works of imaginative fiction (e.g. the movie Gattaca) – fiction may raise awareness, or it may give the public an unrealistic idea of genomics ELSI example #1: Harm vs. Benefit cont’d

SLIDE 109

Another problem: some genomic data has a

varying degree of uncertainty

Example: Does a given variant matter?

ELSI example #1: Harm vs. Benefit cont’d

(Mike’s view about)

Association between a genomic variant and phenotype Published evidence of association between variant and …

Disease
Risk for disease
Disease in animals
Risk for disease in animals
Disease in children vs. adults

Plausible association between variant and …

Disease
Risk for disease

Evidence or plausible association between variant and …

Phenotype that does not relate

to disease (e.g. height) Uncertain significance

SLIDE 110

Is genomic sequencing a test, or a

procedure?

What constitutes adequate informed

consent? How much information needs to be provided?

How do we ensure that patients can weigh

harms and goods for themselves.

What are the boundaries between clinical

care and research?

ELSI example #2: Consent

SLIDE 111

Given that sequencing is complicated,

and has risks and benefits…

–Should there be pre-requisites for credentials or training of providers before they can order sequencing? –Should patients be required to see a genetic counselor before and/or after sequencing? ELSI example #3: Professional issues

SLIDE 112

Answer is not simple
Solutions need to be crafted for individual

ELSIs, and the communities that may experience them.

Some solutions have their own problems

– Genetic counselors: cost and supply limits – Consent: how much education is needed? – Best practices: do they matter if anybody can buy sequencing from a private lab?

Addressing ELSI is thus a matter of expertise

and troubleshooting

OK, so what do we do about ELSIs?

SLIDE 113

Expertise in ELSIs, communication, and research ethics
Based in MCW’s Center for Bioethics and Medical

Humanities (itself based in Institute for Health & Society) Contact us to collaborate! kstrong@mcw.edu

Program in Genomics and Ethics

Kimberly Strong PhD Ryan Spellecy PhD Thomas May PhD Mike Farrell MD Art Derse MD JD Kaija Zusevics PhD (postdoc) Alison La Pean Kirschner MS CGC (genetic counselor)

SLIDE 114

Current State of Practice: Whole Genome Sequencing in clinical use today

David Bick, M.D.

Director of Advanced Genomics Laboratory at Children's Hospital of Wisconsin Professor of Pediatrics and Obstetrics & Gynecology Chief, Section of Genetics, Dept. of Pediatrics, Medical College of Wisconsin

Human and Molecular Genetics Center

Geno enomics & & Per erson

nal

alized ed M Medi edicine: A Anal nalysis and and Clin Clinic ical I l Imple lementatio ion Clin Clinic ical & l & Transla lational Sc l Scie ience I Institute 12-17-12

SLIDE 115

Disclosures

The following relationship(s) exist related to this

presentation:

– Children’s Hospital of Wisconsin (CHW) and Medical College of Wisconsin (MCW) provides fee for service whole genome sequencing (WGS) for clinical use. – WGS is not an FDA approved device, the FDA has determined that such approval is not necessary. – Testing is performed in CLIA/CAP approved laboratory.

SLIDE 116

Whole Genome Sequencing (WGS) & Whole Exome Sequencing (WES) is in clinical practice Children’s Hospital of Wisconsin & Medical College of Wisconsin offers whole genome sequencing:

Nature 478, 22-24 (2011) News Feature Human genetics: Genomes on prescription Brendan Maher

48 patients evaluated for the programme, 17 have been accepted, and their families have gone through six hours or more of genetic counselling before sequencing. Insurance companies have agreed to foot the bill for at least two of the cases.

SLIDE 117

Steps in clinical whole genome sequencing

Patient selection
Genetic counseling
Generation of sequence data - WGS

vs WES

Data analysis
Writing the report
How can we improve the process?

SLIDE 118

Patient selection

Choose cases that are likely genetic

and rare

Non-syndromic MR not a good choice
Assure that all reasonable testing

already done & testing will advance clinical care

As more gene connected to

phenotypes loosen restrictions

SLIDE 119

Genetic counseling

Topics for counseling. 6 – 10 hrs.
Report cause of patient’s problem and childhood onset

treatable disorders

Patient/family choice: Incidental findings – Ex: BRCA1
Insurance coverage

Exploration of expectations 16% Inheritance 14% Test Methodology 9% Categorical Model of Choice 18% Psychosocial Counseling 19% Follow-up Planning 10% Formal consent 8% Clinical genetics evaluation 9%

6-10 hours total

SLIDE 120

WES vs WGS

Advantages of WES

Exome is 1% of

genome

– WES costs less than WGS – Deeper coverage of some areas compared to WGS

Exome includes only

the coding region

– Tools to interpret changes best developed

Advantages of WGS

WGS has better coverage of

some areas

– Exome capture array does not capture all exons

Certain parts of introns and

regions between genes can be interpreted

Encode project published 9/12

– more than 80% of the genome is functional!

mutations miR-96 cause

autosomal dominant progressive hearing loss

Generation of sequence data - WGS vs WES

SLIDE 121

Laboratory Considerations in WGS/ WES

NGS RUN BASE CALLING Lane-level Q/C ALIGNMENT PRIMARY ALIGNMENT GENOTYPE CALLING DE-MULTIPLEX SECONDARY TERTIARY QC metrics Thresholds fastq

bam
other
Vcf
other

VARIANTS LIST NOVEL KNOWN ANNOTATE PRIORITIZE CLASSIFY CLINICAL REPORT

SUB IND CNV SV

PRIMARY ASSAY DESIGN

Prima mary analy lysis is – On machine chemistry and software – generates reads Seconda

ndary anal

alysis – align-ment software and variant calling software – calls variants Ter ertiary anal analysis – variant annotation software – clinical relevance

ANALYSIS STEPS:

SLIDE 122

Analysis Challenges

The reference human

genome (called Hg19 ) 250 gaps

Much variation not in

Hg19 – WGS can end up with unmapped reads – Read with and insertion or deletion may not map to ref genome

Ref genome &

database of variants improving

Current NexGen devices

works well for single bp substitutions

Partial list of genomic variants and

structures

– Single base-pair substitutions – Insertions – Deletions – Adjacent insertion and deletion – GC rich regions – Trinucleotide repeats – Copy number variants – Homopolymer tracts – Translocations – Inversions – Short tandem repeats – Pseudogenes – Highly polymorphic regions (e.g. HLA locus)

SLIDE 123

Tertiary analysis – annotation considerations

Incomplete phenotypic description of patients
Each variant evaluated for clinical relevance to

THAT PATIENT

– Known mutations

Human Gene Mutation Database, Online Mendelian

Inheritance in Man, Gene specific databases

– Stop, readthough, missense

NCBI , Ensembl
SIFT, Polyphen2

– Splice-site

GeneSplicer

– Evolutionary conservation

PhastCons for nucleotides, AA conservation tools

– Novel /rare variants

dbSNP, Exome Variant Server

SLIDE 124

A brief history of genetic testing

Single disorder – Cystic Fibrosis

– Lab designs and validates test for this gene – MD orders test in patients with appropriate phenotype

Gene panel - Cardiomyopathy

– Lab reviews literature, chooses genes assoc with cardiomyopathy, designs and validates test for the chosen genes – MD orders test in patients with appropriate phenotype

Patient Suspected Clinical Dx Final Diagnosis Genetic Counselor Physicia n Molecular Dx Laboratory Director Genetic Counselor

Whole genome/exome

sequencing – Lab designs and validates test – MD orders test in patient

– Lab chooses genes associated with patient’s phenotype AFTER the test is run!

SLIDE 125

Must confirm all

variants with Sanger

Large number of

errors in research literature for “disease causing” variants

Read original

paper before reporting!

Доверяй, но проверяй TRUS TRUST B T BUT V UT VERI ERIFY! FY!

Writing the report

SLIDE 126

How can we improve the process?

Incidental findings

– How should these be handled by the lab?

Variants of uncertain significance

– How can these be changed to benign variants or pathogenic variants? – Mary E. Shimoyama, PhD & Jack Routes, MD – extending clinical whole genome sequencing

SLIDE 127

Families
Referring Physicians
CHW WGS Review

Group

MCW Genetics
David Dimmock
Donald Basel
William Rhead
Stephanie Offord
Esperanza Font-

Montgomery

Gunter Scharer
CHW Genetics Center
Regan Veith
Amanda Laedtke
Heather Radtke
Dania Stachiw
Angela Pickart
Linda Smith
Jeff Kopesky
Neha Sekhri
Cristin Griffis
LuAnn Weik
Dimmock Lab
Daniel Helbling
CHW Laboratory

Medicine

Paula North
Tara Schmit
Altheia

Roquemore-Goins

Michelle Galalvez
Children’s Research

Institute

Juliet Kersten
Illumina
Tina Hambuch
Marc Laurent
Brad Sickler
Funding:
Children’s

Research Institute

Jeffrey Modell

Foundation

Private Donors
AGEN-Seq
Jaime Wendt
Jayme Wittke
Brett Chirempes
MCW- HMGC
Howard Jacob
Liz Worthey
Mary Shimoyama
Stan Lauderkind
Pushkala Jayaraman
Jenny Geurts
Victoria Petri
Alison Sarkis
Sasha Zeng
Bryce Schuler
Florence Yeo
Lisa Armitage
Bioinformatics/Systems

support

Jeff DePons
George Kowalski
Wes Rood
Sharon Tsaih
Brad Taylor
Stacy Zacher
Greg McQuestion
Kent Brodie

Acknowledgements – It takes a village

SLIDE 128

Practical Limitations of Current Practice

Dr. Mary Shimoyama

Medical College of Wisconsin

Dr. John Routes

Medical College of Wisconsin

Research Clinical Divide and Moving Research Software to the Clinic

SLIDE 129

Extending Current State of Practice

John M. Routes Chief, Section of Allergy and Clinical Immunology Co-Director, Clinical and Translational Science Institute of Southeast WI Professor of Pediatrics, Medicine, Microbiology and Molecular Genetics Medical College of Wisconsin

SLIDE 130

Turnaround time
Candidate genes with data sets difficult

to interpret or manipulate

Candidate genes with unknown

function

Lack of experimental models to test

function of mutant protein

Current Limitations

SLIDE 131

Mary Shimoyama, PhD

Assistant Professor

Human and Molecular Genetics Center

Moving Research Software to the Clinic

SLIDE 132

Current Limitations

Multiple analysis tools
Single function
Lack of standards
Developed for basic research
Private – single use
Lack of centralized access points
Data scattered across sites
Lack of data standards and integration
Lack of software suitable for clinic use
Not designed for clinic workflow
Often require specialist knowledge
Difficult to use

SLIDE 133

Leveraging Existing and Emerging Tools for Clinical Use

Identifying needs
Identifying tools – existing and

emerging

Identifying potential collaborations
Development and testing
Implementation

SLIDE 134