Genomics & Personalized Medicine: Analysis & Clinical - - PowerPoint PPT Presentation

genomics amp personalized medicine analysis amp clinical
SMART_READER_LITE
LIVE PREVIEW

Genomics & Personalized Medicine: Analysis & Clinical - - PowerPoint PPT Presentation

Genomics & Personalized Medicine: Analysis & Clinical Implementation Our vision To create a borderless, complementary and synergistic research environment in southeast Wisconsin to translate discoveries into better health for our


slide-1
SLIDE 1

Genomics & Personalized Medicine: Analysis & Clinical Implementation

slide-2
SLIDE 2

Our vision

To create a borderless, complementary and synergistic research environment in southeast Wisconsin to translate discoveries into better health for our citizens; while simultaneously providing comprehensive educational and training programs to develop the next generation of clinical and translational researchers.

slide-3
SLIDE 3

Our Principles of Operation

Creating a culture that fosters interdisciplinary innovation within and among institutions and contributes to the new discipline of clinical and translational science

Transformation

Providing core resources to assist investigators in the development, implementation, analysis, and dissemination

  • f clinical and translational research

Facilitation

Training clinicians and basic scientists in the emerging discipline of Clinical and Translational Science

Education

Leveraging institutional strengths and resources to enhance trans-disciplinary clinical and translational research

Collaboration

slide-4
SLIDE 4

CTSI-Cancer Center Workshop

Future Directions & Opportunities for Cancer-related Research Collaboration

Friday, December 16, 2011

Rehabilitation Research Workshop Wednesday, May 26, 2010 12:00 - 4:00 pm Marquette University

Nucleating Workshops & Conferences

Exploring the Metabolic Syndrome:

Basic Mechanisms, Clinical Implications and Community Impact

Wednesday, February 9, 2011

Brain Imaging Workshop University of Wisconsin - Milwaukee Wednesday, May 20, 2009

Drug Discovery and Repurposing: Form ing Partnerships May 31, 20 12 – Medical College of Wisconsin

slide-5
SLIDE 5

Convergence of Disciplines 2012 Pilot Awards

  • Medicine
  • Biomedical Informatics
  • Rehabilitation
  • Psychology
  • Economics
  • Nursing
  • Dentistry
  • Public Health
  • Computer Science
  • Business
  • Physical Therapy
  • Exercise Science
  • Biomedical Engineering
  • Genetics
  • Physics
  • Chemistry
  • Mechanical Engineering
  • Psychiatry
slide-6
SLIDE 6

Howard Jacob, PhD

  • Dr. Jacob led a team of researchers at the

Medical College of Wisconsin who used NextGen sequencing to help diagnose the medical condition of Nicholas Volker.

  • Dr. Jacob and his team have also built the

world’s first integrated Genomics Medicine Clinic.

  • He earned his PhD in Pharmacology from

the University of Iowa followed by fellowships at MIT, Harvard and Stanford.

  • He is the founding director of the Personalized Medicine Program and

Human and Molecular Genetics Center, and Professor in Physiology and Pediatrics at the Medical College of Wisconsin and Children’s Hospital of Wisconsin.

  • Dr. Jacob currently serves as the Director of Biomedical Informatics for

CTSI

slide-7
SLIDE 7

Who is Personalized Medicine?

Howard J. Jacob, Ph.D. http://www.mcw.edu/HMGC Jacob@mcw.edu

slide-8
SLIDE 8

Personalized Medicine

  • “products and services that leverage the

science of genomics and proteomics (directly

  • r indirectly) and capitalize on the trends

toward wellness and consumerism to enable tailored approaches to prevention and care.”

PricewaterhouseCoopers The new science of personalized medicine

slide-9
SLIDE 9

Goals for today

  • Discuss six aspects of personalized medicine

– Data generation Implementation – Data analysis Ethics – Data knowledge Education

  • Break out groups

– Meet, discuss, create opportunities for collaboration

  • Setting the stage
slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15

How long is all the DNA in your Body?

2m X 100,000,000,000,000 cells = 2 x1014m

The earth to the sun is 150,000,000,000m (1.5x1011)

2x1014/1.5x1011 = 1333 trips Or 666.5 round trips

slide-16
SLIDE 16
slide-17
SLIDE 17

IS FAMILY HISTORY IMPORTANT?

slide-18
SLIDE 18
slide-19
SLIDE 19

Personalized Medicine

Adults: Take 2 aspirin

Family history is how we practice genetics in the clinic every day in 2012

slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22

370 Drugs

slide-23
SLIDE 23

Nic Volker, gene found, treatment, doing well One in a Billion by Mark Johnson and Kathleen Gallagher

slide-24
SLIDE 24

Clinical Care Team Sequencing Team Clinical Molecular Testing Data Assessment Team EthicsTeam Clinical Phenotypes & Family History BioInformatics & Electronic Medical Records Patient

slide-25
SLIDE 25

Additional Reference Data Clinical Genomic Data Clinical Reference Data Diagnostic Test Results Molecular Reference Data

Expression profiling Proteomics profiling Genomic Medicine Genetics CNV profiling Epigenetic profiling Drug interactions Ref genomes Ref pathways Genotype to phenotype Family data Sequence data Ref genotypes Outcomes Clinical attributes Treatments Environmental exposures Radiology Phenotype data Molecular interactions Immunohistochemistry FACS Hematology Pathology Patent info Ref expression data Biomarkers Pharmacogenomics Biochemical Disease registry Diagnoses SNP typing Risk factors G x E associations

slide-26
SLIDE 26

Who will pay for all this?

slide-27
SLIDE 27

Letter from Insurance Group

6 out of 12 pre-authorizations

slide-28
SLIDE 28
slide-29
SLIDE 29

Personalized Medicine

Repeat for hundreds of diseases and treatments Repeat for thousands of patients Personalized Treatment

Individual Patient G + C + E

Predictive Model for Disease Susceptibility & Treatment Response State-of-the-Art Machine Learning Genetic, Clinical, & Environmental Data

slide-30
SLIDE 30

Would you like your genome sequenced?

slide-31
SLIDE 31

Categorical Model of Choice

  • ptional

disclosures mandatory disclosures

1° diagnostic 2° none 2° not actionable childhood 2° treatable childhood 2° actionable adulthood 2° not actionable adulthood

slide-32
SLIDE 32

Parental Decisions Utilizing Categorical Model

Proceeded with WGS None* Not Actionable Childhood Actionable Adult Not Actionable Adult Yes 12 1 10 10 7 No 1 11 2 2 5

*Mandatory disclosures include pathogenic and actionable childhood onset diseases.

slide-33
SLIDE 33

Consumer Driven

Individual Health & Wellness Plan

Data and Knowledge Changes Age and Health Status Changes

slide-34
SLIDE 34

1. Generate whole genome sequence 2. Analyze (de novo assembly) 3. Annotate the genome for all structures 4. Link to Electronic Medical Record 5. Link to all biomedical literature

  • 6. Provide data to physician for clinical management

ALL IN ONE DAY Then provide prospective follow-up as new discoveries come “online”

Goal

slide-35
SLIDE 35

Challenges

  • Managing Data
  • Analyzing Data

– Data Visualization

  • Diagnosis (genomics and clinical presentation)
  • Implementation
  • Storing data
  • Staying current as medical knowledge changes and

the patient changes

slide-36
SLIDE 36
slide-37
SLIDE 37
slide-38
SLIDE 38
slide-39
SLIDE 39
slide-40
SLIDE 40
slide-41
SLIDE 41

How do we help patients today?

Our CTSI is personalized medicine

slide-42
SLIDE 42

Nic Volker, gene found, treatment, doing well One in a Billion by Mark Johnson and Kathleen Gallagher Andrew, gene(s) have not been found, no diagnosis Cracking your genetic code; NOVA Avery What if her DNA was sequenced at birth?

slide-43
SLIDE 43

Current and Future Sequencing Technologies

Michael Tschannen

Molecular Genetics Technical Specialist Sequencing Core Manager Human and Molecular Genetics Center Medical College of Wisconsin

slide-44
SLIDE 44

Many Choices of Technology

slide-45
SLIDE 45

Sanger Sequencing – ABI3730xl

Specs: “Gold Standard” of Sequencing 800-1000bp read length 1.2Mb per 4 hour run Typical Applications: Clinical Variant Verification Amplicon Sequencing Gene Panel Sequencing Region Specific Sequencing

slide-46
SLIDE 46

Roche GS-FLX+ (454)

Specs: 2 million reads per 2 day run 450-700 bp reads 600-800 Mb per run Typical Uses: Metagenomics (16S) Amplicon Sequencing Bacterial WGS Viral WGS Sequence Capture Gene Panel Sequencing

slide-47
SLIDE 47

Ion Torrent – Life Technologies

Ion Personal Genome Machine Specs: Up to 200bp reads Up to 1Gb in 5 hour run Typical Uses: Amplicon Sequencing Sequence Capture Ion Proton Specs: Up to 200bp reads Up to 10Gb per 4 hour run Typical Uses: Whole Exome Sequencing Whole Genome Sequencing (future)

slide-48
SLIDE 48

Illumina HiSeq 2000

Specs: 2.5 – 3 Billion reads per 12 day run 2x100 Paired End Sequencing 2 Flowcells per run 2 Human Genomes per flowcell 24 Exomes per flowcell Typical Uses: Human/Rat/Mouse Whole Genome Sequencing Exome Sequencing RNAseq – Gene Expression / Transcript Identification

slide-49
SLIDE 49

Illumina MiSeq

Specs: 35 million reads per flowcell 1/5 of a single HiSeq Lane 2x250 PE reads 7-8Gb output per 36 hour run Upgrading to 15Gb output in 2013 Typical Uses: Whole Exome Sequencing HiSeq Library/Pool QC Amplicon Sequencing Gene Panel Sequencing

slide-50
SLIDE 50

Pacific Biosciences PacBio RS

Specs: 120,000 Reads per SMRT Cell 150-250 Mb per 3 hour cell run 16 SMRT cells per day 3,000bp average read length New XL chemistry – up to 5kB Typical Uses: Bacterial Whole Genome Viral Whole Genome Amplicon Sequencing Sequencing of repeats Genomic gap filling

slide-51
SLIDE 51

Oxford Nanopore

Specs: Protein Nanopore GridION – 2,000/8,000 pores – 150Gb/Day MinION – 500 pores – 25Gb/Day No set run time/read length Whole Human Genome -15 min - $10/Gb Uses: Still Unreleased Whole Genome Sequencing Metagenomics “Destination” Sequencing

slide-52
SLIDE 52

The Future

Higher Throughput Faster Cheaper Longer Read Length More Accurate Easier Assembly Efficient Data Storage

slide-53
SLIDE 53

Acknowledgements

Contact Info: Mike Tschannen mtschann@mcw.edu 414-456-8890

slide-54
SLIDE 54

Genomic Medicine Big Data: Production, Analysis and Management

Liz Worthey Ph.D

Assistant Professor Clinical and Translational Genomic Analysis Lab Human and Molecular Genetics Center Pediatric Genomics, Department of Pediatrics Medical College of Wisconsin

slide-55
SLIDE 55

Human Genome Sequencing

First Human Genome Sequence

  • 10 years to complete
  • $2.7 billion
  • 100s ABI 3730 sequencing

machines

  • 3 Gbases data total

Single Human Genome Sequence now

  • 8 days to complete
  • $5,000
  • 1 Illumina HiSeq2500 machine
  • 100 Gbases per day

The cost to sequence one base has dropped 100 million fold

slide-56
SLIDE 56

Many technologies exist

slide-57
SLIDE 57

Analysis phases

NGS RUN BASE CALLING Lane-level Q/C ALIGNMENT PRIMARY ALIGNMENT GENOTYPE CALLING DE-MULTIPLEX

SECONDARY TERTIARY

QC metrics Thresholds fastq

  • bam
  • other
  • Vcf
  • other

VARIANTS LIST NOVEL KNOWN ANNOTATE PRIORITIZE CLASSIFY CLINICAL REPORT

PRIMARY

ASSAY DESIGN SUB IND CNV SV

Derived from figure from Birgit Funke

slide-58
SLIDE 58

Challenge: A variety of events lead to production of many sequencing errors

  • Polymerase errors (~300,000 per genome)
  • De-Phasing

– Occurs with step-wise addition methods when growing primers move out of synchronicity for any given cycle

  • Dark Nucleotides causing false deletion errors

– The nucleotide does not contain a fluorescent label – Breakdown of dye-label nucleotides – Contamination with unlabeled nucleotides

slide-59
SLIDE 59

Sequencing the same sample with known variants on different platforms produces different sequencing errors

Courtesy of Chris Mason and SeQC Consortium

  • Illumina more likely to have error after ‘G’
  • PCR-based methods miss GC- and AT-rich regions
  • PolyA miscount errors for pyrosequencing methods
slide-60
SLIDE 60

Secondary analysis – read mapping

p.L80P p.Q98X p.Q86X p.V146G p.R123W p.L153R p.R173G p.R175P p.Q183H p.R217G p.E240X p.Q239X p.Q236X p.L227P p.A288G p.D311N p.E315A p.R319Q p.A323P p.D406V p.X420W p.C417F p.E391X p.Q384X p.R359W p.Q290X p.R256X p.R62X p.Q45X p.E57K

NEMO MECP2 Many methods are available: Which to use?

  • Speed
  • Accuracy
  • Computational

requirements

  • Robustness
  • Support
slide-61
SLIDE 61

Some of these regions are of significant interest

RPS17 HEXB DAZ1 SFTPA2 C4A C4B SSX4 INPP5E F7 CES1 PMS2 SOX18 CCL3L1 CHST14 SLC7A9 ATP8B1 CYP2R1 ABCC2 MME LPA CSH1 OCLN HBA2 KRT81 CFHR3 HRAS DSG1 FCGR2C SMN2 F13A1 CYP2C19 TFE3 CR1 NCF1 STRC CFC1 D2HGDH EVC FKTN FCGR2B ANTXR2 OTOA ENAM

52 2,925 829 Actionable Poorly covered Gaps shared in 25 WGS

SFTPA1 SMN1 OPN1MW OPN1LW IKBKG NEB SLC6A8 CHRNA3 CFD

Amelogenesis imperfecta Systemic lupus erythematosus

>90 diseases

Ehlers-Danlos syndrome, musculocontractural type Ellis-van Creveld syndrome C4a deficiency

881 1117 1551 309 185 932 362 663 526 309

Clinically actionable Genes

slide-62
SLIDE 62

Strategies for spanning gaps

  • Some gaps are caused by overrepresentation of certain

genomic regions in the sequence output, with underrepresentation of others

  • Sanger fill in – too expensive
  • WGS plus WES strategy?
  • Some gaps are caused by repeats that can’t be sequenced

through or cause issues with mapping – add longer read data from a PacBio or ?

  • De Novo assembly?
slide-63
SLIDE 63

Variant callers give different results on the same data

Differences between sequencing/analysis pipelines are between 4 and 14% of variants per sample Various mapping and variant calling algorithms used in each analysis

Rosenfeld, Mason, and Smith. PLoS One. 2012.

Concordance good for latest renditions of SNV callers, ok for indel callers, and poor for SVs Comparison of variants called from 32 HapMap genomes by Complete Genomics and 1K Genomes Project

Blue=CG Red=1KGP

slide-64
SLIDE 64

Even calling of SNVs has issues

Patient 2F Patient 2C2

Concordance between Illumina pipeline and BWA/GATK SNV calls

Rosenfeld, Mason, and Smith. PLoS One. 2012.

slide-65
SLIDE 65

But cannot simply exclude variants called by a single method

Transition mutations occur more frequently than transversions thus: (Ti/Tv >1) Sequencing errors tend to be more transversions thus: (Ti/Tv < 1)

  • Elliott Margulies et. al. at Illumina have shown that only 95% of high

quality SNVs are called the same in 14 stringently performed replicates

  • Solutions:
  • Ongoing algorithm development
  • Hold of until methods mature – we don’t yet “do” clinical SVs
slide-66
SLIDE 66

Tertiary analysis

~250,000 ~1,800 ~1,200 ~50 ~10 0-5 ~4,800,000

Variant storage, analysis, prioritization, and reporting

slide-67
SLIDE 67

Analytical considerations/challenges

  • n the variant impact axis
  • Existing WGS technologies produce many

sequencing errors

  • Existing mapping algorithms in combination with

short read technology give rise to many mapping errors

  • Bioinformatics limitations with variant calling

(especially indels and SV)

  • Data is incorrect/outdated - Allele Frequencies
  • Data must be used appropriately - nucleotide or

amino acid conservation scores

  • Data must be understood - SIFT, PolyPhen, Condel
slide-68
SLIDE 68

e.g. Allele frequency data cannot be relied upon without review

The allele frequency reported for a particular variant can vary widely amongst commonly used data sources These variants were not randomly selected – these variants are all associated with disease in HGMD Solution: compare multiple data sources and consider the sources in terms of possible source disease status as well as technological aspects etc.

slide-69
SLIDE 69
  • Many of the datasets being used were never intended to be

used in the way we are currently applying them

  • We estimate that up to ~8% of genotype to phenotype data

in larger mutation databases is incorrect:

– Typos/human entry errors are common

  • E.g. term breast cancer entered rather than cervical

– Old/Outdated

  • <70% in PubMed is captured in DBs

– Many entries subsequently disproven or even retracted – These are not seen as part of DBs “job”

  • It is important to understand the quality of the datasets and

to curate/track corrections or in house annotations

  • Data sharing is hugely important
  • Identifying a variant with a functional impact on a gene is not

the same as identifying the cause of the disease

Analytical considerations/challenges

  • n the functional classification axis
slide-70
SLIDE 70

e.g. Many reported “rare mutations” are not so rare polymorphisms

  • HUGE curation task to ensure data is correct and up to date.
  • Currently – never believe anything – many hours of

verification required

  • Data sharing will be critical to our cumulative success!
  • HEXB
  • NPHS1
  • LAMA2
  • ADA
  • MTHFR
  • GALC
  • MEFV
  • CYP21A2
  • ARSB
  • CPT1A
  • HESX1
  • HGSNAT
  • ACADM
  • FKTN
  • WNT10A
  • PMM2
  • MPL
  • POLG
  • BTD
  • NTRK1
  • ALG6
  • DPYD
  • HADHA
  • GAA
  • AHI1
  • AMPD1
  • ATP7B
  • CDH23
  • SBDS
  • NEFL
  • GLB1
  • ATP7B
  • ETFB
  • NHLRC1
  • IGHMBP2
  • SERPINA1
  • NPHS1
  • SLC26A2
  • MYO5A
  • DPYD
slide-71
SLIDE 71

Challenge: Data storage

Data storage costs currently ~$1,200 per year per patient per WGS (on high performance disks; without compression)

slide-72
SLIDE 72

At 1 Tb per genome

2013: 100,000 genomes 1 Tb each = 100 PetaBytes WGS for all new U.S. babies/year: 4,000,000 genomes = 4 ExaBytes WGS for all U.S. citizens over 50: 310,000,000 genomes = 310 ExaBytes WGS for all: 7,000,000,000 genomes = 7 ZettaBytes

slide-73
SLIDE 73

At 400 Gb per genome

2013: 100,000 genomes 100 Gb each = 40 TeraBytes WGS for all new U.S. babies/year: 4,000,000 genomes = 1.6 PetaBytes WGS for all U.S. citizens over 50: 310,000,000 genomes = 124 PetaBytes WGS for all: 7,000,000,000 genomes = 2.8 ExaBytes

slide-74
SLIDE 74

http://www.allthingstechnology.net/2011/07/how-much-byte-make-yottabyte.html

How much data is this?

@1Tb/genome All = 7 Zb @400 Gb/genome All = 2.8 Eb

slide-75
SLIDE 75

Solutions

  • Development of and agreement on

sustainable retention guidelines

  • Compression?

– By retaining only the data related to variant calls we can compress 50-100 fold. – But we need to keep in mind clinical use cases.

  • Transparency of data used to determine that no variant

existed (no call)?

  • Transparency of data that shows that a particular

region was not covered sufficiently to be sure of accuracy of the call – gap calls?

slide-76
SLIDE 76

Performing WGS in a clinically appropriate timeline

Data generation/analysis/reporting steps HiSeq2000 (Current) HiSeq2500 (RapidRun) Barcode and Accessioning of samples 0.5 0.5 DNA extraction and Sequencing 289* 33.5 DNA Extraction 2 2 Sequencing Lib Prep and Quant 77* 4.5 Verification and Sequencing 210* 27 Secondary analysis 19 19 Tertiary analysis 5 5 Variant Annotation 3 3 Production of Clinical Report, Delivery 2 2 Interpretation and reporting 11 11 Interpretation 9 9 Preparation and review of final report 2 2 Total (hours) 324.5 69

* 3 samples processed simultaneously in these steps

slide-77
SLIDE 77

Currently a $5,000 sequence, a $200 analysis, and a $1,250 interpretation +

  • Analysis/Interpretation:
  • Bioinformatician: loading, processing, report generation - $200
  • Clinical Geneticist: interpretation - $1000
  • Follow up:
  • Technician: Sanger confirmation/analysis - $150
  • Reporting:
  • Analyst / Clinical geneticist: Report finalization - $100

+

Getting better but not good enough

slide-78
SLIDE 78
  • Capabilities – We are a diverse group of researchers with varied

backgrounds and expertise in genetics, genomics, physiology, computer science, clinical informatics, statistics, clinical research, clinical diagnostics, bioinformatics etc. etc.

  • Needs – Progress is being made but many challenges remain in

the continued development of Genomic Medicine. In Informatics realm we have significant challenges in:

– Data storage and management, transfer and sharing – Secondary analysis – reference genome, mapping, variant calling – Tertiary analysis – annotation, prioritization, visualization – Clinical Interpretation – data mining, phenotype data extraction and analysis, genotype – phenotype correlation curation – Education – across the board

  • Goals – Collaboration to further our joint capabilities:

– Identification of disease cohorts for collaborations in translational research – Collaborations to develop novel or reuse existing methodologies/algorithms addressing the challenges outlined – Identification of areas of expertise for submission of joint grants

Summary

slide-79
SLIDE 79
  • Families and Patients
  • Referring Physicians
  • MCW- HMGC
  • Howard Jacob
  • David Bick
  • David Dimmock
  • Mary Shimoyama
  • Jill Northup
  • Jenny Guerts
  • Brad Taylor
  • CHW Genetics Center
  • Regan Veith
  • Angela Pickart
  • William Rhead
  • AGEN-Seq Technicians
  • Mike Tschannen
  • Daniel Helbling
  • Brett Chirempes
  • Jayme Wittke
  • Jamie Wendt-Andrae
  • CHW
  • Juliet Kersten
  • Paula North
  • Tara Schmit
  • Jack Routes
  • Altheia

Roquemore-Goins

  • Gail Bernadi
  • Michael Gutzeit
  • Steven Leuthner
  • Rodney Willoughby
  • Thomas May
  • Robert Kliegman
  • Funding/Support:
  • MCW Children’s

Research Institute

  • Jeffrey Modell

Foundation

  • Private Donors
  • Bioinformatics/Curation

/Systems support

  • Brandon Wilk
  • Jeremy Harris
  • Wendy Demos
  • Arthur Weborg
  • George Kowalski
  • Weihong Jin
  • Weisong Liu
  • Jeff DePons
  • Sharon Tsaih
  • Oliver Hummel
  • Stacy Zacher
  • Marek Tutaj
  • Greg McQuestion
  • Kent Brodie
  • Stan Laulederkind
  • Victoria Petri
  • Jennifer Smith
  • Alex Stoddard
  • Pushkala Jayaraman

Acknowledgements

slide-80
SLIDE 80

Genomics & Personalized Medicine: Analysis & Clinical Implementation Breakout Sessions 1 & 2

slide-81
SLIDE 81

Genomics and Science Education

  • Dr. Neil Lamb

HudsonAlpha Institute for Biotechnology

  • Dr. Tim Herman

Milwaukee School of Engineering

slide-82
SLIDE 82

Teachers FIRST

From Interesting Research to Scientific Teaching

Tim Herman Center for BioMolecular Modeling Milwaukee School of Engineering

An NIH Science Education Partnership Award (SEPA) project

slide-83
SLIDE 83

The MSOE Center for BioMolecular Modeling

…an instructional materials development laboratory, …with a science education outreach mission.

slide-84
SLIDE 84

… recent advances in the technologies that have delivered the promise of genomic and proteomic medicine to clinics have completely outstripped the education of the public who stand to benefit from this new science ….. … as well as the very health care professionals who suddenly find themselves in a position to make use of this new technology.

The Problem…..

slide-85
SLIDE 85

In the old days …..

slide-86
SLIDE 86
slide-87
SLIDE 87
  • Zinc Finger Nucleases and Genome Editing
  • The CCR5 Gene and Resistance to HIV
  • Cytochrome p450s and Pharmacogenomics
  • Beery Twins Story – Sepiapterin Reductase
  • Nic Volker Story –XIAP.

Genes, Genomes and Personalized Medicine

…. Molecular Stories ….

Nic Volker Story –XIAP.

slide-88
SLIDE 88

http://cbm.msoe.edu/stupro/so/module2012/xiapHome.html

slide-89
SLIDE 89

Neil E. Lamb, Ph.D. Director of Educational Outreach HudsonAlpha Institute for Biotechnology

an online game about complex disease, risk assessment and prevention/treatment

funded through a 5-year NIH Science and Education Partnership Award (SEPA)

slide-90
SLIDE 90
  • goal: assess disease risks for the six-person crew and

pack the ship with supplies that reduce risk and/or provide treatment

  • game can be played in single person or classroom format
  • Students’ activities and decisions are available to the

teacher

  • setting: preparing to launch the Argos1 - a 20-year

exploratory mission to Triton, a moon of Neptune

  • accompanying website with background

information

What is Touching Triton?

slide-91
SLIDE 91

Touching Triton Gameplay

select mission identify ‘combined disease risks’ choose crew member launch mission determine risks make packing recommendations

  • bserve outcomes

family history genomic findings medical records

slide-92
SLIDE 92

Crew Selection

slide-93
SLIDE 93

Crew Member Dashboard

slide-94
SLIDE 94

Medical Records

slide-95
SLIDE 95

Genomic Data

MACULAR DEGENERATION

slide-96
SLIDE 96

Family History

slide-97
SLIDE 97

Final Risk Assessment

slide-98
SLIDE 98

Packing

slide-99
SLIDE 99

Advisor support

slide-100
SLIDE 100
  • grounded in research
  • risk from genomic data - multiply odds ratios from each

SNP, compare to population risk

  • risks from medical records and family history selected

using a slider bar

  • verall risk assessment - slider bar
  • bjective: understand how students perceive risks in the

context of complex disorders

  • ‘consensus lifetime risk’ - determined from a survey of

medical geneticists, genetic counselors and clinical researchers - teachers can compare this to student risk estimates

About Risks

slide-101
SLIDE 101

Project Timeline

Y1 Y2 Y3 Y4 Y5

discovery & planning phase prototype generation usability testing to assess functionality repeated testing & revision; generate teacher content & background in-class testing; focus

  • n content

final beta testing; summer training for 30 early adopters final revisions & deployment; website complete embedded training in classrooms of 75 Alabama teachers

  • nline

training module crafted & deployed assessment & broad dissemination

  • Dec. 2012
slide-102
SLIDE 102

Filming of Crew Scenes

slide-103
SLIDE 103

Acknowledgements

  • Adam Hott
  • Kelly East
  • Jennifer Carden
  • Madelene Loftin
  • JD Frey

HudsonAlpha

  • Paul Chang
  • Bryan Powell
  • Nevin Langdon

Metabahn/PCD/HyCreative

  • Camellia Sanford

Rockman et al.

This project supported by the National Center for Research Resources and the Division of Program Coordination, Planning and Strategic Initiatives of the National Institutes of Health through Grant Number R25OD010981

The U.S. Space and Rocket Center

slide-104
SLIDE 104

Ethical, Legal, Social Implications (ELSI) related to Genomics (the brief version)

MCW Program in Genomics and Ethics and MCW Ctr. for Patient Care and Outcomes Research (nothing to disclose)

Mike Farrell, MD

slide-105
SLIDE 105
  • “Safety” ensuring more good than harm
  • ELSI: techniques to ensure that genomics is

safe for patients, relatives, and society.

  • Key: effective collaboration between

clinicians, lab, informatics & ELSI scholars

A “Safety” Paradigm for ELSI

Patient care TODAY Safer, wider practice TOMORROW ELSI Technological Advances

“What can we do?” “What should we do?” “How should we do it?”

slide-106
SLIDE 106
  • Distributive justice

– Socially just allocation of goods in society – Example question: should we spend money

  • n genomics when people need access to

care, immunizations, mental health care, food, etc…

  • This ELSI is commonly raised, but ignored

in this talk because of the assumption that genomic sequencing is here to stay.

Not included in this talk…

slide-107
SLIDE 107
  • When Genomics is implemented, what will

happen to…

ELSI example #1: Harm vs. Benefit

Patients? • Problems with understanding, anxiety, other emotions,

decision-making, self-image, stigmatization, etc…

Relatives? • Genomic information disseminates from the patient, exposing

relatives to risks for which they did not consent.

Children? • Especially vulnerable because identities are still forming.

  • No right of refusal.
  • The mis-paternity problem (“dad” is not biological father).
  • Disrupted bonding between parents and children.

Healthcare? • Providers will be confused, and may require re-training to deal

with a huge volume of information.

  • How will genomic data be treated by payors, employers?

Communities? • Will there be cultural disparities, stigmatization, or unique sorts

  • f problems with misconceptions?

Society? • Will genetic risk factors change how we appraise each others’

value as individuals?

  • Will genomics data worsen health disparities?
slide-108
SLIDE 108
  • Many ELSIs are phrased in the form of a

“lingering question.” Why is this so?

– Likelihood of the ELSI is unclear – It is hard to study a hypothetical innovation, or generalize from dissimilar situations – Some ELSIs are best described in works of imaginative fiction (e.g. the movie Gattaca) – fiction may raise awareness, or it may give the public an unrealistic idea of genomics ELSI example #1: Harm vs. Benefit cont’d

slide-109
SLIDE 109
  • Another problem: some genomic data has a

varying degree of uncertainty

  • Example: Does a given variant matter?

ELSI example #1: Harm vs. Benefit cont’d

(Mike’s view about)

Association between a genomic variant and phenotype Published evidence of association between variant and …

  • Disease
  • Risk for disease
  • Disease in animals
  • Risk for disease in animals
  • Disease in children vs. adults

Plausible association between variant and …

  • Disease
  • Risk for disease

Evidence or plausible association between variant and …

  • Phenotype that does not relate

to disease (e.g. height) Uncertain significance

slide-110
SLIDE 110
  • Is genomic sequencing a test, or a

procedure?

  • What constitutes adequate informed

consent? How much information needs to be provided?

  • How do we ensure that patients can weigh

harms and goods for themselves.

  • What are the boundaries between clinical

care and research?

ELSI example #2: Consent

slide-111
SLIDE 111
  • Given that sequencing is complicated,

and has risks and benefits…

–Should there be pre-requisites for credentials or training of providers before they can order sequencing? –Should patients be required to see a genetic counselor before and/or after sequencing? ELSI example #3: Professional issues

slide-112
SLIDE 112
  • Answer is not simple
  • Solutions need to be crafted for individual

ELSIs, and the communities that may experience them.

  • Some solutions have their own problems

– Genetic counselors: cost and supply limits – Consent: how much education is needed? – Best practices: do they matter if anybody can buy sequencing from a private lab?

  • Addressing ELSI is thus a matter of expertise

and troubleshooting

OK, so what do we do about ELSIs?

slide-113
SLIDE 113
  • Expertise in ELSIs, communication, and research ethics
  • Based in MCW’s Center for Bioethics and Medical

Humanities (itself based in Institute for Health & Society) Contact us to collaborate! kstrong@mcw.edu

Program in Genomics and Ethics

Kimberly Strong PhD Ryan Spellecy PhD Thomas May PhD Mike Farrell MD Art Derse MD JD Kaija Zusevics PhD (postdoc) Alison La Pean Kirschner MS CGC (genetic counselor)

slide-114
SLIDE 114

Current State of Practice: Whole Genome Sequencing in clinical use today

David Bick, M.D.

Director of Advanced Genomics Laboratory at Children's Hospital of Wisconsin Professor of Pediatrics and Obstetrics & Gynecology Chief, Section of Genetics, Dept. of Pediatrics, Medical College of Wisconsin

Human and Molecular Genetics Center

Geno enomics & & Per erson

  • nal

alized ed M Medi edicine: A Anal nalysis and and Clin Clinic ical I l Imple lementatio ion Clin Clinic ical & l & Transla lational Sc l Scie ience I Institute 12-17-12

slide-115
SLIDE 115

Disclosures

  • The following relationship(s) exist related to this

presentation:

– Children’s Hospital of Wisconsin (CHW) and Medical College of Wisconsin (MCW) provides fee for service whole genome sequencing (WGS) for clinical use. – WGS is not an FDA approved device, the FDA has determined that such approval is not necessary. – Testing is performed in CLIA/CAP approved laboratory.

slide-116
SLIDE 116

Whole Genome Sequencing (WGS) & Whole Exome Sequencing (WES) is in clinical practice Children’s Hospital of Wisconsin & Medical College of Wisconsin offers whole genome sequencing:

Nature 478, 22-24 (2011) News Feature Human genetics: Genomes on prescription Brendan Maher

48 patients evaluated for the programme, 17 have been accepted, and their families have gone through six hours or more of genetic counselling before sequencing. Insurance companies have agreed to foot the bill for at least two of the cases.

slide-117
SLIDE 117

Steps in clinical whole genome sequencing

  • Patient selection
  • Genetic counseling
  • Generation of sequence data - WGS

vs WES

  • Data analysis
  • Writing the report
  • How can we improve the process?
slide-118
SLIDE 118

Patient selection

  • Choose cases that are likely genetic

and rare

  • Non-syndromic MR not a good choice
  • Assure that all reasonable testing

already done & testing will advance clinical care

  • As more gene connected to

phenotypes loosen restrictions

slide-119
SLIDE 119

Genetic counseling

  • Topics for counseling. 6 – 10 hrs.
  • Report cause of patient’s problem and childhood onset

treatable disorders

  • Patient/family choice: Incidental findings – Ex: BRCA1
  • Insurance coverage

Exploration of expectations 16% Inheritance 14% Test Methodology 9% Categorical Model of Choice 18% Psychosocial Counseling 19% Follow-up Planning 10% Formal consent 8% Clinical genetics evaluation 9%

6-10 hours total

slide-120
SLIDE 120

WES vs WGS

Advantages of WES

  • Exome is 1% of

genome

– WES costs less than WGS – Deeper coverage of some areas compared to WGS

  • Exome includes only

the coding region

– Tools to interpret changes best developed

Advantages of WGS

  • WGS has better coverage of

some areas

– Exome capture array does not capture all exons

  • Certain parts of introns and

regions between genes can be interpreted

  • Encode project published 9/12

– more than 80% of the genome is functional!

  • mutations miR-96 cause

autosomal dominant progressive hearing loss

Generation of sequence data - WGS vs WES

slide-121
SLIDE 121

Laboratory Considerations in WGS/ WES

NGS RUN BASE CALLING Lane-level Q/C ALIGNMENT PRIMARY ALIGNMENT GENOTYPE CALLING DE-MULTIPLEX SECONDARY TERTIARY QC metrics Thresholds fastq

  • bam
  • other
  • Vcf
  • other

VARIANTS LIST NOVEL KNOWN ANNOTATE PRIORITIZE CLASSIFY CLINICAL REPORT

SUB IND CNV SV

PRIMARY ASSAY DESIGN

Prima mary analy lysis is – On machine chemistry and software – generates reads Seconda

  • ndary anal

alysis – align-ment software and variant calling software – calls variants Ter ertiary anal analysis – variant annotation software – clinical relevance

ANALYSIS STEPS:

slide-122
SLIDE 122

Analysis Challenges

  • The reference human

genome (called Hg19 ) 250 gaps

  • Much variation not in

Hg19 – WGS can end up with unmapped reads – Read with and insertion or deletion may not map to ref genome

  • Ref genome &

database of variants improving

  • Current NexGen devices

works well for single bp substitutions

  • Partial list of genomic variants and

structures

– Single base-pair substitutions – Insertions – Deletions – Adjacent insertion and deletion – GC rich regions – Trinucleotide repeats – Copy number variants – Homopolymer tracts – Translocations – Inversions – Short tandem repeats – Pseudogenes – Highly polymorphic regions (e.g. HLA locus)

slide-123
SLIDE 123

Tertiary analysis – annotation considerations

  • Incomplete phenotypic description of patients
  • Each variant evaluated for clinical relevance to

THAT PATIENT

– Known mutations

  • Human Gene Mutation Database, Online Mendelian

Inheritance in Man, Gene specific databases

– Stop, readthough, missense

  • NCBI , Ensembl
  • SIFT, Polyphen2

– Splice-site

  • GeneSplicer

– Evolutionary conservation

  • PhastCons for nucleotides, AA conservation tools

– Novel /rare variants

  • dbSNP, Exome Variant Server
slide-124
SLIDE 124

A brief history of genetic testing

  • Single disorder – Cystic Fibrosis

– Lab designs and validates test for this gene – MD orders test in patients with appropriate phenotype

  • Gene panel - Cardiomyopathy

– Lab reviews literature, chooses genes assoc with cardiomyopathy, designs and validates test for the chosen genes – MD orders test in patients with appropriate phenotype

Patient Suspected Clinical Dx Final Diagnosis Genetic Counselor Physicia n Molecular Dx Laboratory Director Genetic Counselor

  • Whole genome/exome

sequencing – Lab designs and validates test – MD orders test in patient

– Lab chooses genes associated with patient’s phenotype AFTER the test is run!

slide-125
SLIDE 125
  • Must confirm all

variants with Sanger

  • Large number of

errors in research literature for “disease causing” variants

  • Read original

paper before reporting!

Доверяй, но проверяй TRUS TRUST B T BUT V UT VERI ERIFY! FY!

Writing the report

slide-126
SLIDE 126

How can we improve the process?

  • Incidental findings

– How should these be handled by the lab?

  • Variants of uncertain significance

– How can these be changed to benign variants or pathogenic variants? – Mary E. Shimoyama, PhD & Jack Routes, MD – extending clinical whole genome sequencing

slide-127
SLIDE 127
  • Families
  • Referring Physicians
  • CHW WGS Review

Group

  • MCW Genetics
  • David Dimmock
  • Donald Basel
  • William Rhead
  • Stephanie Offord
  • Esperanza Font-

Montgomery

  • Gunter Scharer
  • CHW Genetics Center
  • Regan Veith
  • Amanda Laedtke
  • Heather Radtke
  • Dania Stachiw
  • Angela Pickart
  • Linda Smith
  • Jeff Kopesky
  • Neha Sekhri
  • Cristin Griffis
  • LuAnn Weik
  • Dimmock Lab
  • Daniel Helbling
  • CHW Laboratory

Medicine

  • Paula North
  • Tara Schmit
  • Altheia

Roquemore-Goins

  • Michelle Galalvez
  • Children’s Research

Institute

  • Juliet Kersten
  • Illumina
  • Tina Hambuch
  • Marc Laurent
  • Brad Sickler
  • Funding:
  • Children’s

Research Institute

  • Jeffrey Modell

Foundation

  • Private Donors
  • AGEN-Seq
  • Jaime Wendt
  • Jayme Wittke
  • Brett Chirempes
  • MCW- HMGC
  • Howard Jacob
  • Liz Worthey
  • Mary Shimoyama
  • Stan Lauderkind
  • Pushkala Jayaraman
  • Jenny Geurts
  • Victoria Petri
  • Alison Sarkis
  • Sasha Zeng
  • Bryce Schuler
  • Florence Yeo
  • Lisa Armitage
  • Bioinformatics/Systems

support

  • Jeff DePons
  • George Kowalski
  • Wes Rood
  • Sharon Tsaih
  • Brad Taylor
  • Stacy Zacher
  • Greg McQuestion
  • Kent Brodie

Acknowledgements – It takes a village

slide-128
SLIDE 128

Practical Limitations of Current Practice

  • Dr. Mary Shimoyama

Medical College of Wisconsin

  • Dr. John Routes

Medical College of Wisconsin

Research Clinical Divide and Moving Research Software to the Clinic

slide-129
SLIDE 129

Extending Current State of Practice

John M. Routes Chief, Section of Allergy and Clinical Immunology Co-Director, Clinical and Translational Science Institute of Southeast WI Professor of Pediatrics, Medicine, Microbiology and Molecular Genetics Medical College of Wisconsin

slide-130
SLIDE 130
  • Turnaround time
  • Candidate genes with data sets difficult

to interpret or manipulate

  • Candidate genes with unknown

function

  • Lack of experimental models to test

function of mutant protein

Current Limitations

slide-131
SLIDE 131

Mary Shimoyama, PhD

Assistant Professor

Human and Molecular Genetics Center

Moving Research Software to the Clinic

slide-132
SLIDE 132

Current Limitations

  • Multiple analysis tools
  • Single function
  • Lack of standards
  • Developed for basic research
  • Private – single use
  • Lack of centralized access points
  • Data scattered across sites
  • Lack of data standards and integration
  • Lack of software suitable for clinic use
  • Not designed for clinic workflow
  • Often require specialist knowledge
  • Difficult to use
slide-133
SLIDE 133

Leveraging Existing and Emerging Tools for Clinical Use

  • Identifying needs
  • Identifying tools – existing and

emerging

  • Identifying potential collaborations
  • Development and testing
  • Implementation
slide-134
SLIDE 134

Genomics & Personalized Medicine: Analysis & Clinical Implementation Breakout Sessions 3 & 4