Pre-amplification critically analysed Jo Vandesompele professor, - - PowerPoint PPT Presentation

pre amplification critically analysed
SMART_READER_LITE
LIVE PREVIEW

Pre-amplification critically analysed Jo Vandesompele professor, - - PowerPoint PPT Presentation

Pre-amplification critically analysed Jo Vandesompele professor, Ghent University co-founder and CEO, Biogazelle Advanced Methods in RNA quantification (London, UK) May 21, 2009 outline introduction 450 miRNA pre-amplification


slide-1
SLIDE 1

Pre-amplification critically analysed

Jo Vandesompele professor, Ghent University co-founder and CEO, Biogazelle Advanced Methods in RNA quantification (London, UK) May 21, 2009

slide-2
SLIDE 2
  • utline

 introduction  450 miRNA pre-amplification

 Mestdagh et al., Nucleic Acids Research, 2008  [Mestdagh et al., Genome Biology, in press]

 whole mRNAome pre-amplification

 prognostic gene signature in cancer patients  Vermeulen et al., The Lancet Oncology, accepted

slide-3
SLIDE 3

pre-amplification – the one and only quality criterion

 preservation of differential expression (fold changes)

before (B) and after (A) sample pre-amplification

 [no introduction of bias]

(G1S1)B/(G1S2) B = (G1S1) A/(G1S2) A G1B/G2B < > G1A/G2A gene G, sample S, before B, after A

slide-4
SLIDE 4

pre-amplification – the scene

 pre: before actual qPCR  amplification: make large amounts of RNA/(c)DNA from limited input

 single cell – picograms – nanograms >> micrograms

 transcriptome wide

 all long RNA molecules  advantages

  • no prior knowledge of target genes is needed
  • study can grow

 disadvantages

  • somewhat more expensive

 focused pre-amplification

 predefined set of sequences  advantages

  • fast and simple

 disadvantages

  • all targets need to be known in advance
slide-5
SLIDE 5

pre-amplification – the players

 transcriptome wide

 Eberwine method

  • T7-RNA polymerase based in vitro transcription > antisense RNA
  • home brew protocols

 SMART method (Clontech)

  • template switch mechanism sense RNA
  • T7-RNA in vitro transcription

 Phi29 based

  • rolling circle amplification – strand displacement
  • cDNA

 SPIA technology (NuGEN)

  • hybrid RNA/DNA SPIA primer
  • cDNA

 focused pre-amplification

 limited cycle PCR (10-14 cycles)

slide-6
SLIDE 6

5’ 3’ 5’ 3’ 5’ 5’ 3’

AAAAA UUUUU UUUUU AAAAA AAAAA TTTTT

1 streng cDNA-synthese (met T7-oligo(dT) primer)

ste

1 streng cDNA-synthese (met random hexameren)

ste

2 streng cDNA-synthese

de

antisense RNA-amplificatie (T7 RNA polymerase)

TTTTT TTTTT

5’ 3’ 3’ 5’ 3’ 5’ 3’ 5’ 5’ 3’

AAAAA TTTTT

5’ 5’ 3’ 3’ 5’

UUUUU

antisense RNA-amplificatie (T7 RNA polymerase)

3’

2 streng cDNA-synthese (met T7-oligo(dT) primer)

de

NNNNNN NNNNNN

5’ 3’

1 ronde

ste

2 ronde

de

Ebermine method

slide-7
SLIDE 7

SMART

slide-8
SLIDE 8

SPIA

slide-9
SLIDE 9

microRNA pre-amplification

 stem-loop megaplex reverse transcription using 20 ng total RNA  limited-cycle pre-amplification (14)  qPCR profiling 450 miRNAs and controls  higher sensitivity  minimal amplification bias (Mestdagh et al., Nucleic Acids Research)

slide-10
SLIDE 10

1 2 3 4 5 6 7 8 10 15 20 25 30 35 0.5 1 1.5 2 2.5 10 15 20 25 30

  • 0.6
  • 0.5
  • 0.4
  • 0.3
  • 0.2
  • 0.1
  • 1.4
  • 1.2
  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

1 2 3 4 5 6 7 8 10 15 20 25 30 35

Average CqNP (NBL-S, IMR-32) Average CqNP (NBL-S, IMR-32) ∆∆Cq (|∆CqNP - ∆CqP|) NBL-S, IMR-32 Average ∆∆Cq ∆∆Cq (|∆CqNP - ∆CqP|) NBL-S, IMR-32 Average ∆∆Cq

minimal pre-amplification bias

slide-11
SLIDE 11

5 10 15 20 25 30 2 4 6 8 10 12 14 5 10 15 20 25 30 35 2 4 6 8 10 12 14 5 10 15 20 25 30 2 4 6 8 5 10 15 20 25 30 35 2 4 6 8

Cq-value Cq-value

Cq-value Cq-value

total cell number total RNA input (pg) miR-18a R2 = 0.975 miR-20b R2 = 0.993 miR-92 R2 = 0.998 miR-19a R2 = 0.996

A B

1 2 4 8 16 32 64 128 1 2 4 8 16 32 64 128 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 30 25 20 15 10 5 30 25 20 15 10 5 35 30 25 20 15 10 5 35 30 25 20 15 10 5

single cell profiling

total cell number total RNA input (pg)

slide-12
SLIDE 12

Mestdagh et al., Nucleic Acids Research, 2008

slide-13
SLIDE 13
  • utline

 background research & goals  neuroblastoma  prognostic marker selection  study design and workflow

 RNA quality control  sample pre-amplification  normalization

 data-analysis and results

slide-14
SLIDE 14

biomarker signature based stratification

slide-15
SLIDE 15

biomarker signature based stratification

slide-16
SLIDE 16

aim

 development and validation of a robust prognostic gene signature for

neuroblastoma using real-time qPCR

 identifying patients with

 increased risk in the current low risk and high risk group  good molecular signature in the current high risk group

 better choice of risk-related therapy

slide-17
SLIDE 17

neuroblastoma

 most frequent extra-cranial solid tumor in

children

 originates from primitive (immature)

sympathetic nervous system cells

 1:100,000 children (< 15 years)

 20 cases/year Belgium | 700 cases/year

USA

 15% of childhood cancer deaths  prognosis is dependent on

 tumor stage (localized vs. metastatic

disease)

 age at diagnosis (< or > 1 year)  genetic defects: amplification MYCN, ploidy,

loss of 1p, gain of 17q

slide-18
SLIDE 18

prognostic classification

 misclassifications resulting in overtreatment or undertreatment  need for additional tumor-specific prognostic markers  current microarray gene expression studies

 data overfitting  unstable gene lists  lack of overlap  biological & technical noise  much more genes than samples  probe annotation / platform  different risk definition  different data processing and analysis

slide-19
SLIDE 19
  • meta-analysis of 7 published microarray gene

expression studies

  • literature screening of almost 800 abstracts from

single-gene studies

selection of a top ranking list of 59 prognostic markers

  • two PCR-based assays
  • capillary gel electrophoresis (Experion)

RNA quality control 423 samples sample pre-amplification (WT-Ovation) analysis of 366 primary untreated neuroblastoma tumours using real-time qPCR

  • Prediction Analysis of Microarrays
  • Kaplan-Meier
  • Cox proportional hazards

data-analysis

study workflow

slide-20
SLIDE 20

towards real-time PCR signature profiling

 100 ng total RNA

 30 ng quality control  10 ng unbiased amplification

WT-Ovation (NuGEN)

 PCR assay design and validation

 sensitivity, specificity and efficiency

RTPrimerDB (Pattyn et al., 2006, NAR; Lefever et al, 2009, NAR)

 absolute standards

 real-time PCR using 384-well format

 sample maximization strategy

(Hellemans et al., Genome Biology, 2007)

 366 tumors and 1 gene/plate

slide-21
SLIDE 21

WT-Ovation reproducibility

10,00 15,00 20,00 25,00 30,00 35,00 ACTB RPL13A 18S YWHAZ B2M GAPDH UBC HPRT1 SDHA HMBS mean Cq (n=3) genes

Stratagene cell line A cell line B cell line C

mean of 5, 15 and 50 ng of total RNA amplified

slide-22
SLIDE 22

WT-Ovation – no amplification bias

median bias = 0.36, 90%tile bias = 0.61

10 20 30 40 50 60 70 80 90 100 0,00 0,10 0,20 0,30 0,40 0,50 0,60 0,70 0,80 0,90 1,00

cumulative distribution bias (Cq)

slide-23
SLIDE 23

WT-Ovation – no amplification bias

median bias = 0.36, 90%tile bias = 0.61

10 20 30 40 50 60 70 80 90 100 0,00 0,10 0,20 0,30 0,40 0,50 0,60 0,70 0,80 0,90 1,00

cumulative distribution bias (Cq)

 no need for DNase treatment  no need for cleanup of amplified products

slide-24
SLIDE 24

preservation of differential expression

slide-25
SLIDE 25

qPCR reproducibility

5 10 15 20 25 30 35 40 50 100 150 200 250 300 350 400

10 # 100 # 1,000 # 10,000 #

 within a 384-well plate: 4 x 96 replicates

slide-26
SLIDE 26

qPCR reproducibility

 between two identical 384-well plates  maximum ΔCq: 0.45

15 20 25 30 35 40 15 20 25 30 35 40

slide-27
SLIDE 27

 synthetic control

 55 nucleotides  PAGE purification  blocking group  5 points dilution series: 15 molecules > 150.000 molecules

RCRP absolute standards stuffer FP

slide-28
SLIDE 28

absolute standards

 reproducibility across master mixes (5) and instruments (2)

5 10 15 20 25 30 35 1000000 100000 10000 1000 100 10 MM1 MM2 MM3 MM4 MM5

slide-29
SLIDE 29

absolute standards cross lab comparison

5 standards (triplicates) 5 reference genes + 5

  • ther genes

366 samples

slide-30
SLIDE 30

 5 standards (triplicates)

absolute standards cross lab comparison

average ΔCq standards correction Cq samples Cq qPCR instrument 1, mastermix 1 Cq qPCR instrument 2, mastermix 2

16 18 20 22 24 26 28 30 32 34 36 16 18 20 22 24 26 28 30 32 34 36

slide-31
SLIDE 31

 ARHGEF7 gene

 366 samples  use of 5 standards (triplicates) for correction

absolute standards cross lab comparison

Cq qPCR instrument 1, mastermix 1 Cq qPCR instrument 2, mastermix 2

16 18 20 22 24 26 28 30 32 34 36 16 18 20 22 24 26 28 30 32 34 36

slide-32
SLIDE 32

SPUD assay (Nolan et al, 2006): detection of inhibitors Computed gel analysis (Experion, Biorad): evaluation of total RNA quality 5’-3’ assay (HPRT1): evaluation of mRNA integrity

rigorous control of RNA quality

423 primary untreated NB (100 ng total RNA)

30 ng

366 RNA samples

slide-33
SLIDE 33

 differences in reference gene ranking between intact and degraded RNA

(Perez-Novo et al., Biotechniques, 2005)

impact of RNA quality on expression stability

slide-34
SLIDE 34

RNA quality parameters

2 4 6 8 10 10 20 30 40 50 5 10 15 20 20 40 60 80 5 10 15 20 25 20 40 60 80

RQI 5’-3’ dCq AluSq Cq frequency

slide-35
SLIDE 35

delta-Cq 5’-3’ vs. RQI

RNA samples ordered by average rank (good -> worse)

  • 50

50 100 150 200 250 300 350 400 450 50 100 150 200 250 300 350 400 450 RQI delta-Cq

slide-36
SLIDE 36

RNA quality control & sample selection

 423 samples:

 4 samples < DOT/DOO without event  5 samples < presence of enzymatic inhibitors (SPUD)  20 samples < lack of mRNA integrity (no ΔCq 5’-3’)

  • 12/14 failed WT-Ovation
  • all low RQI values

 28 samples < poor RNA quality (RQI + ΔCq 5’-3’)

 366 best samples (86.5 %)

 RQI:

  • average = 7.4
  • median = 7.6
  • 90%-tile > 6.1

 ΔCq 5’-3’:

  • average = 2.36
  • median = 2.06
  • 90%-tile < 4.75
slide-37
SLIDE 37

normalisation using geNorm technology

 framework for qPCR gene expression normalisation using the reference

gene concept:

 quantified errors related to the use of a single reference gene

(> 3 fold in 25% of the cases; > 6 fold in 10% of the cases)

 developed a robust algorithm for assessment of expression stability of

candidate reference genes

 proposed the geometric mean of at least 3 reference genes for

accurate and reliable normalisation

 Vandesompele et al., Genome Biology, 2002

slide-38
SLIDE 38

http://medgen.ugent.be/genorm

geNorm software

 automated analysis

 ranking of candidate reference genes according to their stability  determination of how many genes are required for reliable normalization

slide-39
SLIDE 39

0.003 0.006 0.021 0.023 0.056 NF4 NF1

 cancer patients survival curve

statistically more significant results

geNorm validation (I)

log rank statistics Hoebeeck et al., Int J Cancer, 2006

slide-40
SLIDE 40

 mRNA haploinsufficiency measurements

accurate assessment of small expression differences

geNorm validation (II)

Hellemans et al., Nature Genetics, 2004

patient / control

3 independent experiments

95% confidence intervals

slide-41
SLIDE 41

 use of multiple references is now well established

 > 1250 citations of our geNorm technology in PubMed  > 8000 geNorm downloads in 100 countries

normalisation using multiple stable reference genes

slide-42
SLIDE 42

 data analysis using qbasePLUS

 based on Ghent University’s geNorm and qBase technology  up to fifty 384-well plates  multiple reference genes for accurate normalization  detection and correction of inter-run variation  dedicated error propagation  automated analysis; no manual interaction required

data analysis

http://www.qbaseplus.com

slide-43
SLIDE 43

 data analysis using qbasePLUS

 based on Ghent University’s geNorm and qBase technology  up to fifty 384-well plates  multiple reference genes for accurate normalization  detection and correction of inter-run variation  dedicated error propagation  automated analysis; no manual interaction required

 59 prognostic markers + 5 reference genes  366 samples  hierarchical clustering  survival analysis

 Prediction Analysis of Microarrays  Cox proportional hazards modeling  Kaplan-Meier

data analysis

slide-44
SLIDE 44

Prediction Analysis of Microarrays PAM training test 15 low risk 15 high risk 334 samples PFS OS

slide-45
SLIDE 45

classification of patients with respect to PFS and OS

50 100 150 OS total SIOPEN cohort (n = 313)

survival probability (%)

LR n=245 (5) HR n=68 (27)

p = <0.001 (log-rank)

20 40 60 100 80 time (months) 50 100 150 PFS total SIOPEN cohort (n = 312)

survival probability (%)

LR n=245 (42) HR n=67 (35)

p = <0.001 (log-rank)

20 40 60 100 80 time (months)

slide-46
SLIDE 46

value of the classifier in relation to currently used risk factors: PFS

50 100 150 20 40 60 80 100 10 20 30 40 50 60 70 LR n=94 (17) LR n=152 (25) LR n=234 (39) LR n=222 (31) LR n=8 (1) LR n=24 (11) HR n=48 (24) HR n=31 (18) HR n=34 (21) HR n=34 (15) HR n=34 (17) HR n=20 (12)

p = <0.001 (log-rank) p = <0.001 (log-rank) p = <0.001 (log-rank) p = <0.001 (log-rank) p = 0.12 (log-rank)

time (months) 50 100 150 time (months) 50 100 150 time (months) 50 100 150 time (months) time (months) time (months) 100 80 60 40 20

survival probability (%)

100 80 60 40 20

survival probability (%)

100 80 60 40 20

survival probability (%)

100 80 60 40 20

survival probability (%)

100 80 60 40 20

survival probability (%) survival probability (%)

PFS age <=12 months (n = 172) PFS not stage 4 (n = 256) PFS stage 4 (n = 58) PFS amplification (n = 42 ) MYCN PFS single copy (n = 265) MYCN PFS age > 12 months (n = 142) p = 0.22 (log-rank)

100 80 60 40 20

slide-47
SLIDE 47

Cox multivariate analysis independent predictor (age, stage, MYCN) multivariate cox analysis

PAM classifier strong independent predictor: patients with high molecular risk have a 19-fold higher risk to die from disease a 4-fold higher risk for relapse/progression compared to patients with low molecular risk

slide-48
SLIDE 48

RNA quality control cut-off

 depends on the application

 microarray vs. qPCR  expression difference of the target(s)  abundance & stability of the target(s)  fresh frozen vs. FFPE

 based on the performance of our classifier

ROC AUC accuracy analysis

bad good RQI <3: 0.27 3: 0.82 5’-3’ dCq >7: 0.43 7: 0.79 AluSq Cq >15: 0.13 15: 0.81

slide-49
SLIDE 49

conclusions (I)

 validation matters – quality control along the entire workflow

 assay performance  template quality  normalization  data-analysis

slide-50
SLIDE 50

conclusions (II)

 largest qPCR gene-expression study (rigourous RNA quality control)

 optimized workflow  using minimal amounts of RNA (100 ng)  use of absolute standards (cross-lab comparison)  selected gene list (59) on a large panel of tumours (366 + 223)

 robust multigene expression prognostic classifier

 validated on an independent set of tumours  independent after controling for other known risk factors  suitable for routine lab tests

 this study might form the basis for future research, i.e. prospective

studies

 cDNA library source for future qPCR gene expression studies

slide-51
SLIDE 51

Frank Speleman

Jo Vandesompele

Nadine Van Roy

Katleen De Preter

Jasmien Hoebeeck

Filip Pattyn

Tom Van Maerken

Joëlle Vermeulen Center for Medical Genetics, Ghent, Belgium

Geneviève Laureys

Gianpaolo Tonini

Olivier Delattre

Jean Bénard

Valérie Combaret

Raymond Stallings

Angelika Eggert

Akira Nakagawara

Matthias Fischer Grants: Childhood Cancer Fund, Emmanuel van der Schueren foundation, UGent-GOA, FWO, IUAP, IWT Collaborators

Nurten Yigit

Els De Smet

Liesbeth Vercruysse

Anne De Paepe

acknowledgements