Different approaches on normalisation of gene expression RT-qPCR - - PowerPoint PPT Presentation

different approaches on normalisation of gene expression
SMART_READER_LITE
LIVE PREVIEW

Different approaches on normalisation of gene expression RT-qPCR - - PowerPoint PPT Presentation

Different approaches on normalisation of gene expression RT-qPCR data Jan Hellemans PhD, Ghent University co-founder and CEO, Biogazelle Lo studio dellespressione genica in real -time PCR September 5, 2008 Siena, Italy acknowledgement


slide-1
SLIDE 1

Different approaches on normalisation of gene expression RT-qPCR data

Jan Hellemans PhD, Ghent University co-founder and CEO, Biogazelle Lo studio dell‟espressione genica in real-time PCR September 5, 2008 Siena, Italy

slide-2
SLIDE 2

acknowledgement

 Jo Vandesompele (geNorm, qBase)  Filip Pattyn  Jasmien Hoebeeck  Katleen De Preter  Nurten Yigit  Frank Speleman  Geert Mortier

slide-3
SLIDE 3

qPCR workflow Cq values Data processing Statistical analysis & interpretations Experiment design Sample prep Assay design qPCR reactions

slide-4
SLIDE 4

qPCR workflow – data processing Cq values Data processing Statistical analysis & interpretations Experiment design Sample prep Assay design qPCR reactions

Data processing

  • absolute vs relative quantification
  • relative quantification
  • Cq values to relative quantities
  • unwanted technical variation
  • normalization
  • inter-run calibration
slide-5
SLIDE 5

absolute vs relative quantification

 “absolute” quantification

how many copies or molecules (molarity)

standard (dilution series)

quantification relative to the standard

 “all quantification is relative”

the accuracy of external standard quantification is entirely dependent

  • n the accuracy of the standards

log linear relationship between input and Ct + reproducibility

precise and reproducible answer, but not necessarily an accurate answer

exception: digital PCR

 relative quantification

  • ne sample relative to another

  • ne transcript relative to another (e.g. splice isoforms)

Vandenbroucke et al., Nucleic Acids Research, 2001

 comparative Cq method / delta-Cq method

slide-6
SLIDE 6

relative quantification – Cq values to relative quantities

RQ = 2ΔCq

RQ1/3 = 24 = 16 RQ2/3 = 22 = 4 RQ3/3 = 20 = 1

2 4 1 2 3

PCR cycle threshold fluorescence Cq1 Cq2 Cq3

slide-7
SLIDE 7

relative quantification – variation differences in RQ due to

  • different gene expression level
slide-8
SLIDE 8

Data processing - relative quantification differences in RQ due to

  • different gene expression level
  • different total starting amount
slide-9
SLIDE 9

relative quantification – variation differences in RQ due to

  • different gene expression level
  • different total starting amount
  • run dependent differences
slide-10
SLIDE 10

relative quantification – variation differences in RQ due to

  • different gene expression level
  • different total starting amount
  • run dependent differences
slide-11
SLIDE 11

relative quantification – variation Cq RQ NRQ CNRQ Normalization Inter-run calibration technical variation

  • avoid
  • minimize
  • correct
slide-12
SLIDE 12

relative quantification – Cq values to relative quantities Cq RQ NRQ CNRQ RQ = 2ΔCq RQ = EΔCq

Calculate gene specific amplification efficiency (E) from

  • dilution series
  • fluorescence curve
slide-13
SLIDE 13

relative quantification – amplification efficiencies

 calculate and propagate the error on E  minimize SE(E)

 increase number of dilution points (n)  increase range of dilution

n a a n a a a

Q Q Cq Cq Q Q slope

1 2 1 slope

E

1

10 ) 1 (n s s slope SE

x e 2

10 ln slope slope SE E E SE 2

1 2 , ,

n Cq Cq s

n a predicted a measured a e n a a x

Q Q n s

1 2

1 1

slide-14
SLIDE 14

relative quantification – amplification efficiencies

 1-2

SE(E)=0.032

increase number of points increase range of points

 1&2

SE(E)=0.032

slide-15
SLIDE 15

relative quantification – amplification efficiencies

 1-2

SE(E)=0.032

 1-3

SE(E)=0.013

increase number of points increase range of points

 1&2

SE(E)=0.032

 1&3

SE(E)=0.018

slide-16
SLIDE 16

relative quantification – amplification efficiencies

 1-2

SE(E)=0.032

 1-3

SE(E)=0.013

 1-4

SE(E)=0.008

increase number of points increase range of points

 1&2

SE(E)=0.032

 1&3

SE(E)=0.018

 1&4

SE(E)=0.008

slide-17
SLIDE 17

relative quantification – amplification efficiencies

 1-2

SE(E)=0.032

 1-3

SE(E)=0.013

 1-4

SE(E)=0.008

 1-5

SE(E)=0.005

increase number of points increase range of points

 1&2

SE(E)=0.032

 1&3

SE(E)=0.018

 1&4

SE(E)=0.008

 1&5

SE(E)=0.004

slide-18
SLIDE 18

relative quantification – amplification efficiencies

 1-2

SE(E)=0.032

 1-3

SE(E)=0.013

 1-4

SE(E)=0.008

 1-5

SE(E)=0.005

 1-6

SE(E)=0.003

increase number of points increase range of points

 1&2

SE(E)=0.032

 1&3

SE(E)=0.018

 1&4

SE(E)=0.008

 1&5

SE(E)=0.004

 1&6

SE(E)=0.002

slide-19
SLIDE 19

relative quantification – amplification efficiencies

101% 920%

slide-20
SLIDE 20

relative quantification – normalization

 minimize technical variation

 sample: size and type  RNA extraction: quality and quantity  RNA degradation  cDNA synthesis

 correct for technical variation

 normalization

Cq RQ NRQ CNRQ

slide-21
SLIDE 21

relative quantification – normalization

 Livak and Schmittgen (2001)

 100% PCR efficiency  1 reference gene

 Pfaffl (2001)

 adjusted PCR efficiency  1 reference gene

 qBase model (2007)

 adjusted PCR efficiency  multiple reference genes Ct

NRQ 2

ref Ct ref goi Ct goi

E E NRQ

, , n ref Ct ref n i goi Ct goi

i i

E E NRQ

, ,

slide-22
SLIDE 22

relative quantification – inter-run calibration

 different analysis settings

Cq RQ NRQ CNRQ

slide-23
SLIDE 23

relative quantification – inter-run calibration

 different analysis settings  instrument, reagents and consumable variation

Cq RQ NRQ CNRQ

slide-24
SLIDE 24

relative quantification – inter-run calibration

 avoid inter-run variation

 use sample maximization

 minimize inter-run variation

 use the same instrument, reagents and consumables

 correct for inter-run variation

 inter-run calibration

Cq RQ NRQ CNRQ

slide-25
SLIDE 25

relative quantification – inter-run calibration

 [3 GOI + 3 REF] x [11 samples + 1 NTC]

slide-26
SLIDE 26

 sample maximization is to be preferred

 no increase in variation due to absence of inter-run variation  suitable for retrospective studies and controlled experiments

 gene maximization

 introduces (under-estimated) inter-run variation  applicable for prospective studies or large studies in which the number

  • f samples do not fit on the plate anymore

 inter-run variation can be controlled and corrected for using inter-run

calibrators (IRC)

 inter-run calibration

 possible on two levels:

  • Ct values
  • normalized relative quantities

 the more inter-run calibrators, the better  simple vs. complex inter-run calibration  specialised software is needed

relative quantification – inter-run calibration

slide-27
SLIDE 27

relative quantification – inter-run calibration

 correct for inter-run variation by including IRC‟s  IRC:

 inter-run calibrator  identical sample measured for the same gene in different runs

Cq RQ NRQ CNRQ IRCrun 1 IRCrun 2

+ = NRQlow + = NRQhigh

CNRQ

slide-28
SLIDE 28

relative quantification - qBase

slide-29
SLIDE 29

relative quantification – qBase & qBasePlus

Hellemans et al., Genome Biology, 2007 http://www.qbaseplus.com

slide-30
SLIDE 30

relative quantification - qBasePlus

slide-31
SLIDE 31

qPCR workflow – data processing Cq values Data processing Statistical analysis & interpretations Experiment design Sample prep Assay design qPCR reactions

Normalization

  • why reference genes?
  • why multiple reference genes?
  • geNorm
  • effect of sample quality
  • normalization quality control
slide-32
SLIDE 32

normalization

 2 sources of variation in gene expression measurements

 gene-specific (biological) variation (true fold changes)  non-specific (technical) variation

  • RNA extraction yield
  • RNA quantity & quality
  • reverse transcription efficiency
  • PCR efficiency (inhibitors)

 purpose of normalization is reduction of the technical/experimental

variation

slide-33
SLIDE 33

normalization

slide-34
SLIDE 34

normalization – reference genes

 reference genes

 most popular  captures most variation

 attention!

 reference genes (might) vary in expression  until recently, non-validated reference genes were used

(assuming stable expression)

 normalisation against 3 or more validated reference genes is considered

as the most appropriate and universally applicable method

 which genes?  how to do the calculations?

slide-35
SLIDE 35

normalization – multiple reference genes

 framework for qPCR gene expression normalisation using the reference

gene concept:

 quantified errors related to the use of a single reference gene

(> 3 fold in 25% of the cases; > 6 fold in 10% of the cases)

 developed a robust algorithm for assessment of expression stability of

candidate reference genes

 proposed the geometric mean of at least 3 reference genes for

accurate and reliable normalisation

 Vandesompele et al., Genome Biology, 2002

slide-36
SLIDE 36

normalization – multiple reference genes

 quantitative RT-PCR analysis of 10 reference genes (belonging to different

functional and abundance classes) on 85 samples from 13 different human tissues

1 2 3 4

ACTB HMBS HPRT1 TBP UBC A B C D E F G 15 fold difference between A and B if normalized by only one gene (ACTB or HMBS)

slide-37
SLIDE 37

normalization – geNorm

 pairwise variation V (between 2 genes)  gene stability measure M

average pairwise variation V of a gene with all other genes gene A gene B sample 1 a1 b1 log2(a1/b1) sample 2 a2 b2 log2(a2/b2) sample 3 a3 b3 log2(a3/b3) … … … … sample n an bn log2(an/bn) standard deviation = V

slide-38
SLIDE 38

http://medgen.ugent.be/genorm

normalization – geNorm

 automated analysis

 ranking of candidate reference genes according to their stability  determination of how many genes are required for reliable normalization

slide-39
SLIDE 39

0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 HM BS B2M RPL1 3A SDHA TBP ACTB UBC YHWAZ GAPD HPRT1

normalization – geNorm

 ranking of candidate reference genes according to their stability

slide-40
SLIDE 40

normalization – geNorm

 geometric mean of 3 reference gene expression levels  controls for outliers  compensates for differences in expression level between the reference

genes geometric mean = (a x b x c) 1/3 arithmetic mean = a + b + c 3

slide-41
SLIDE 41

 robust – insensitive to outliers

normalization – geNorm

NF ACTB HMBS HPRT1 TBP UBC

slide-42
SLIDE 42

 purpose of normalization: reduction of technical variation

  • nly geNorm best reference genes are able to reduce most of the variation

normalization – geNorm

3 good references 3 mediocre references 3 bad references

slide-43
SLIDE 43

0.003 0.006 0.021 0.023 0.056 NF4 NF1

 cancer patients survival curve

statistically more significant results

normalization – geNorm

log rank statistics Hoebeeck et al., Int J Cancer, 2006

slide-44
SLIDE 44

 mRNA haploinsufficiency measurements

accurate assessment of small expression differences

normalization – geNorm

Hellemans et al., Nature Genetics, 2004

patient / control

3 independent experiments

95% confidence intervals

slide-45
SLIDE 45

present mathematical (linear mixed-effects) models to further analyze candidate reference genes

log yij = μ + Ti + Gj + εij

 other approaches

 Global Pattern Recognition (Akilesh et al., Genome Research, 2003)  BestKeeper (Pfaffl et al., Biotechnology Letters, 2004)  Equivalence test (Haller et al., Analytical Biochemistry, 2004)  ANOVA test (Brunner et al., BMC Plant Biology, 2004)  Normfinder (Andersen et al., Cancer Research, 2004)  Szabo et al., Genome Biology, 2004  Abruzzo et al., Biotechniques, 2005

normalization – geNorm

 Vandesompele, Kubista & Pfaffl

Reference gene validation software for improved normalization book chapter in “Real-time PCR: an essential guide”, Horizon Bioscience, 2nd edition (2009)

slide-46
SLIDE 46

normalization – effect of sample quality

 differences in reference gene ranking

(Perez-Novo et al., Biotechniques, 2005)

slide-47
SLIDE 47

normalization – quality control

 geNorm M value

 selection of the best set of reference genes  quality control of reference gene stability

tissue type gene CV (%) M mean CV% mean M neuroblastoma UBC 31.84 0.740 30.89 0.703 SDHA 27.40 0.660 HPRT1 37.11 0.736 GAPDH 27.21 0.675 fibroblast YHWAZ 18.19 0.408 14.81 0.365 HPRT1 8.84 0.308 GAPDH 17.40 0.378 leukocyte B2M 15.76 0.400 15.81 0.394 UBC 15.79 0.389 YWHAZ 15.89 0.393 bone marrow YWHAZ 17.77 0.383 15.47 0.372 UBC 13.60 0.356 RPL13A 15.03 0.376 normal pool TBP 47.51 1.099 43.73 0.925 HPRT1 46.99 0.988 HMBS 31.16 0.849 SDHA 49.50 0.869 GAPDH 43.50 0.819

sample panel

  • homogenous
  • CV: 25%
  • M: 0.5
  • heterogenous
  • CV: 50%
  • M: 1.0

Hellemans et al., Genome Biology, 2007

slide-48
SLIDE 48

 need for something new

 reference gene validation requires (extensive) experimental work  sometimes not possible (lack of sample material, funding, time or

devotion)

 there must be something better

 EAR normalization (Expressed Alu Repeat)

“using a repetitive sequence in the human transcriptome as a measure for the mRNA fraction”

normalization – EARs

slide-49
SLIDE 49

 rationale: repeat sequences are present in the UTR of many genes, and

the differential expression of a small number of genes won‟t influence the

  • verall repeat abundance in the transcriptome

normalization – EARs

slide-50
SLIDE 50

 Alu repeats

 by far the most abundant repeats in the human genome  1 million copies (10% of the genome), 31 subfamilies (well conserved)  short interspersed elements (SINE) replicating via retrotransposition  ~280 bp long, followed by a variable poly-A tail  no known biological function  implicated in human disease (unequal recombination)

normalization – EARs

slide-51
SLIDE 51

 extraction of all Alu repeat elements in the human genome

 UCSC genome browser table function

 database with repeat element info and gene structure information for all

human genes -> „expressed Alu repeats‟

 MySQL

 Alu subfamily sequence alignment

 PHP script „Alu FASTA generator‟  wEMBOSS clustalW alignment

 primer design  roughly 1500 human genes contain

  • ne or more Alu repeats

normalization – EARs

slide-52
SLIDE 52

normalization – EARs

ADAMTS4 (1q23.3) ADCY6 (12q13.12) AluSq AluSg

slide-53
SLIDE 53

 AluSx assay (AluSq | AluJ)

normalization – EARs

64, 16, 4 and 1 ng QPCR Reference Total RNA (Stratagene)

slide-54
SLIDE 54

 comparison of Alu repeat levels and NF based on 3 best reference genes

 Pearsons correlation 0.943 (p=0.0014)

normalization – EARs

0.5 1 1.5 2 2.5 3 3.5 4 CHP-134 CLB-GA IMR-32 QPCR Ref Total RNA SK-N-AS SK-N-FI SK-N-SH

AluSx NF3

slide-55
SLIDE 55

 MYCN expression levels normalized by Alu repeat or NF3

normalization – EARs

1 10 100 1000 10000 100000 CHP-134 CLB-GA IMR-32 NGP QPCR Ref Total RNA SK-N-AS SK-N-FI SK-N-SH MYCN normalised by AluSq MYCN normalised by NF3

slide-56
SLIDE 56

 3rd EMBO qPCR course, Heidelberg, 2006

normalization – EARs

slide-57
SLIDE 57

 simple and convenient normalization strategy for

 gene expression analysis (cDNA) (EAR normalization)  gene copy number quantification (DNA)

 no (extensive) experimental validation required  only limited sample amount required  strategy could be expanded to other expressed repeats in other organisms

normalization – EARs

slide-58
SLIDE 58

RDML: exchanging & publishing of qPCR data & results

RDML: http://www.RDML.org

slide-59
SLIDE 59

Conclusions

 perform quality controls on every step of the workflow  technical variation

 avoid  minimize  correct for

 use multiple validated reference genes  verify reference target stability in every experiment  consider normalization with EAR’s  software to help you

 RTprimerDB (assay design & database - http://medgen.ugent.be/rtprimerdb)  geNorm (reference gene validation - http://medgen.ugent.be/genorm)  qBasePlus (relative quantification & quality controls – http://www.qbaseplus.com)