 
From reference genes to global mean normalization
Jo Vandesompele, professor, Ghent University; co-founder and CEO, Biogazelle
qPCR Symposium USA, November 9, 2009, Millbrae, CA
outline
• what is normalization
• gold standard for mRNA normalization
• global mean normalization and selection of stable small RNAs for microRNA normalization
introduction to normalization
• 2 sources of variation in gene expression results
  o biological variation (true fold changes)
  o experimentally induced variation (noise and bias)
• purpose of normalization is reduction of the experimental variation
  o input quantity: RNA quantity, cDNA synthesis efficiency, …
  o input quality: RNA integrity, RNA purity, …
• gold standard is the use of multiple stably expressed reference genes
  o which genes?
  o how many?
  o how to do the calculations?
normalization: geNorm solution
• framework for qPCR gene expression normalisation using the reference gene concept:
  o quantified errors related to the use of a single reference gene (> 3 fold in 25% of the cases; > 6 fold in 10% of the cases)
  o developed a robust algorithm for assessment of expression stability of candidate reference genes
  o proposed the geometric mean of at least 3 reference genes for accurate and reliable normalisation
• Vandesompele et al., Genome Biology, 2002
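The geometric-mean step can be sketched in a few lines of Python. This is a minimal illustration, not the geNorm software itself: the function names and the example quantities are made up, and the inputs are assumed to be relative quantities on a linear scale (already converted from Cq values).

```python
import math

def normalization_factor(ref_quantities):
    """Geometric mean of the reference gene quantities for one sample."""
    if not ref_quantities or any(q <= 0 for q in ref_quantities):
        raise ValueError("need at least one strictly positive quantity")
    log_sum = sum(math.log(q) for q in ref_quantities)
    return math.exp(log_sum / len(ref_quantities))

def normalize(target_quantity, ref_quantities):
    """Divide the target quantity by the sample's normalization factor."""
    return target_quantity / normalization_factor(ref_quantities)

# Hypothetical sample: one target gene and three reference genes.
normalized = normalize(8.0, [1.0, 2.0, 4.0])
print(round(normalized, 6))
```

The geometric mean is used rather than the arithmetic mean so that a single highly abundant reference gene does not dominate the factor.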
geNorm software
• automated analysis
• ranking of candidate reference genes according to their stability
• determination of how many genes are required for reliable normalization
• http://medgen.ugent.be/genorm
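The stability ranking can be illustrated with a short Python sketch of a geNorm-style M value: for each candidate gene, the average standard deviation of its log2 expression ratios against every other candidate. It covers only a single ranking pass, not the stepwise exclusion or the "how many genes" analysis of the software. The function name, gene names and quantities are hypothetical.

```python
import math
import statistics

def m_values(quantities):
    """quantities: dict gene -> list of relative quantities, one per sample.

    Returns dict gene -> M value (lower means more stably expressed).
    Needs at least two genes and at least two samples.
    """
    result = {}
    for gene_j, values_j in quantities.items():
        pairwise_sd = []
        for gene_k, values_k in quantities.items():
            if gene_k == gene_j:
                continue
            log_ratios = [math.log2(a / b) for a, b in zip(values_j, values_k)]
            pairwise_sd.append(statistics.stdev(log_ratios))
        result[gene_j] = sum(pairwise_sd) / len(pairwise_sd)
    return result

# Made-up data: refA and refB keep a constant ratio, refC fluctuates.
example = {
    "refA": [1.0, 2.0, 4.0, 8.0],
    "refB": [2.0, 4.0, 8.0, 16.0],
    "refC": [1.0, 4.0, 2.0, 16.0],
}
m = m_values(example)
ranking = sorted(m, key=m.get)  # most stable first
print(ranking)
```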
geNorm validation (I)
• cancer patients survival curve: statistically more significant results
• log rank statistics
  o NF4: 0.003
  o NF1: 0.006, 0.021, 0.023, 0.056
• Hoebeeck et al., Int J Cancer, 2006
geNorm validation (II)
• mRNA haploinsufficiency measurements: accurate assessment of small expression differences
• patient / control
• 3 independent experiments
• 95% confidence intervals
• Hellemans et al., Nature Genetics, 2004
normalization using multiple stable reference genes
• geNorm is the de facto standard for reference gene validation and normalization
  o > 2,000 citations of our geNorm technology
  o > 10,000 geNorm software downloads in 100 countries
global mean normalization
• when a large set of genes is measured, the average expression level reflects the input amount and could be used for normalization
• e.g. microarray based normalization
  o lowess, mean ratio, …
• SAGE / NGS sequencing counts
• the set of genes must be unbiased and sufficiently large
• we make use of this principle to normalize microRNA data from experiments in which we quantify a substantial number of miRNAs (450 or 650) in a given sample
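A minimal Python sketch of the principle, working on Cq values for one sample: subtract the mean Cq of all expressed miRNAs from each miRNA's Cq. The function name, the miRNA names and the Cq values are made up, and the detection cutoff of Cq 35 is an assumption of this sketch, not something stated on the slide.

```python
def global_mean_normalize(cq_by_mirna, cq_cutoff=35.0):
    """cq_by_mirna: dict miRNA -> Cq value for a single sample.

    Returns dict miRNA -> Cq minus the sample's mean Cq, for expressed
    miRNAs only (Cq below the assumed cutoff).
    """
    expressed = {name: cq for name, cq in cq_by_mirna.items() if cq < cq_cutoff}
    if not expressed:
        raise ValueError("no miRNA passes the detection cutoff")
    mean_cq = sum(expressed.values()) / len(expressed)
    return {name: cq - mean_cq for name, cq in expressed.items()}

# Hypothetical samples: sample_b has the same profile as sample_a but
# less input material, so every Cq is shifted by 2 cycles.
sample_a = {"miR-a": 20.0, "miR-b": 24.0, "miR-c": 28.0, "miR-d": 38.0}
sample_b = {"miR-a": 22.0, "miR-b": 26.0, "miR-c": 30.0, "miR-d": 40.0}

norm_a = global_mean_normalize(sample_a)
norm_b = global_mean_normalize(sample_b)
print(norm_a == norm_b)  # the input-amount shift is removed
```

Because Cq is a log-scale measure, averaging Cq values corresponds to taking a geometric mean of the underlying quantities.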
global mean normalization
• small-RNA controls
  o classic normalization strategy
  o small nuclear RNAs, small nucleolar RNAs
  o 18 available from Applied Biosystems
• global mean normalization
  o method applied for microarray data
  o universal: applicable for every miRNA dataset
  o many datapoints needed (megaplex vs. multiplex)
• miRNAs/controls that resemble the mean
  o minimal standard deviation when comparing miRNA expression with mean (geNorm V value, standard deviation of log transformed ratios)
  o compatible with multiplex assays
  o need to determine mean
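Selecting the miRNAs or controls that resemble the mean can be sketched as follows, assuming Cq values for every candidate across the same set of samples. For each candidate, the sketch takes the standard deviation across samples of (Cq minus the sample's mean Cq), which is a standard deviation of log-transformed ratios. The function name, candidate names and values are hypothetical.

```python
import statistics

def resemblance_to_mean(cq):
    """cq: dict name -> list of Cq values, one per sample (same order).

    Returns dict name -> standard deviation across samples of the
    difference between that candidate's Cq and the sample mean Cq.
    Lower values mean the candidate tracks the global mean more closely.
    """
    n_samples = len(next(iter(cq.values())))
    sample_means = [
        sum(values[s] for values in cq.values()) / len(cq)
        for s in range(n_samples)
    ]
    return {
        name: statistics.stdev([v - m for v, m in zip(values, sample_means)])
        for name, values in cq.items()
    }

# Made-up Cq values for four candidates in three samples.
candidates = {
    "cand-1": [20.0, 21.0, 22.0],
    "cand-2": [24.0, 25.0, 26.0],
    "cand-3": [28.0, 30.0, 27.0],
    "cand-4": [24.0, 24.0, 29.0],
}
spread = resemblance_to_mean(candidates)
best_first = sorted(spread, key=spread.get)
print(best_first[:2])
```

The top-ranked candidates could then be measured in a smaller multiplex experiment, where the global mean itself is not available.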
small RNA controls
• How ‘stable’ is the global mean compared to controls?
• geNorm analysis using controls and mean as input variables
• exclusion of potentially co-regulated controls

control   chromosomal location
HY3       7q36
RNU19     5q31.2
RNU24     9q34
RNU38B    1p34.1-p32
RNU43     22q13
RNU44     1q25.1
RNU48     6p21.32
RNU49     17p11.2
RNU58A    18q21
RNU58B    18q21
RNU66     1p22.1
RNU6B     10p13
U18       15q22
U47       1q25.1
U54       8q12
U75       1q25.1
Z30       17q12
RPL21     13q12.2
miRNA expression datasets
• neuroblastoma tumour samples
• T-ALL samples
• EVI1 deregulated leukemias
• retinoblastoma tumour samples
• normal tissues
• normal bone marrow
T-ALL geNorm ranking
[figure: geNorm ranking for the T-ALL samples; y-axis: expression stability]
geNorm ranking
[figure panels: neuroblastoma; leukemia EVI1 overexpression; bone marrow pool; normal tissues]
neuroblastoma: removal of variation
[figure; legend: not normalised, stable controls, mean, miRNAs]
removal of variation
[figure panels: leukemia EVI1 overexpression; T-ALL; bone marrow pool; normal tissues]
biological validation
• MYCN binds to the mir-17-92 promoter
[figure: map of the mir-17-92 cluster from -5 kb to +5 kb showing CATGTG and CACGTG motifs, the CpG island and amplicons A, B and C; bar chart of fold enrichment per amplicon for IMR5 and WAC2]
biological validation
• choice of normalization strategy influences differential miRNA expression
[figure: mir-17-92 expression in neuroblastoma tumours; legend: stable controls, mean, miRNAs]
balanced differential expression
[figure: fold change (MYCN amplified vs. MYCN single copy); legend: controls, mean]
• average FC controls = -0.404
• average FC mean = 0.050
• average FC miRNAs = 0.124
correlation MYCN downregulated genes: 2 normalization strategies
[figure panels: stable miRNA control normalisation; mean normalisation]
strategy also works for microarray data
• each sample is measured by RT-qPCR and microarray
• global mean normalization
• standardization per method
• hierarchical clustering
• samples cluster by sample (and NOT by method)
conclusions: global mean normalization
• novel and powerful miRNA normalization strategy
• maximal reduction of technical noise
• improved identification of differentially expressed genes
• balancing of differential expression
• universally applicable
  o global mean
  o multiple stable endogenous controls
• Mestdagh et al., Genome Biology, 2009
qbase PLUS normalization
• most powerful, flexible and user-friendly real-time PCR data-analysis software
• based on Ghent University’s geNorm and qBase technology
• state of the art normalization procedures
  o one or more classic reference genes
  o global mean normalization
  o expressed repeat normalization
• detection and correction of inter-run variation
• dedicated error propagation
• fully automated analysis; no manual interaction required
• booth 19
• http://www.qbaseplus.com
conclusions
• proper normalization has a major impact on your results
  o provides statistically more significant results
  o enables accurate assessment of small expression differences
• gold standard for mRNA gene expression analysis
  o geNorm evaluation of candidate reference genes
  o geometric mean of multiple stably expressed reference genes
• global mean normalization and subsequent geNorm based selection of reference genes that resemble the mean is a valid option when measuring a large and unbiased set of genes (e.g. all miRNAs)
acknowledgments
• miRNA
  o Pieter Mestdagh (UGent)
  o Frank Speleman (UGent)
  o Applied Biosystems
• qbase PLUS
  o Jan Hellemans (Biogazelle – UGent)
  o Stefaan Derveaux (Biogazelle – UGent)
 January 28-29, 2010 Ghent, Belgium  www.advances-in-genomics.org