Gene Expressions and Genomes 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Gene ¡Expressions ¡and ¡Genomes ¡ 02-‑223 ¡Personalized ¡Medicine: ¡ Understanding ¡Your ¡Own ¡Genome ¡ Fall ¡2014 ¡

Why ¡Gene ¡Expression? ¡ Genome-‑wide ¡associaGon ¡mapping ¡ DNA ¡sequence ¡ Disease ¡or ¡healthy? ¡ Molecular ¡ mechanism? ¡

Why ¡Gene ¡Expression ¡ • IdenGfying ¡the ¡geneGc ¡variants ¡that ¡confer ¡disease ¡risk ¡is ¡not ¡ enough ¡to ¡decipher ¡the ¡molecular ¡mechanisms ¡of ¡how ¡the ¡ geneGc ¡variaGon ¡influence ¡the ¡disease: ¡ – In ¡medicine ¡ ¡ • We ¡need ¡to ¡determine ¡which ¡biological ¡pathways ¡and ¡genes ¡are ¡ involved ¡in ¡the ¡disease ¡process. ¡ • The ¡idenGfied ¡pathways ¡and ¡genes ¡can ¡be ¡a ¡target ¡for ¡drug. ¡ – In ¡science ¡ • Determining ¡which ¡pathways ¡underlie ¡the ¡associaGon ¡between ¡the ¡ geneGc ¡variaGon ¡and ¡phenotype ¡can ¡provide ¡insights ¡on ¡the ¡ funcGon ¡of ¡genes. ¡

Microarrays ¡for ¡Molecular ¡Biology ¡ TranscripGon ¡factor ¡ Microarray ¡for ¡measuring ¡ DNA ¡ gene ¡expression ¡levels ¡ transcription mRNA ¡ translation Proteins ¡

Microarray ¡Hybridiza=on ¡ • Watson-‑Crick ¡base ¡pairing ¡of ¡complementary ¡DNA ¡sequences. ¡ • Microarrays ¡have ¡tens ¡of ¡thousands ¡of ¡spots, ¡each ¡represenGng ¡a ¡ piece ¡of ¡one ¡gene, ¡immobilized ¡on ¡a ¡glass ¡slide. ¡ • The ¡intensity ¡(or ¡intensity ¡raGo) ¡of ¡each ¡spot ¡indicates ¡the ¡amount ¡ of ¡labeled ¡cDNA ¡hybridized, ¡thus, ¡represenGng ¡the ¡starGng ¡mRNA ¡ transcript ¡abundance. ¡

Hybridization and Scanning— cDNA arrays � - Prepare Cy3, Cy5- � labeled ss cDNA � - Scan � - Hybridize 600 ng of � labeled ss cDNA to � glass slide array �

Individuals ¡ What ¡is ¡gene ¡ baseline ¡ expression? ¡ expression ¡ Expression ¡= ¡acGvity ¡ 0 10 ¡ 20 ¡ 70 ¡ 80 ¡ gene ¡1 ¡ level ¡of ¡gene ¡in ¡ experiment ¡ genes ¡ Higher ¡ Lower ¡ expression ¡ expression ¡ compared ¡to ¡ compared ¡to ¡ baseline ¡ baseline ¡

Hierarchical ¡Clustering ¡ • Probably ¡the ¡most ¡popular ¡clustering ¡algorithm ¡in ¡ computaGonal ¡biology ¡ • AgglomeraGve ¡( bo^om-‑up) ¡ • Algorithm: ¡ 1. IniGalize: ¡each ¡item ¡a ¡cluster ¡ 2. Iterate: ¡ • select ¡two ¡most ¡ similar ¡clusters ¡ • merge ¡them ¡ 3. ¡ ¡ ¡Halt: ¡when ¡there ¡is ¡only ¡one ¡cluster ¡le_ ¡ dendrogram

Similarity ¡Criterion: ¡Single ¡Linkage ¡ • cluster ¡similarity ¡= ¡similarity ¡of ¡two ¡most ¡similar ¡ members ¡ - Potentially long and skinny clusters

In ¡most ¡cases ¡(1-‑ r 2 ), ¡ where ¡ r 2 ¡is ¡the ¡correlaGon ¡ coefficient, ¡is ¡used ¡as ¡ Example: ¡Single ¡Linkage ¡ similarity ¡measure ¡ between ¡samples ¡ 5 4 3 2 1

In ¡most ¡cases ¡(1-‑ r 2 ), ¡ where ¡ r 2 ¡is ¡the ¡correlaGon ¡ coefficient, ¡is ¡used ¡as ¡ Example: ¡Single ¡Linkage ¡ similarity ¡measure ¡ between ¡samples ¡ (1,2) 3 4 5 (1,2) ⎡ ⎤ ⎢ ⎥ 3 ⎢ ⎥ 4 ⎢ ⎥ ⎢ ⎥ 5 ⎣ ⎦ 5 4 3 2 1

In ¡most ¡cases ¡(1-‑ r 2 ), ¡ where ¡ r 2 ¡is ¡the ¡correlaGon ¡ coefficient, ¡is ¡used ¡as ¡ Example: ¡Single ¡Linkage ¡ similarity ¡measure ¡ between ¡samples ¡ 5 4 3 2 1

Example: ¡Single ¡Linkage ¡ 5 4 3 2 1

Similarity ¡Criterion: ¡Complete ¡Linkage ¡ • cluster ¡similarity ¡= ¡similarity ¡of ¡two ¡least ¡similar ¡ members ¡ + tight clusters

Similarity ¡Criterion: ¡Average ¡Linkage ¡ • cluster ¡similarity ¡= ¡average ¡similarity ¡of ¡all ¡pairs ¡ the ¡most ¡widely ¡used ¡ similarity ¡measure ¡ Robust ¡against ¡noise ¡

But ¡What ¡Are ¡the ¡Clusters? ¡ In ¡some ¡cases ¡we ¡can ¡determine ¡the ¡“correct” ¡number ¡of ¡clusters. ¡However, ¡things ¡are ¡rarely ¡ this ¡clear ¡cut, ¡unfortunately. ¡

• Nonhierarchical, ¡each ¡object ¡is ¡placed ¡in ¡exactly ¡one ¡of ¡K ¡non-‑ overlapping ¡clusters. ¡ • the ¡user ¡has ¡to ¡specify ¡the ¡desired ¡number ¡of ¡clusters ¡K. ¡ • In ¡hierarchical ¡clustering, ¡we ¡use ¡similarity ¡measures ¡between ¡ two ¡observed ¡samples, ¡whereas ¡in ¡K-‑means ¡clustering, ¡we ¡use ¡ the ¡similarity ¡measures ¡between ¡an ¡observed ¡sample ¡and ¡the ¡ cluster ¡center ¡(mean). ¡ ¡

Example: ¡Clustering ¡Genes ¡ • Clustering ¡genes ¡helps ¡determine ¡ new ¡funcGons ¡for ¡unknown ¡genes ¡ • Applying ¡hierarchical ¡clustering ¡ algorithm ¡to ¡gene ¡expression ¡data ¡ was ¡an ¡early ¡“killer ¡applicaGon” ¡in ¡ this ¡area ¡

Gene ¡Expression ¡Data ¡and ¡Personalized ¡ Medicine ¡ (Golub ¡et ¡al., ¡Science, ¡1999) ¡ • One ¡of ¡the ¡earliest ¡work ¡that ¡demonstrated ¡the ¡ feasibility ¡of ¡using ¡only ¡microarray ¡gene ¡expression ¡data ¡ to ¡determine ¡cancer ¡subtypes ¡for ¡paGents ¡ • A ¡staGsGcal ¡model ¡was ¡learned ¡to ¡predict ¡the ¡labels ¡for ¡ acute ¡myeloid ¡leukemia ¡(ALL) ¡and ¡acute ¡lymphoblasGc ¡ leukemia ¡ ¡(AML) ¡for ¡each ¡paGent ¡given ¡gene ¡expression ¡ data ¡ – ¡Dataset ¡used ¡to ¡learn ¡the ¡model ¡consisted ¡of ¡27 ¡ALL ¡and ¡11 ¡ AML ¡paGents ¡ – Tested ¡the ¡learned ¡model ¡on ¡20 ¡ALL ¡and ¡14 ¡AML ¡paGents ¡and ¡ 29 ¡out ¡of ¡34 ¡paGents ¡were ¡predicted ¡to ¡have ¡correct ¡cancer ¡ subtypes ¡

Gene ¡Expression ¡Signature ¡Can ¡Dis=nguish ¡ Cancer ¡Types ¡ PaGents ¡ Genes ¡that ¡are ¡informaGve ¡for ¡predicGng ¡ cancer ¡types ¡

FDA ¡Approves ¡Gene-‑Based ¡Breast ¡Cancer ¡ Test* ¡ “MammaPrint ¡is ¡a ¡DNA ¡ microarray-‑based ¡test ¡that ¡ measures ¡the ¡acGvity ¡of ¡70 ¡ genes... ¡The ¡test ¡measures ¡each ¡ of ¡these ¡genes ¡in ¡a ¡sample ¡of ¡a ¡ woman's ¡breast-‑cancer ¡tumor ¡ and ¡then ¡uses ¡a ¡specific ¡formula ¡ to ¡determine ¡whether ¡the ¡ paGent ¡is ¡deemed ¡low ¡risk ¡or ¡ high ¡risk ¡for ¡the ¡spread ¡of ¡the ¡ cancer ¡to ¡another ¡site.” ¡

Learning ¡Bayesian ¡Networks ¡ • Probability ¡distribuGon ¡over ¡directed ¡graph ¡ ¡ – Model ¡data ¡distribuGon ¡in ¡populaGon ¡ Data – CondiGonal ¡probability ¡distribuGon ¡(CPD) ¡for ¡ each ¡variable/node ¡condiGonal ¡on ¡its ¡parent ¡ nodes ¡ – ProbabilisGc ¡inference: ¡ • PredicGon ¡ • ClassificaGon ¡ MSFT ¡ • Dependency ¡structure ¡ INTL ¡ NVLS ¡ – InteracGons ¡between ¡variables ¡ – Causality ¡ n ∏ P( x 1 ,..., x n ) = P( x i | x i + 1 ,..., x n ) MOT ¡ – ScienGfic ¡discovery ¡ i = 1 n ∏ = P( x i | Pa( x i )) i = 1 Slides ¡from ¡the ¡presentaGon ¡by ¡Segal ¡et ¡al. ¡UAI03 ¡

The ¡Module ¡Network ¡Idea ¡ Bayesian Network Module Network CPD 1 CPD 1 MSFT ¡ MSFT ¡ Module I CPD 2 CPD 2 CPD 3 MOT ¡ MOT ¡ CPD 4 DELL ¡ INTL ¡ DELL ¡ INTL ¡ Module II CPD 6 CPD 5 CPD 3 AMAT ¡ AMAT ¡ HPQ ¡ HPQ ¡ Module III Slides ¡from ¡the ¡presentaGon ¡by ¡Segal ¡et ¡al. ¡UAI03 ¡

• Applying ¡module ¡ network ¡to ¡2355 ¡genes ¡ in ¡the ¡173 ¡arrays ¡of ¡the ¡ yeast ¡stress ¡data ¡set ¡

Gene Expressions and Genomes 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Gene Expressions and Genomes 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Why Gene Expression? Genome-wide associaGon mapping

Genomes for LIfe Cohort study of Genomes

Chapter 7 Expressions and Statements Expressions Arithmetic Expressions Conditional

Regular Expressions (REs) Regular Expressions (REs) p.1/37 Expressions In arithmetic:

The 1000 genomes project The 1000 genomes project Genetic variation > 1% 1000 2500

Eukaryotic Gene Eukaryotic Gene Prediction Prediction Eukaryotic gene structure Eukaryotic

Fem Poble(s): Expressions Meritxell (Txell) Martn Pardo, Ph.D Research associate Data

Gene Finding Strategies to find gene structures on the web Swiss Institute of Bioinformatics

Staphylococcus aureus Pathogenesis - Gene exchanges - Gene regulation - Gene products - Gene

Working with gene features and genomes Typical workflow when working with sequence data (e.g.,

Algorithms in Bioinformatics: A Practical Introduction Genome Alignment Complete genomes

Gene Expression Data Introduction to gene expression data Expression data storage concept An

genomes Ekaterina Shelest 09.03.2018 Gttingen Part 1. Gene clusters and their discovery From

Regexp Lecture 26: Regular Expressions Regular Expressions Regular expressions are a small

Mat 2170 Week 3 Chapter Three Java Expressions Variable Declarations Java Expressions

61A Lecture 6 Friday, September 7 Lambda Expressions 2 Lambda Expressions >>> ten =

Objectives You should be able to ... Regular Languages Use the syntax of regular expressions

Developing Students as Systems Thinkers K R I S T I A N G O M E S K A T R I N E R O S E D A M

Presenters: Dr. Andrea Frolic and Diana Tikasz Moderated by: Ash Couillard Presenter: Jane Hastie,

Hidden No More: Moving from Shame to Wholehearted Living Kate Thieda , MS, LPC, NCC

Schwartz Rounds: Origins, early development and implementation in the US and UK Schwartz

Introduction to Programming for BioInformatics P. Takis Metaxas Computer Science Department

He who asks is a fool for five CSE427 minutes, but he who does not Computational Biology ask

The Genetic Code, the Golden Section and Genetic Music

Disclosures Complementary and Integrative Therapies Consultant - Janssen Consultant -

Sambuz

Useful Links

Newsletter

Mail Us

Gene Expressions and Genomes 02-223 Personalized Medicine: - PowerPoint PPT Presentation

Gene Expressions and Genomes 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Why Gene Expression? Genome-wide associaGon mapping

Genomes for LIfe Cohort study of Genomes

Chapter 7 Expressions and Statements Expressions Arithmetic Expressions Conditional

Regular Expressions (REs) Regular Expressions (REs) p.1/37 Expressions In arithmetic:

The 1000 genomes project The 1000 genomes project Genetic variation &gt; 1% 1000 2500

Eukaryotic Gene Eukaryotic Gene Prediction Prediction Eukaryotic gene structure Eukaryotic

Fem Poble(s): Expressions Meritxell (Txell) Martn Pardo, Ph.D Research associate Data

Gene Finding Strategies to find gene structures on the web Swiss Institute of Bioinformatics

Staphylococcus aureus Pathogenesis - Gene exchanges - Gene regulation - Gene products - Gene

Working with gene features and genomes Typical workflow when working with sequence data (e.g.,

Algorithms in Bioinformatics: A Practical Introduction Genome Alignment Complete genomes

Gene Expression Data Introduction to gene expression data Expression data storage concept An

genomes Ekaterina Shelest 09.03.2018 Gttingen Part 1. Gene clusters and their discovery From

Regexp Lecture 26: Regular Expressions Regular Expressions Regular expressions are a small

Mat 2170 Week 3 Chapter Three Java Expressions Variable Declarations Java Expressions

61A Lecture 6 Friday, September 7 Lambda Expressions 2 Lambda Expressions &gt;&gt;&gt; ten =

Objectives You should be able to ... Regular Languages Use the syntax of regular expressions

Developing Students as Systems Thinkers K R I S T I A N G O M E S K A T R I N E R O S E D A M

Presenters: Dr. Andrea Frolic and Diana Tikasz Moderated by: Ash Couillard Presenter: Jane Hastie,

Hidden No More: Moving from Shame to Wholehearted Living Kate Thieda , MS, LPC, NCC

Schwartz Rounds: Origins, early development and implementation in the US and UK Schwartz

Introduction to Programming for BioInformatics P. Takis Metaxas Computer Science Department

He who asks is a fool for five CSE427 minutes, but he who does not Computational Biology ask

The Genetic Code, the Golden Section and Genetic Music

Disclosures Complementary and Integrative Therapies Consultant - Janssen Consultant -

Sambuz

Useful Links

Newsletter

Mail Us

The 1000 genomes project The 1000 genomes project Genetic variation > 1% 1000 2500

61A Lecture 6 Friday, September 7 Lambda Expressions 2 Lambda Expressions >>> ten =