European Networking Summer School (ENSS) Plant Genomics & - - PowerPoint PPT Presentation

european networking summer school enss plant genomics
SMART_READER_LITE
LIVE PREVIEW

European Networking Summer School (ENSS) Plant Genomics & - - PowerPoint PPT Presentation

Standing Committee for Life, Earth and Environmental Sciences (LESC) European Networking Summer School (ENSS) Plant Genomics & Bioinformatics 28.7.2009 Supported by: Austria Fonds zur Frderung der wissenschaftlichen Forschung (FWF)


slide-1
SLIDE 1

28.7.2009

European Networking Summer School (ENSS) Plant Genomics & Bioinformatics

Standing Committee for Life, Earth and Environmental Sciences (LESC)

slide-2
SLIDE 2

28.7.2009

Supported by:

Austria Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Belgium: Fonds voor Wetenschappelijk Onderzoek (FWO) Finland: Academy of Finland - Research Council fo Biosciences and Environment Ireland: Irish Research Council for Science Engineering and Technology (IRCSET) Italy: Consiglio Nazionale delle Ricerche (CNR) - Dipartimento Agroalimentare Netherlands: Nederlandse Wetenschappelijk voon Onderzoek (NWO) Norway: The Research Council of Norway Poland: The Polish Academy of Science Romania: Ministry of Education and Research United Kingdom: Biotechnology and Biological Sciences Research Council (BBSRC)

slide-3
SLIDE 3

28.7.2009

AIMS

  • Support plant genome research networks based by

training of young investigators

  • Summer courses with theoretic and practical training
  • Access to technologies, resources, skills and know-how
slide-4
SLIDE 4

28.7.2009

ENSS 2009

Plant Bioinformatics, Systems and Synthetic Biology

27-31 July 2009 University of Nottingham, UK Natalio Krasnogor, Jaume Bacardit, Malcolm Bennett

ENSS 2010

Plant Epigenetics

September 2010 Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) in Gatersleben, Germany Michael Florian Mette

slide-5
SLIDE 5

28.7.2009

Comparative and Functional Genomics

Functional genomics is the understanding of the function of genes and other parts of the genome Comparative genomics involves the use of computer programs to line up multiple genomes/genes for the identification of similarities

slide-6
SLIDE 6

28.7.2009

What is needed to do comparative and functional genomics?

model organism

Why are model organisms important? Criteria for a good model organism? Relationship of the model to important crop plants? How many genes are the same? Why using knock out/down mutants? How will they help us determine gene function?

slide-7
SLIDE 7

28.7.2009

What will you hear?

Background on annotating gene function using comparative genomic tools Example for the use comparing genomes/genes from individuals between populations to determine their function Example to show how these tools can be employed to get a glimps on the function of a yet unknown gene

slide-8
SLIDE 8

28.7.2009

All Starts With Genome Sequencing Projects

http://www.ncbi.nlm.nih.gov/genomes/leuks.cgi http://www.ensembl.org/info/about/species.html How many plant genomes have been sequenced?

slide-9
SLIDE 9

28.7.2009

Plant Genome Sequencing Projects

slide-10
SLIDE 10

28.7.2009

Improvements in the rate of DNA sequencing over the past 30 years and into the future

Stratton et al., 2009

slide-11
SLIDE 11

28.7.2009

Quote from Joe Ecker IARC2009

Capillary sequencing – 500 people – 7 years – 70.000.000 $ Perlegen sequencing – 50 people – 1 year – 70.000 $ Next generation sequencing – 2 people – 7 days – 7.000 $ - 50 x coverage

slide-12
SLIDE 12

28.7.2009

Paradigm Change

slide-13
SLIDE 13

28.7.2009

After genome sequencing still many questions remain – example Arabidopsis

MASC 2007 MASC 2009

slide-14
SLIDE 14

28.7.2009

Sequenced genomes – the basis to address questions on

Function of all genes Role of single nucleotide polymorphisms (SNPs, natural variation) Where? - Localization (organs, tissues, cellular, sub-cellular) Functional redundancies/diversification of gene families Role of noncoding regions and repeats in the genome Biological role(s) When? – Regulation (transcriptional, post-transcriptional, post-translational,..) Interacting partners - Networks Role of alternative splicing variants

slide-15
SLIDE 15

28.7.2009

Functional Genomic Tools

Sequences genome, full-length cDNA clones Gene knock-outs, knock-downs (T-DNA, transposon, amiRNA, tilling, gene targeting, collection of natural variants, ....) Methods for studying functions of nonprotein-coding sequences Comprehensive analysis of gene expression (microarray, deep sequencing, cell sorting, laser dissection, reporter constructs, … ) Large-scale protein analyses (proteomics, protein arrays, large scale Y2H, interactomes-networks, 3D structures) Metabolomics

  • omics
slide-16
SLIDE 16

28.7.2009

Comparative genomics between species

comparison of genomes from different taxa

slide-17
SLIDE 17

28.7.2009

Comparative genomics within a species

comparison of genomes from different individuals between populations that might be differentially adapted to particular environments ….

Weigel Lab

slide-18
SLIDE 18

28.7.2009

What to compare?

  • n the structural level:

sequence similarity (nucleic acid, protein, domains) gene location (synteny) gene structure (length, number of exons) amount of noncoding DNA highly conserved regions (fundamental/essential genes) highly/less polymorphic regions (indication of adaptation, selection)

  • n the functional level:

expression pattern epigenetic regulation post-transcriptional translational regulation subcellular localization interactions post-translational regulation/modification

slide-19
SLIDE 19

28.7.2009

How to compare?

Search tools for homologies: BLAST FASTA

slide-20
SLIDE 20

28.7.2009

How to compare?

http://www.expasy.org/

slide-21
SLIDE 21

28.7.2009

slide-22
SLIDE 22

28.7.2009

slide-23
SLIDE 23

28.7.2009

slide-24
SLIDE 24

28.7.2009

Example

  • 1. BLAST:

Proteome analysis in Poplar result a peptide of MILSALLTSVGINLGLC UniGene

  • 2. UniGene
slide-25
SLIDE 25

28.7.2009

G – Entrez Gene

slide-26
SLIDE 26

28.7.2009

G – Entrez Gene

slide-27
SLIDE 27

28.7.2009

Domains – Function - Localization?

slide-28
SLIDE 28

28.7.2009

Example

  • 1. BLAST:

Proteome analysis in Poplar result a peptide of MILSALLTSVGINLGLC

slide-29
SLIDE 29

28.7.2009

G – Entrez Gene

slide-30
SLIDE 30

28.7.2009

G – Entrez Gene

slide-31
SLIDE 31

28.7.2009

alternatively spliced

slide-32
SLIDE 32

28.7.2009

GBrowser

gene family 13 members

slide-33
SLIDE 33

28.7.2009

Synteny Search

Can the annotation of one member of the gene family in any plant species guide to the function?

slide-34
SLIDE 34

28.7.2009

Search for HYP1

slide-35
SLIDE 35

28.7.2009

Common Function?

slide-36
SLIDE 36

28.7.2009

Other Synteny Tools

http://chibba.agtec.uga.edu/duplication/

slide-37
SLIDE 37

28.7.2009

Plant Genome Duplication Database (PGDD)

http://chibba.agtec.uga.edu/duplication/

slide-38
SLIDE 38

28.7.2009

Intra-genome Dotblot analysis at PGDD

duplication of chr 3 and chr 2

non-synonymous substitution (Ka) synonyoumous substitution (Ks)

slide-39
SLIDE 39

28.7.2009

Cross-genome Dotblot analysis at PGDD

slide-40
SLIDE 40

28.7.2009

PGDD

microsynteny

Vitis vinifera Medicago trunculata

slide-41
SLIDE 41

28.7.2009

Back to TAIR

slide-42
SLIDE 42

28.7.2009

SUBA-Database

slide-43
SLIDE 43

28.7.2009

novel putative function HYP1 http://aramemnon.botanik.uni-koeln.de/ 15 related proteins

slide-44
SLIDE 44

28.7.2009

What is AtGFS10?

http://www.ncbi.nlm.nih.gov/sites/gquery

slide-45
SLIDE 45

28.7.2009

Topology Prediction

slide-46
SLIDE 46

28.7.2009

Strongly predicted to be in the secretroy pathway

slide-47
SLIDE 47

28.7.2009

Expression Analyses

Poplar homologs

http://bbc.botany.utoronto.ca/efp/ cgi-bin/efpWeb.cgi

slide-48
SLIDE 48

28.7.2009

highest in roots and young leaves

Expression Analyses

highest in male catkins highest in leaves

slide-49
SLIDE 49

28.7.2009

Upon water stress and day/night cycles

slide-50
SLIDE 50

28.7.2009

co-expressed gene in Arabidopsis

slide-51
SLIDE 51

28.7.2009

Expression of the Arabidopsis homolog upon

  • smotic stress
slide-52
SLIDE 52

28.7.2009

Still no entry in Proteins Wiki

http://proteins.wikia.com/wiki/

slide-53
SLIDE 53

28.7.2009

Summary Peptide/Protein Annotation MILSALLTSVGINLGLC

  • belongs to gene family –
  • members HYP1 (hypothetical protein 1)
  • RXW8 (name of cDNA)
  • ERD4 (early responsive to dehydration)
  • AtGFS10 (protein involved in vacuolar sorting fo storage proteins,

green fluorescent seed, gfs mutant)

Quote: „no closely related homologs of GFS10 in the Arabidopsis genome“ but topology similar – Aramemnon and 39% amino acid homology

  • integral membrane protein
  • plasma membrane, secretory pathway
  • induced during water stress

wild-type

secrete vacuole-targeted GFP out of the seed cells

slide-54
SLIDE 54

28.7.2009

What next?

Poplar gene with homolog in Arabidopsis (81%) First functional tests in Arabidopsis: knock outs

  • ver-/ectopic-/ inducible expression

in vivo localization – XFP, immuno interaction with other proteins Design experiments for functional characterization

slide-55
SLIDE 55

28.7.2009

QTL-Analysis or Association Mapping

slide-56
SLIDE 56

28.7.2009

Understanding functional consequences of natural variation: trichome patterning in Arabidopsis

Example for the comparison of genomes/gene and their function from individuals between populations

Julia Hilscher, Christian Schlötterer, Marie-Theres Hauser

slide-57
SLIDE 57

28.7.2009

Trichomes in Arabidopsis

  • Single cell structure
  • Present on leaves, stem, petioles, sepals
  • 32C->polyploid
  • Model for cell fate specification

A B D C

slide-58
SLIDE 58

28.7.2009

Trichome function

  • Arabidopsis

– Protection against herbivory Mauricio & Rausher (1997), Handley, Ekbom & Ågren (2005)

  • A. lyrata

– Protection against herbivory Kivimäki, Kärkkäinen, Gaudeul, Løe & Ågren (2007)

  • Other plants

– Decrease of water loss – Increased light reflection – Freezing tolerance – Ca++ homeostasis – Heavy metal storage – Metabolite production and storage – Cotton fiber development

slide-59
SLIDE 59

28.7.2009

Cross-species function of Trichome regulators

Cotton fiber development Trichome development of Arabidopsis

35S::GaMYB2 in Arabidopsis

„hairs on seeds“

slide-60
SLIDE 60

28.7.2009

Trichome density differs in natural populations

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 Gr-1 Blh-1 Bur-0/1 Ba-1 Buckhorn Pass Ler Nok-0 Ga-0 Lz-0 Wt-5 Oy-0 Di-0 Van-0 Brest-1 Ct-1/4 No-0 An-1 Wa-1/3 83-3 Tsu-0 Kin-0 Pi-0 Pa-1/1 T-20-I Ag-0/4 T-12-I I-33 N7 Lip-0/1 RLD-1 Stoc 5 T-29-I Mh-0 Mt-0 Ge-0 Tscha-1 Hi-0 Kara7-2 Sf-2 Pa-3 Dül/4 Nd-0 Est-1 Yo-0 St-0 Col Aa-0 Sha Grivo-1 Bay Uk-3 Es-0 9481 Te-0 Ws-0 Tein-1 Bla-1 Kas-1 Ms-0 Ryb 2 Mas/2 Ita-0 Cond Per -1/5 Lim 3 84-1 Alc-0/1 Tsar/1 Edi-0/2 Gy-0 Pog-0/2 Bas-2/3 20-13 Rub-1/2 Can-0

Can-0 Gr-1

1.5 mm

slide-61
SLIDE 61

28.7.2009

QTL mapping

x F1x F1 F2

slide-62
SLIDE 62

28.7.2009

Composite Interval Mapping

  • 266 F2 individuals
  • 24 microsatellites
  • A single QTL on

chromosome 2 explains 33% of the variation in trichome number

20 40 60 80 100 120 10 20 30 40 50 60

LR0 nga63 AthZFPG T27K12-Sp6 T2K10

20 40 60 80 100 120 10 20 30 40 50

LR0 [cM] CIW2 F19G14A nga2235 ngaT3B23 nga361 nga3470 nga3692 ngaT2P4

Chr I Chr II

20 40 60 80 100 120 10 20 30 40 50 60

LR0 nga162 3g283 7 F1P2TGF nga6

20 40 60 80 100 120 10 20 30 40

LR0 [cM] JV30/31 nga8 CIW7 nga1139

Chr III Chr IV

20 40 60 80 100 120 10 20 30 40

LR0 [cM] nga15 1 nga13 9 AtS0191 nga12 9

Chr V

Trichome number Trichome density

slide-63
SLIDE 63

28.7.2009

Components of trichome development

Patterning involves positive and negative regulators with redundant functions

WT try cpc

slide-64
SLIDE 64

28.7.2009

Model of epidermal patterning

Phenotypic class of mutants Gene Action no trichomes “glabrous” GL1 Positive regulators TTG1 decreased trichome # GL3/ EGL3 GL2 nests of trichomes TRY Negative regulators single-repeat R3 MYB genes, act non-cell autonomous increased trichome # CPC

GL3 GL3 TTG1 GL1 GL2, … CPC, TRY, ETC1, ETC2, TCL1, TCL2,ETC3

modified from Larkin et al., 2003

All of the genes or their paralogs are also involved in root hair patterning

slide-65
SLIDE 65

28.7.2009

Composite Interval Mapping

TCL2

slide-66
SLIDE 66

28.7.2009

Fine mapping on chromosome II

  • Selective genotyping

– 465 F2 with extreme phenotype – 10 additional markers

  • 66 recombinants in the initial interval
  • > expected mapping resolution of 40kb
  • 288 kb mapping interval with remainig 87 genes

ngaT3B23 nga3000 nga3023 HDUP3038 Small Mybs nga3073 nga3098 nga361 nga31082 nga3119 nga3168 nga3341

% recombinant chromosomes

1 2 3 4 5 6

slide-67
SLIDE 67

28.7.2009

3 candidate genes

Three single-repeat R3 MYB genes are located in the fine- mapping interval

At2g30440 TCL1 ETC2 KIS TCL2 At2g30430

1kb

slide-68
SLIDE 68

28.7.2009

Association with trichome number

CPC 84-I Gr Can Oy Ler Col 84-I CPC Gr Can Oy Ler Col Ws 20-13 84-I CPC Gr Can Oy Ler Col Ws 20-13

low trichome number accessions high trichome number accessions

Small MYB_A Small MYB_B Small MYB_C

slide-69
SLIDE 69

28.7.2009

Complementation test

mutants are complemented with the wildtype alleles from high and low trichome number accessions

smallmybCol SmallMybGr-1 smallMYBLer smallMYBCan-0 SmallMYB20-13 # of trichomes/leaf 100 200 300 400 F1 cross to low trichome number accession high trichome number accession

slide-70
SLIDE 70

28.7.2009

Identification of QTN

Many SNPs show association with trichome phenotype Screen many high/low trichome accessions for recombinants

slide-71
SLIDE 71

28.7.2009

Identification of QTN: 2 candidates

  • 53

+55

slide-72
SLIDE 72

28.7.2009

Identification of QTN

Gr-1 Can-0

A C A G

  • 53 bp

+55 bp

position Gr-1 Can-0

C C G G

Gr-1 Can-0

A A A A

allele background SNP status wt +55 Gr-1

  • 53 Can-0

+55 Can-0

  • 53 Gr-1
slide-73
SLIDE 73

28.7.2009

Identification of QTN

Transgenic complementation

50 100 150 200 250 300 # of trichomes/leaf +55Gr-1 +55Can-0 Gr-1 Can-0

  • 53Can-0
  • 53Gr-1

allele background

ANOVA SNP state: p=0.001 allele background: p=0.35

Mean ± s.e.m. of n>21 T1 lines each, 3 leaf positions

slide-74
SLIDE 74

28.7.2009

QTN is highly conserved among family members

+55 mutation leads to an amino acid replacement: Lysine (K) to Glutamate (E) K: ancestral, yet unknown importance

Alpha Helices 1-3 constituting R3 MYB domain with conserved W residues forming cluster Predicted bHLH interaction motif: [DE]Lx2[RK]x3Lx6Lx3R (Zimmermann et al., Plant J 2004) Required for CPC movement (Kurata et al., Development 2005)

H1 H2 H3 CPL3 TCL1 CPL3 TCL1

K19E

TCL2 TCL2 TCL1

slide-75
SLIDE 75

28.7.2009

Competition between activators and repressors

TTG1 GL3/EGL3 TTG1 GL3/EGL3 small myb target genes GL1 target genes small Myb GL1 TTG1 GL3/EGL3

competition

slide-76
SLIDE 76

28.7.2009

3 possible factors for trichome patterning

Binding strength to GL3 or the regulatory region Movement rate to neighboring cells Stability

Lysine modification by: methylation N-glycosylation ubiquitylation sumoylation acetylation Glutamate modification by: methylation

slide-77
SLIDE 77

28.7.2009

Next steps

Biochemistry Cell Biology

slide-78
SLIDE 78

28.7.2009

Sumoylation? Use ExPASy

slide-79
SLIDE 79

28.7.2009

Take Home Message

Functional genetics & natural variation – powerful tool Strong QTL or Association 33% of natural variation in trichome number was explainable by a single aa replacement Classical genetic studies failed to identify the major modifier of trichome number Typical accession used for functional tests (Ws, Col) have intermediate-high trichome number and the weak suppressor allele Requirements for success

slide-80
SLIDE 80

28.7.2009

Association Mapping

trichome density

In future

– phenotyping ecotypes/accession

  • look up into the association map
  • identify the gene with the variable SNPs
  • functional confirmation
slide-81
SLIDE 81

28.7.2009

More sequences * More functions * More to compare

slide-82
SLIDE 82

28.7.2009

THANKS

QUESTIONS?