Proteomics Informatics (BMSC-GA 4437) Instructor David Feny - - PowerPoint PPT Presentation

proteomics informatics bmsc ga 4437
SMART_READER_LITE
LIVE PREVIEW

Proteomics Informatics (BMSC-GA 4437) Instructor David Feny - - PowerPoint PPT Presentation

Proteomics Informatics (BMSC-GA 4437) Instructor David Feny Contact information David@FenyoLab.org http://fenyolab.org/presentations/Proteomics_Informatics_2013/ Proteomics Informatics Learning Objectives Be able analyze a proteomics


slide-1
SLIDE 1

Proteomics Informatics (BMSC-GA 4437)

Instructor David Fenyö Contact information David@FenyoLab.org http://fenyolab.org/presentations/Proteomics_Informatics_2013/

slide-2
SLIDE 2
slide-3
SLIDE 3

Proteomics Informatics – Learning Objectives

Be able analyze a proteomics data set and understand the limitations of the results.

slide-4
SLIDE 4

Proteomics Informatics – Overview of Proteomics (Week 1)

  • Why proteomics?
  • Bioinformatics
  • Overview of the course
slide-5
SLIDE 5

Motivating Example: Protein Regulation

Geiger et al., “Proteomic changes resulting from gene copy number variations in cancer cells”, PLoS Genet. 2010 Sep 2;6(9). pii: e1001090.

slide-6
SLIDE 6

Motivating Example: Protein Complexes

Alber et al., Nature 2007

slide-7
SLIDE 7

Motivating Example: Signaling

Choudhary & Mann, Nature Reviews Molecular Cell Biology 2010

slide-8
SLIDE 8

Bioinformatics

Biological System Samples Measurements Experimental Design Raw Data Information Data Analysis

slide-9
SLIDE 9

Mass Spectrometry Based Proteomics Mass spectrometry Lysis Fractionation

MS

Digestion Identified and Quantified Proteins

Peak Finding Charge determination De-isotoping Integrating Peaks Searching

slide-10
SLIDE 10

Proteomics Informatics – Overview of Mass spectrometry (Week 2) Ion Source Mass Analyzer Detector

mass/charge intensity

slide-11
SLIDE 11

Mass Analyzer 1 Frag- mentation Detector Ion Source Mass Analyzer 2

b y

Proteomics Informatics – Overview of Mass spectrometry (Week 2)

slide-12
SLIDE 12

Proteomics Informatics – Overview of Mass spectrometry (Week 2)

Mass Analyzer 1 Frag- mentation Detector

intensity mass/charge

Ion Source Mass Analyzer 2 LC

intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge

Time

intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge intensity mass/charge

slide-13
SLIDE 13

Proteomics Informatics – Analysis of mass spectra: signal processing, peak finding, and isotope clusters (Week 3)

m/z Intensity

slide-14
SLIDE 14

Proteomics Informatics – Protein identification I: searching protein sequence collections and significance testing (Week 4)

MS/MS Lysis Fractionation

MS/MS

Digestion Sequence DB All Fragment Masses Pick Protein Compare, Score, Test Significance Repeat for all proteins Pick Peptide LC-MS Repeat for all peptides

slide-15
SLIDE 15

Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)

slide-16
SLIDE 16

Proteomics Informatics – Protein identification III: de novo sequencing (Week 6)

m/z % Relative Abundance 100 250 500 750 1000 [M+2H]2+ 762 260 389 504 633 875 292 405 534 907 1020 663 778 1080 1022

Mass Differences

1-letter code 3-letter code Chemical formula Monois

  • topic

Average A Ala C3H5ON 71.0371 71.0788 R Arg C6H12ON4 156.101 156.188 N Asn C4H6O2N2 114.043 114.104 D Asp C4H5O3N 115.027 115.089 C Cys C3H5ONS 103.009 103.139 E Glu C5H7O3N 129.043 129.116 Q Gln C5H8O2N2 128.059 128.131 G Gly C2H3ON 57.0215 57.0519 H His C6H7ON3 137.059 137.141 I Ile C6H11ON 113.084 113.159 L Leu C6H11ON 113.084 113.159 K Lys C6H12ON2 128.095 128.174 M Met C5H9ONS 131.04 131.193 F Phe C9H9ON 147.068 147.177 P Pro C5H7ON 97.0528 97.1167 S Ser C3H5O2N 87.032 87.0782 T Thr C4H7O2N 101.048 101.105 W Trp C11H10ON2 186.079 186.213 Y Tyr C9H9O2N 163.063 163.176 V Val C5H9ON 99.0684 99.1326

Amino acid masses

Sequences consistent with spectrum

slide-17
SLIDE 17

Proteomics Informatics – Protein identification IV: spectrum library searching (Week 7) Lysis Fractionation Digestion LC-MS/MS

MS/MS

Spectrum Library Pick Spectrum Compare, Score, Test Significance Repeat for all spectra Identified Proteins

slide-18
SLIDE 18

Proteomics Informatics – Protein quantitation I: metabolic labeling (SILAC), chemical labeling, label-free quantitation, spectrum counting (Week 8)

Fractionation Digestion LC-MS Lysis

MS

C ij

I ik

p

ij Pr

p

D ijk p Pep ik

p

LC ik

p

MS ik

p

L ij

p p p p p p C I

MS ik LC ik Pep ik j D ijk ij L ij ij k ik

      

Pr

Sample i Protein j Peptide k

p p p p p p I C

MS ik LC ik Pep ik D ijk ij L ij k ik k ij Pr

 k

slide-19
SLIDE 19

Proteomics Informatics – Protein quantitation I: metabolic labeling (SILAC), chemical labeling, label-free quantitation, spectrum counting (Week 8)

Fractionation Digestion LC-MS Lysis

MS MS

p p p p p p

MS ik LC ik Pep ik D ijk ij L ij k Pr

Assumption: constant for all samples

I I C C

j j j j

i i i i

m n m n

/ /

Sample i Protein j Peptide k

slide-20
SLIDE 20

Proteomics Informatics – Protein quantitation II: software (Week 9) Skyline MaxQuant

slide-21
SLIDE 21

Proteomics Informatics – Protein characterization I: post-translational modifications (Week 10)

Peptide with two possible modification sites MS/MS spectrum

m/z Intensity

Matching Which assignment does the data support? 1, 1 or 2, or 1 and 2?

slide-22
SLIDE 22

Proteomics Informatics – Protein Characterization II: protein-protein interactions, cross-linking, top-down, non-covalent complexes (Week 11)

A B A C D

Protein identification

slide-23
SLIDE 23

Proteomics Informatics – Molecular Signatures (Week 12)

slide-24
SLIDE 24

Proteomics Informatics – Molecular Signatures (Week 12)

slide-25
SLIDE 25

Proteomics Informatics – Presentations of projects (Week 13)

Select a published data set that has been made public and reanalyze it. Highlighted data sets: http://www.thegpm.org/ 10 min presentations

slide-26
SLIDE 26
slide-27
SLIDE 27

Proteomics Informatics (BMSC-GA 4437)

Instructor David Fenyö Contact information David@FenyoLab.org http://fenyolab.org/presentations/Proteomics_Informatics_2013/