Proteomics Informatics Analysis of mass spectra: signal processing, - - PowerPoint PPT Presentation

proteomics informatics analysis of mass spectra signal
SMART_READER_LITE
LIVE PREVIEW

Proteomics Informatics Analysis of mass spectra: signal processing, - - PowerPoint PPT Presentation

Proteomics Informatics Analysis of mass spectra: signal processing, peak finding, and isotope clusters (Week 3) Charge-State Distributions MALDI ESI 1+ 2+ intensity intensity 3+ Peptide 4+ 1+ 2+ mass/charge mass/charge m M nH


slide-1
SLIDE 1

Proteomics Informatics – Analysis of mass spectra: signal processing, peak finding, and isotope clusters (Week 3)

slide-2
SLIDE 2

Charge-State Distributions

mass/charge intensity MALDI ESI mass/charge intensity 1+ 1+ 2+ 3+ 4+ Peptide Protein 2+

n nH M z m + =

M - molecular mass n - number of charges H – mass of a proton mass/charge intensity mass/charge intensity 1+ 27+ 2+ 3+ 4+ MALDI ESI 5+ 31+

slide-3
SLIDE 3

Charge-State

Example: peptide of mass 898 carrying 1 H+ = (898 + 1) / 1 = 899 m/z carrying 2 H+ = (898 + 2) / 2 = 450 m/z carrying 3 H+ = (898 + 3) / 3 = 300.3 m/z

n nH M z m + =

M - molecular mass n - number of charges H – mass of a proton

slide-4
SLIDE 4

m = 1035 Da m= 1878 Da m = 2234 Da

Isotope Distributions

m/z m/z m/z Intensity

0.015% 2H 1.11% 13C 0.366% 15N 0.038% 17O, 0.200% 18O, 0.75% 33S, 4.21% 34S, 0.02% 36S Only 12C and 13C: p=0.0111 n is the number of C in the peptide m is the number of 13C in the peptide Tm is the relative intensity of the peptide m 13C

12C 14N 16O 1H 32S

+1Da +2Da +3Da

slide-5
SLIDE 5

Isotope distributions

Peptide mass Intensity ratio Peptide mass Intensity ratio

m/z monoisotopic mass GFP 29kDa

slide-6
SLIDE 6

Resolution

Resolution = minimum peak separation, ∆M, which allows to distinguish two ion species

Relative Intensity m/z

I I I I I

501.5 502.0 500.5 500.0 499.5

500

50 %

Resolution = M/∆M = 500/0.5 = 1000 ∆M = full width at half maximum (FWHM)

R = M ∆M = resolving power

slide-7
SLIDE 7

Resolution

slide-8
SLIDE 8
  • What resolution do we need to differentiate

a 1600 Da peptide that carries either an acetylation (+ 42.0100) or trimethylation (42.0464 )?

  • R = 1600/0.0364 = 43,956

R = M ∆M = resolving power

Resolution

slide-9
SLIDE 9

Isotope Clusters and Charge State

m/z Intensity 1+ 1 1 1 m/z Intensity 2+ 0.5 0.5 0.5 m/z Intensity 3+ 0.33 0.33 0.33

slide-10
SLIDE 10

Isotope Clusters and Charge State

m/z Intensity Possible to Determine Charge? Yes Yes Maybe No

slide-11
SLIDE 11

432.8990 433.2330 433.5671 433.9014 713.3225 713.8239 714.3251 714.8263

What is the Charge State?

∆ between the isotopes is 0.5 Da ∆ between the isotopes is 0.33 Da

slide-12
SLIDE 12

Noise

slide-13
SLIDE 13

Smoothing

slide-14
SLIDE 14

Smoothing

slide-15
SLIDE 15

Adaptive Background Correction (Unsharp masking)

+ = − =

+ =

w l k w l k

k I w d w d l I ) ( 1 2 ) , , ( '

Unsharp masking Original

slide-16
SLIDE 16

Adaptive Background Correction

slide-17
SLIDE 17

Smoothing and Adaptive Background Correction

slide-18
SLIDE 18

Peak Finding

m/z Intensity

+ = − =

=

w l k w l k

k I l S ) ( ) ( Find maxima of The centroid m/z of a peak

∑ ∑

+ = − = + = − =

w l k w l k w l k w l k

k I k z m k I ) ( ) ( ) (

slide-19
SLIDE 19

Peak Finding

m/z Intensity

The signal in a peak can be estimated with the RMSD 2

2 2

/

/ | |

) ) ( (

w

w l k

I k I ∑ > < −

< −

and the signal-to-noise ratio of a peak can be estimated by dividing the signal with the RMSD of the background

slide-20
SLIDE 20

Estimating peptide quantity

Peak height Curve fitting Peak area Peak height Curve fitting

m/z Intensity

slide-21
SLIDE 21

Time dimension

m/z Intensity

Time

m/z Time

slide-22
SLIDE 22

Sampling

Retention Time Intensity

slide-23
SLIDE 23

5 10 15 20 25 30 0.8 0.85 0.9 0.95 1

3 points

20 40 60 80 100 120 140 0.8 0.85 0.9 0.95 1

3 points

5%

Acquisition time = 0.05σ

5%

Sampling

slide-24
SLIDE 24

0.5 0.6 0.7 0.8 0.9 1 1.1 1 2 3 4 5 6 7 8 9 10

Thresholds (90%) # of points

Sampling

slide-25
SLIDE 25

What is the best way to estimate quantity?

Peak height

  • resistant to interference
  • poor statistics

Peak area

  • better statistics
  • more sensitive to interference

Curve fitting

  • better statistics
  • needs to know the peak shape
  • slow
slide-26
SLIDE 26

Web Tool

http://10.193.36.101/plot-filter-cgi/plot_filter.pl or http://10.193.36.219/plot-filter-cgi/plot_filter.pl

slide-27
SLIDE 27

Web Tool

http://10.193.36.101/plot-filter-cgi/plot_filter.pl or http://10.193.36.219/plot-filter-cgi/plot_filter.pl

slide-28
SLIDE 28

Proteomics Informatics – Analysis of mass spectra: signal processing, peak finding, and isotope clusters (Week 3)