Smart Templates for Peak Pattern Matching with Comprehensive - - PowerPoint PPT Presentation

smart templates for peak pattern matching with
SMART_READER_LITE
LIVE PREVIEW

Smart Templates for Peak Pattern Matching with Comprehensive - - PowerPoint PPT Presentation

Outline Introduction Preprocessing Smart Templates Concluding Remarks Smart Templates for Peak Pattern Matching with Comprehensive Two-Dimensional Liquid Chromatography (LCxLC) Stephen E. Reichenbach a , Peter W. Carr b , Dwight R. Stoll b ,


slide-1
SLIDE 1

Outline Introduction Preprocessing Smart Templates Concluding Remarks

Smart Templates for Peak Pattern Matching with Comprehensive Two-Dimensional Liquid Chromatography (LCxLC)

Stephen E. Reichenbacha, Peter W. Carrb, Dwight R. Stollb, and Qingping Taoc

aComputer Science & Engineering Department

University of Nebraska-Lincoln

bDepartment of Chemistry

University of Minnesota

cGC Image, LLC

Lincoln NE

GC Image

Informatics for Comprehensive Two-Dimensional Chromatography

HPLC 2008, Baltimore MD, 14 May 2008

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-2
SLIDE 2

Outline Introduction Preprocessing Smart Templates Concluding Remarks

Introduction Preprocessing Smart Templates Concluding Remarks

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-3
SLIDE 3

Outline Introduction Preprocessing Smart Templates Concluding Remarks Comprehensive Two-Dimensional Liquid Chromatography Peak Identification and Classification Smart Templates for Peak Pattern Matching

Comprehensive Two-Dimensional Liquid Chromatography

Comprehensive Two-Dimensional Liquid Chromatography (LCxLC) is ever faster and more powerful. The greater peak separation capacity of LCxLC is especially critical for important, but complex biochemical applications, including proteomics and metabolomics. The paucity of efficient, convenient and sufficiently powerful data analysis tools is the greatest impediment to its wide application.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-4
SLIDE 4

Outline Introduction Preprocessing Smart Templates Concluding Remarks Comprehensive Two-Dimensional Liquid Chromatography Peak Identification and Classification Smart Templates for Peak Pattern Matching

Peak Identification and Classification

Fundamental goal: identify, classify, and quantify constituent compounds from chromatographic peaks. Traditional approaches for peak identification include: Retention-time windows Multispectral matching (e.g., mass spectra library search) Retention-time windows must be small for “crowded” separations. Chromatographic variations may cause peaks to “drift” outside of the windows. Multispectral matching may be uncertain for large chemical domains with chemically similar compounds.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-5
SLIDE 5

Outline Introduction Preprocessing Smart Templates Concluding Remarks Comprehensive Two-Dimensional Liquid Chromatography Peak Identification and Classification Smart Templates for Peak Pattern Matching

Smart Templates for Peak Pattern Matching

New approach for peak identification and classification. Smart TemplatesTM record: Multidimensional retention-time pattern of peaks. Analytical metadata, including peak identities, groupings, labels, etc. Rules for recognizing peaks (e.g., based on multispectral characteristics). The Smart Template pattern is recognized in subsequent data and the analytical metadata are used to identify and classify peaks.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-6
SLIDE 6

Outline Introduction Preprocessing Smart Templates Concluding Remarks Background Correction Peak Detection Spectral Identification

Background Correction

LCxLC data contains significant variations in the background. Background must be corrected for accurate peak detection and quantitation. New method builds statistical models of the slowly varying background in each of the two dimensions of separation and then subtracts the background model value from the data. Background correction in each “channel” of multispectral data.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-7
SLIDE 7

Outline Introduction Preprocessing Smart Templates Concluding Remarks Background Correction Peak Detection Spectral Identification

Background Correction Example

Before After

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-8
SLIDE 8

Outline Introduction Preprocessing Smart Templates Concluding Remarks Background Correction Peak Detection Spectral Identification

Peak Detection

Simple two-dimensional approach detects each peak from its apex to surrounding minima. (Drain algorithm developed for GCxGC.) Example uses only the total intensity count (TIC) of the UV data. (Detection threshold for apex magnitude and footprint area.) Multivariate chemometric methods may be able to unmix (e.g., deconvolve) coeluted peaks based on multispectral signatures.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-9
SLIDE 9

Outline Introduction Preprocessing Smart Templates Concluding Remarks Background Correction Peak Detection Spectral Identification

Peak Detection Example

Region with indoles shown in 3D perspective (below). Detected target peaks out- lined in black (right).

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-10
SLIDE 10

Outline Introduction Preprocessing Smart Templates Concluding Remarks Background Correction Peak Detection Spectral Identification

Spectral Identification

Spectral matching based on similarities or differences between a spectrum and reference/library spectra. Spectral matching may be uncertain. For example, spectra of 5 indole standards in detected peaks matched with database of UV absorbance spectra of 26 indoles. Correct spectral match from 33% (indole-3-acetic acid) to 100% (indole-3-acetonitrile). Spectral matching is insufficient for complex mixtures.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-11
SLIDE 11

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Templates and Matching

Templates record the retention-time pattern of peaks along with analytical metadata (peak identifications, groupings, etc.). Goal of matching is to transform the template pattern in the retention-time plane (e.g., shifting and scaling) to match the detected peaks in another chromatogram. Matching criteria is the number of peak correspondences between the template and the target. Matching is subject to geometric transformation parameters and correspondences are subject to retention-time window.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-12
SLIDE 12

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Template Matching Example

Standards: Template (#1/64) & Target (#20/64) Overlay Matching

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-13
SLIDE 13

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Retention-Time Variability

Chromatographic variability changes peak patterns. Template Sequence # Target Sequence # Translation(1) Translation(2) Scaling(1) Scaling(2) 1 2 0.0000 −0.0711 1.0000 1.0119 2 20 −0.2493 −0.1014 0.9924 0.9788 20 38 −0.1069 0.0278 0.9990 1.0032 38 63 −0.2007 0.1883 0.9851 1.0286 63 64 0.0000 −0.0458 1.0000 1.0042 1 64 −0.5480 −0.0036 0.9771 1.0273

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-14
SLIDE 14

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Retention-Time Variability

Observations: Adjacent runs have small variability. First column translation and scaling are monotonically non-increasing, with larger cumulative effect. With this simple example, matching parameters can be increased to find the correct correspondences. More complex data presents more difficult pattern matching, requiring smarter matching.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-15
SLIDE 15

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Template Matching Errors

Urine sample (control): Template (#11/64) & Target (#15/64) Arrow 1: Peak error, peak not detected cannot be matched. Arrows 2–4: Peak error, merged peak not detected cannot be matched. Arrow 5: Match error, peak too distant not matched. Arrows 6–7: Match error, merged peak causes incorrect peak match.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-16
SLIDE 16

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Smart Templates

Smart Templates use chemical rules to constrain peak correspondences during template matching. Rules in Computer Language for Identifying Chemicals (CLIC). CLIC has functions for multispectral comparison; multispectral library search; multispectral characteristics (e.g., spectral peak

  • rdering, ratios), chromatographic peak statistics (e.g., TIC,

symmetry, retention time); and arithmetic, relational, and logical

  • perators.

Rules reduce matching errors with incorrect peaks. Rules allow larger retention-time windows to reduce matching errors from peak “drift”.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-17
SLIDE 17

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Automated Rules for Smart Templates

Evaluate multispectral comparisons of template spectrum with: (1) correct spectra and (2) incorrect spectra. Construct a rule with a threshold that allows correct matches and disallows incorrect matches. EuclideanDistance(“<ms>”) < 0.2 Different rule threshold for each template peak.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-18
SLIDE 18

Outline Introduction Preprocessing Smart Templates Concluding Remarks Templates and Matching Retention-Time Variability Template Matching Errors Smart Templates Automated Rules for Smart Templates

Smart Templates Examples

Template (#11/64) Template (#3/64) & Target (#15/64) & Target (#63/64)

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-19
SLIDE 19

Outline Introduction Preprocessing Smart Templates Concluding Remarks Conclusions Questions?

Conclusions

Smart Templates can automatically identify and classify peaks even with drift from variable chromatography. Smart Templates record peak pattern, metadata, and chemical

  • rules. Peak matching recognizes the template pattern in new

chromatograms. Applicable to LCxLC and other multidimensional separations. Applicable to UV, mass spectrometric, and other detectors.

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC

slide-20
SLIDE 20

Outline Introduction Preprocessing Smart Templates Concluding Remarks Conclusions Questions?

Questions ?

Steve Reichenbach: reich@unl.edu LCxLC Software: www.LCxLC.com

S.E. Reichenbach, P.W. Carr, D.R. Stoll, Q. Tao Smart Templates for Peak Pattern Matching with LCxLC