Extracting a cellular hierarchy from high- dimensional single-cell - - PowerPoint PPT Presentation

extracting a cellular hierarchy from high dimensional
SMART_READER_LITE
LIVE PREVIEW

Extracting a cellular hierarchy from high- dimensional single-cell - - PowerPoint PPT Presentation

Extracting a cellular hierarchy from high- dimensional single-cell data Peng Qiu Department of Bioinformatics and Computational Biology University of Texas MD Anderson Cancer Center Flow / mass cytometry data Biology questions How many


slide-1
SLIDE 1

Extracting a cellular hierarchy from high- dimensional single-cell data

Peng Qiu

Department of Bioinformatics and Computational Biology University of Texas MD Anderson Cancer Center

slide-2
SLIDE 2

Flow / mass cytometry data

slide-3
SLIDE 3

Biology questions

  • How many cell types are there?
  • How are different cell types related to each other?
  • Does the cellular composition of a sample correlate with its overall phenotype?
slide-4
SLIDE 4

Introduction - gating

  • Example data

– 8-parameter flow cytometry – Mouse bone marrow – Parameters: c-kit, Sca-1, CD11b, B220, TCR-b, CD4, CD8

  • Traditional analysis: Gating
slide-5
SLIDE 5

Basic idea

Myeloids Myeloids B cells T cells Myeloids B cells CD4+ T cells CD8+ T cells

  • Method:

Spanning-tree Progression Analysis of Density-normalized Events (SPADE)

  • Consider the data as a point cloud
  • Extract the shape of the cloud
slide-6
SLIDE 6

SPADE

Qiu et al, Nature Biotechnology, in press

slide-7
SLIDE 7

SPADE applied to mouse bone marrow data

slide-8
SLIDE 8

SPADE vs. gating

slide-9
SLIDE 9

SPADE applied to human bone marrow data

Bendall et al, Science, 2011

slide-10
SLIDE 10

SPADE applied to CyTOF data of human BM

Qiu et al, Nature Biotechnology, in press

slide-11
SLIDE 11

Qiu et al, Nature Biotechnology, in press

slide-12
SLIDE 12

Challenge 2: Normal vs AML

  • 359 subjects

– 316 normal subjects – 43 AML samples

  • 8 Tubes per subject
  • Channels per tube: FSC+SSC+5 colors
slide-13
SLIDE 13

Challenge 2: Normal vs AML

Since the overlap among the 8 different staining panels/tubes is minimal, we consider them separately. Therefore, we have 359 fcs files to compare.

slide-14
SLIDE 14

Tube2 Sample1 Tube2 Sample2 …

Apply SPADE to the union

  • f the two clouds

Challenge 2: Normal vs AML

Since the overlap among the 8 different staining panels/tubes is minimal, we consider them separately. Therefore, we have 359 fcs files to compare.

slide-15
SLIDE 15

SPADE tree for Tube 2

slide-16
SLIDE 16

SPADE tree for Tube 2

slide-17
SLIDE 17

SPADE tree for Tube 2

slide-18
SLIDE 18

SPADE tree for Tube 2

slide-19
SLIDE 19

RELIEF classifier & Earth Mover’s Distance

Earth Mover’s Distance: a metric to compare two probability distributions

  • ver a structured domain.

RELIEF classifier for each testing sample, find its nears normal (N_N) and its nearest AML(N_AML) compute the following score: dist-to-N_N – dist-to-N_AML

slide-20
SLIDE 20

RELIEF classifier & Earth Mover’s Distance

Training samples Testing samples

slide-21
SLIDE 21

Challenge 3A

Use 48*2 samples to derive a SPADE tree Compute cell freq distribution for each sample For each sample, compute its distribution – the distribution

  • f its paired sample.

PCA

slide-22
SLIDE 22

Summary

  • Using SPADE, we can:

– Identify cell types – Compare multiple samples

slide-23
SLIDE 23

Acknowledgement

  • Sylvia Plevritis
  • Garry Nolan

– Erin Simonds, Sean Bendall, – Kenny Gibbs – Karen Sachs, Michael Linderman, Rob Bruggner – Matt Clutter, Tiffany Chen