Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya - PowerPoint PPT Presentation

Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya Liu, Southern Methodist University Rick Stanton, JCVI Richard Scheuermann, JCVI N01-AI40076 (BISC) U01-AI089859 (HIPC) R01-EB008400 (Gottardo R, PI)

General cross sample comparison challenge • Algorithms like FLOCK identify data clusters in multidimensional FCM data one file at a time • Would like to compare equivalent populations across multiple samples • Previous approach • Either select a "representative" sample as a template or concatenate data from multiple files • Generate centroid list using FLOCK • Cluster each sample file separately using centroid list • Problems associated with representative or concatenated

Friedman-Rafsky (F-R) algorithm concept • Multivariate generalization of Wald Wolfowitz (WW) run test • WW is a non-parametric statistical test to determine if two populations have the same distributions • Null hypothesis = both populations have same distributions • Label N total cells • m cells from populations A and • n cells from population B and combine • Sort • Test statistic is function of total runs R • Where R = N sequences of identical labels • Examples: • R = 2 for A A A A B B B B • R = 7 for A B A A B A B A • Null hypothesis rejected for small values of R

Friedman-Rafsky (F-R) algorithm concept – Minimal Spanning Tree Minimal Spanning Tree allows multivariate generalization (c) Remove edges linking (a) Pool samples of (b) Calculate Minimal Spanning Tree different samples two sets

F-R Advantages and Drawbacks Advantages: • Non-parametric method – no need for knowledge of distribution parameters • Ability to discriminate population characteristics that are tough to describe parametrically (skew, odd shapes) • Can provide feedback to automated gating algorithms when the number of populations is unknown. • Example, if two subpopulations in sample 1 are matched to one same subpopulation in sample 2, it indicates that either sample 1 is over-partitioned sample 1 or we didn't partitioned sample 2 enough. Drawbacks: • Computationally expensive, need to downsample

Implementation of the F-R algorithm For two samples: • Get the auto-gating results from FLOCK or any other auto-gating software • For every pair of populations, one from sample A and the other from sample B, • If either populations has more than 100 events (predetermined, changeable) • Take a random sample of 100 • Apply the F-R test to the sampled population(s) to obtain the p-value • Repeat 20 times (predetermined, changeable) • Calculate the averaged p-value • Repeat the procedure for all pairs and obtain the p-value matrix • Set up a predetermined cutoff to identify the matched pair (may need to adjust cutoff for different shifts)

Simulation of data to characterize performance Experimental data Simulated data

Movements of simulation of data to characterize performance

Movements of Simulation of data to characterize performance

Use of FLOCK + Friedman-Rafsky (F-R) in Challenge 1 and 4 FLOCK can be accessed via Immport website

Use of FLOCK + Friedman-Rafsky (F-R) in Challenge 1 and 4 Processing Steps • Identify populations with FLOCK • Map FLOCK populations to T Cell target populations for a representative T Cell sample (target = Stanford 1)

Use of FLOCK + Friedman-Rafsky (F-R) in Challenge 1 and 4 Processing Steps • Apply F-R algorithm to perform cross sample associations across the other samples.

Cross sample comparisons – challenge 4 T Cell data Target data set (Stanford 1) compared with other datasets using P Values from the F-R test

Future Directions Better accommodate differences in gains across instruments (shifts, dialations) Evaluate and incorporate lessons learned here at Flowcap III

Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya - PowerPoint PPT Presentation

Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya Liu, Southern Methodist University Rick Stanton, JCVI Richard Scheuermann, JCVI N01-AI40076 (BISC) U01-AI089859 (HIPC) R01-EB008400 (Gottardo R, PI) General cross sample

Friedman on Interpretations The Friedman Characterization Friedman on Faithful

FLOCK: A Density Based Clustering Method for FLOCK: A Density Based Clustering Method for

Fly with Me: Algorithms and Methods for Influencing a Flock Katie Genter The University of Texas

Adding Influencing Agents to a Flock Katie Genter and Peter Stone The University of Texas at

VAST CHALLENGE 2017 Bianca Barnucz & Stephanie Wegscheidl OVERVIEW VAST Challenge

Steven Friedman, CohnReznick LLP Steve.friedman@cohnreznick.com 2019 NMHC Annual Meeting January

Welcome to BLOOMERANG ACADEMY THANK YOU for joining us! YOUR PRESENTER Max Friedman Max Friedman

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013

ReSAKSS DATA CHALLENGE Annual Newsletter www.resakss.org/challenge ReSAKSS DATA CHALLENGE ANNUAL

Boosting Simulation Performance with Python Eran Friedman How to use Discrete-Event Simulation

Edward Friedman Philip Papaelias Aaron Gononsky Lois Burns Kenneth Aberbach Sharon McKenzie

STEP CHALLENGE February 7 th March 8 th CHALLENGE OVERVIEW This Step Challenge is a fun

Michelin Challenge Bibendum 2014 CONTENT CHALLENGE BIBENDUM THINK & ACTION TANK TO

Ultimately our vision is about GRAND CHALLENGE using science to make a difference in the world.

New Challenge 10 New Challenge 10 June 1, 2007 Business environment Direction Challenge

Friedman and the Phillips Curve Philosophy of Economics University of Virginia Matthias

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

A Frequentist Semantics for a Generalized Jeffrey Conditionalization Dirk Draheim Tallinn

Welcome ! BAYE SIAN R E G R E SSION MOD E L IN G W ITH R STAN AR M Jake Thompson Ps y

Frequentist and Bayesian statistics Claus Ekstrm E-mail: ekstrom@life.ku.dk Outline 1

Tuning numerical parameters of algorithms: sampling and stochasticity handling Z. Yuan, T. St

Behavioral Programming: A Broader and More Detailed Take on Semantic GP Krzysztof Krawiec 1

Update on TopHat & measurement system interconnection Jordan Aug e , Timur Friedman, Thomas

CMPE 646: VLSI Design Verification and Test Course: CMPE 646: VLSI Design Verification and Test,

Sambuz

Useful Links

Newsletter

Mail Us

Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya - PowerPoint PPT Presentation

Use of FLOCK + Friedman-Rafsky (F- R) in Challenge 1 and 4 Mengya Liu, Southern Methodist University Rick Stanton, JCVI Richard Scheuermann, JCVI N01-AI40076 (BISC) U01-AI089859 (HIPC) R01-EB008400 (Gottardo R, PI) General cross sample

Friedman on Interpretations The Friedman Characterization Friedman on Faithful

FLOCK: A Density Based Clustering Method for FLOCK: A Density Based Clustering Method for

Fly with Me: Algorithms and Methods for Influencing a Flock Katie Genter The University of Texas

Adding Influencing Agents to a Flock Katie Genter and Peter Stone The University of Texas at

VAST CHALLENGE 2017 Bianca Barnucz &amp; Stephanie Wegscheidl OVERVIEW VAST Challenge

Steven Friedman, CohnReznick LLP Steve.friedman@cohnreznick.com 2019 NMHC Annual Meeting January

Welcome to BLOOMERANG ACADEMY THANK YOU for joining us! YOUR PRESENTER Max Friedman Max Friedman

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013

ReSAKSS DATA CHALLENGE Annual Newsletter www.resakss.org/challenge ReSAKSS DATA CHALLENGE ANNUAL

Boosting Simulation Performance with Python Eran Friedman How to use Discrete-Event Simulation

Edward Friedman Philip Papaelias Aaron Gononsky Lois Burns Kenneth Aberbach Sharon McKenzie

STEP CHALLENGE February 7 th March 8 th CHALLENGE OVERVIEW This Step Challenge is a fun

Michelin Challenge Bibendum 2014 CONTENT CHALLENGE BIBENDUM THINK &amp; ACTION TANK TO

Ultimately our vision is about GRAND CHALLENGE using science to make a difference in the world.

New Challenge 10 New Challenge 10 June 1, 2007 Business environment Direction Challenge

Friedman and the Phillips Curve Philosophy of Economics University of Virginia Matthias

Probability Basics Martin Emms October 1, 2020 Probability Basics Outline Probability

A Frequentist Semantics for a Generalized Jeffrey Conditionalization Dirk Draheim Tallinn

Welcome ! BAYE SIAN R E G R E SSION MOD E L IN G W ITH R STAN AR M Jake Thompson Ps y

Frequentist and Bayesian statistics Claus Ekstrm E-mail: ekstrom@life.ku.dk Outline 1

Tuning numerical parameters of algorithms: sampling and stochasticity handling Z. Yuan, T. St

Behavioral Programming: A Broader and More Detailed Take on Semantic GP Krzysztof Krawiec 1

Update on TopHat &amp; measurement system interconnection Jordan Aug e , Timur Friedman, Thomas

CMPE 646: VLSI Design Verification and Test Course: CMPE 646: VLSI Design Verification and Test,

Sambuz

Useful Links

Newsletter

Mail Us

VAST CHALLENGE 2017 Bianca Barnucz & Stephanie Wegscheidl OVERVIEW VAST Challenge

Michelin Challenge Bibendum 2014 CONTENT CHALLENGE BIBENDUM THINK & ACTION TANK TO

Update on TopHat & measurement system interconnection Jordan Aug e , Timur Friedman, Thomas