Multivariate models of inter-subject anatomical variability John - - PowerPoint PPT Presentation

multivariate models of inter subject anatomical
SMART_READER_LITE
LIVE PREVIEW

Multivariate models of inter-subject anatomical variability John - - PowerPoint PPT Presentation

Introduction Geometric Variability Similarity Measures Real data Multivariate models of inter-subject anatomical variability John Ashburner Wellcome Trust Centre for Neuroimaging, UCL Institute of Neurology, 12 Queen Square, London WC1N


slide-1
SLIDE 1

Introduction Geometric Variability Similarity Measures Real data

Multivariate models of inter-subject anatomical variability

John Ashburner

Wellcome Trust Centre for Neuroimaging, UCL Institute of Neurology, 12 Queen Square, London WC1N 3BG, UK.

John Ashburner Anatomical Features

slide-2
SLIDE 2

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

“The only relevant test of the validity

  • f a hypothesis is comparison of

prediction with experience.”

Milton Friedman

John Ashburner Anatomical Features

slide-3
SLIDE 3

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Choosing models/hypotheses/theories

MacKay, DJC. “Bayesian interpolation.” Neural computation 4, no. 3 (1992): 415-447.

John Ashburner Anatomical Features

slide-4
SLIDE 4

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Evidence-based Science

...also just known as “science”. Researchers claim to find differences between groups. Do those findings actually discriminate? How can we most accurately diagnose a disorder from image data? Pharma wants biomarkers. How do we most effectively identify them? There are lots of potential imaging biomarkers. Which are most (cost) effective? Pattern recognition provides a framework to compare data (or preprocessing strategy) to determine the most accurate approach.

John Ashburner Anatomical Features

slide-5
SLIDE 5

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Biological variability is multivariate

John Ashburner Anatomical Features

slide-6
SLIDE 6

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

A generative classification approach

p(x,y=0) = p(x|y=0) p(y=0) Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 p(x,y=1) = p(x|y=1) p(y=1) Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 p(x) = p(x,y=0) + p(x,y=1) Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 p(y=0|x) = p(x,y=0)/p(x) Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1

John Ashburner Anatomical Features

slide-7
SLIDE 7

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Discriminative classification approaches

Ground truth Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 FLDA Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 SVC Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 Simple Logistic Regression Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1

John Ashburner Anatomical Features

slide-8
SLIDE 8

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Bayesian classification

Simple Logistic Regression Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 2 4 −7 −6 −5 −4 −3 −2 −1 Hyperplane Uncertainty Feature 1 Feature 2 Bayesian Logistic Regression Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1 Bayesian Logistic Regression Feature 1 Feature 2 2 4 −7 −6 −5 −4 −3 −2 −1

John Ashburner Anatomical Features

slide-9
SLIDE 9

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Why Bayesian?

To deal with different priors.

Consider a method with 90% sensitivity and specificity. Consider using this to screen for a disease afflicting 1% of the population. On average, out of 100 people there would be 10 wrongly assigned to the disease group. A positive diagnosis suggests only about a 10% chance of having the disease. P(Disease|Pred+) =

P(Pred+|Disease)P(Disease) P(Pred+|Disease)P(Disease)+P(Pred+|Healthy)P(Healthy)

=

Sensitivity×P(Disease) Sensitivity×P(Disease)+(1−Specificity)×P(Healthy)

Better decision-making by accounting for utility functions.

John Ashburner Anatomical Features

slide-10
SLIDE 10

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Curse of dimensionality

Large p, small n.

John Ashburner Anatomical Features

slide-11
SLIDE 11

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Nearest-neighbour classification

−3 −2 −1 1 2 −2 −1 1 2 Feature 1 Feature 2

Not nice smooth separations. Lots of sharp corners. May be improved with K-nearest neighbours.

John Ashburner Anatomical Features

slide-12
SLIDE 12

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Rule-based approaches

1 2 3 4 1 2 3 4 x y ((x<0.3) & (y<2)) | ((x<0.75) & (y<1.25)) | (y<0.4)

Not nice smooth separations. Lots of sharp corners.

John Ashburner Anatomical Features

slide-13
SLIDE 13

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Corners matter in high-dimensions

John Ashburner Anatomical Features

slide-14
SLIDE 14

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Corners matter in high-dimensions

2 4 6 8 10 12 14 16 18 20 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Circle area = π r2 Sphere volume = 4/3 π r3 Number of dimensions Volume of hyper−sphere (r=1/2)

John Ashburner Anatomical Features

slide-15
SLIDE 15

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Dimensionality = number of voxels

Little evidence to suggest that most voxel-based feature selection methods help.

Little or no increase in predictive accuracy. Commonly perceived as being more “interpretable”.

Prior knowledge derived from independent data is the most reliable way to improve accuracy.

e.g. search the literature for clues about which regions to weight more heavily.

Cuingnet, R´ emi, Emilie Gerardin, J´ erˆ

  • me Tessieras, Guillaume Auzias, St´

ephane Leh´ ericy, Marie-Odile Habert, Marie Chupin, Habib Benali, and Olivier Colliot. “Automatic classification of patients with Alzheimer’s disease from structural MRI: a comparison of ten methods using the ADNI database.” Neuroimage 56, no. 2 (2011): 766-781. Chu, Carlton, Ai-Ling Hsu, Kun-Hsien Chou, Peter Bandettini, and ChingPo Lin. “Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images.” Neuroimage 60, no. 1 (2012): 59-70. See winning strategies in http://www.ebc.pitt.edu/PBAIC.html John Ashburner Anatomical Features

slide-16
SLIDE 16

Introduction Geometric Variability Similarity Measures Real data Prediction Binary Classification Curse of dimensionality

Linear versus Nonlinear methods

Linear methods are more interpretable. Nonlinear methods usually increase dimensionality. Better to preprocess to obtain features that behave more linearly.

40 50 60 70 80 90 100 110 120 130 140 150 160 1.4 1.5 1.6 1.7 1.8 1.9 2 Weight (kg) Height (m) Body Mass Index (linear plot) BMI=18.5 BMI=25 BMI=30 40 50 60 70 80 90 100 120 140 160 1.4 1.5 1.6 1.7 1.8 1.9 2 Weight (kg) Height (m) Body Mass Index (log−log plot) BMI=18.5 BMI=25 BMI=30

2 4 6 8 10 2 4 6 8 10 Feature 1 Feature 2 Raw Data 10 10

−1

10 10

1

Feature 1 Feature 2 Log−Transformed

John Ashburner Anatomical Features

slide-17
SLIDE 17

Introduction Geometric Variability Similarity Measures Real data Manifolds Principal Components

Transformed images fall on manifolds

Rotating an image leads to points on a 1D manifold.

0.2 0.4 0.6 0.1 0.2 0.3 0.4 0.5 0.6 0.7

Rigid-body motion leads to a 6-dimensional manifold (not shown).

John Ashburner Anatomical Features

slide-18
SLIDE 18

Introduction Geometric Variability Similarity Measures Real data Manifolds Principal Components

Local linearisation through smoothing

200 400 600 800 100 200 300 400 500 600 700 800 100 200 300 400 50 100 150 200 250 300 350 400

Spatial smoothing can make the manifolds more linear with respect to small misregistrations. Some information is inevitably lost.

John Ashburner Anatomical Features

slide-19
SLIDE 19

Introduction Geometric Variability Similarity Measures Real data Manifolds Principal Components

One mode of geometric variability

Simulated images Principal components A suitable model would reduce these data to a single dimension.

John Ashburner Anatomical Features

slide-20
SLIDE 20

Introduction Geometric Variability Similarity Measures Real data Manifolds Principal Components

Two modes of geometric variability

Simulated images Principal components A suitable model would reduce these data to two dimensions.

John Ashburner Anatomical Features

slide-21
SLIDE 21

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Similarity Measures

Many methods are based on similarity measures. A common similarity measure is the dot product. Similarity: k(x, y) =

  • k

xkyk Nonlinear methods are often based on distances. Distance: d(x, y) =

  • k

(xk − yk)2 Similarity: k(x, y) = exp(−λd(x, y)2) How do we best measure distances between brain images?

John Ashburner Anatomical Features

slide-22
SLIDE 22

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Image Registration

Image registration measures distances between images. Often involves minimising the sum of two terms:

Distance between the image intensities. Distance of the deformation from zero.

The sum of these terms gives the distance.

µ ° θ f ° φ φ |Jφ| θ |Jθ|

John Ashburner Anatomical Features

slide-23
SLIDE 23

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Different ways of measuring distances

John Ashburner Anatomical Features

slide-24
SLIDE 24

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Different ways of measuring distances

Two simulated images

John Ashburner Anatomical Features

slide-25
SLIDE 25

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Metrics

Distances need to satisfy the properties of a metric:

1 d(x, y) ≥ 0 (non-negativity) 2 d(x, y) = 0 if and only if x = y (identity of indiscernibles) 3 d(x, y) = d(y, x) (symmetry) 4 d(x, z) ≤ d(x, y) + d(y, z) (triangle inequality).

Satisfying (3) requires inverse-consistent image registration. Satisfying (4) requires a specific family of image registration algorithm.

John Ashburner Anatomical Features

slide-26
SLIDE 26

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Non-Euclidean geometry

Distances are not always measured along a straight line. “Shapes are the ultimate non-linear sort of thing”

John Ashburner Anatomical Features

slide-27
SLIDE 27

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Linear approximations to nonlinear problems

John Ashburner Anatomical Features

slide-28
SLIDE 28

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Example Images

Some example (non-brain) images.

John Ashburner Anatomical Features

slide-29
SLIDE 29

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Registered Images

We could register the images to their average shape...

John Ashburner Anatomical Features

slide-30
SLIDE 30

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Deformations

...and study the deformations...

John Ashburner Anatomical Features

slide-31
SLIDE 31

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Jacobian Determinants

...or the relative volumes...

John Ashburner Anatomical Features

slide-32
SLIDE 32

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Scalar Momentum

... or “scalar momentum”

John Ashburner Anatomical Features

slide-33
SLIDE 33

Introduction Geometric Variability Similarity Measures Real data Distances Examples of features

Reconstructed Images

Reconstructions from template and scalar momenta.

John Ashburner Anatomical Features

slide-34
SLIDE 34

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Real data

Used 550 T1w brain MRI from IXI (Information eXtraction from Images) dataset. http://www. brain-development.org/ Data from three different hospitals in London: Hammersmith Hospital using a Philips 3T system Guy’s Hospital using a Philips 1.5T system Institute of Psychiatry using a GE 1.5T system

John Ashburner Anatomical Features

slide-35
SLIDE 35

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Grey and White Matter

Segmented into GM and WM. Approximately aligned via rigid-body.

Ashburner, J & Friston, KJ. Unified segmentation. NeuroImage 26(3):839–851 (2005). John Ashburner Anatomical Features

slide-36
SLIDE 36

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Diffeomorphic Alignment

All GM and WM were diffeomorphically aligned to their common average-shaped template.

Ashburner, J & Friston, KJ. Diffeomorphic registration using geodesic shooting and Gauss-Newton optimisation. NeuroImage 55(3):954–967 (2011). Ashburner, J & Friston, KJ. Computing average shaped tissue probability templates. NeuroImage 45(2):333–341 (2009). John Ashburner Anatomical Features

slide-37
SLIDE 37

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Volumetric Features

A number of features were used for pattern recognition. Firstly, two features relating to relative volumes. Initial velocity divergence is similar to logarithms of Jacobian determinants. Jacobian Determinants Initial Velocity Divergence

John Ashburner Anatomical Features

slide-38
SLIDE 38

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Grey Matter Features

Rigidly Registered GM Nonlinearly Registered GM Registered and Jacobian Scaled GM

John Ashburner Anatomical Features

slide-39
SLIDE 39

Introduction Geometric Variability Similarity Measures Real data Data Features Results

“Scalar Momentum” Features

“Scalar momentum” actually has two components because GM was matched with GM and WM was matched with WM. First Momentum Component Second Momentum Component

John Ashburner Anatomical Features

slide-40
SLIDE 40

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Age Regression

Linear Gaussian Process Regression to predict subject ages.

2 4 6 8 10 12 14 16 18 20 −2020 −2000 −1980 −1960 −1940 −1920 −1900 −1880 −1860 −1840 −1820

Bayesian Model Evidence Smoothing (mm) Log Likelihood

Jacobians Divergences Rigid GM Unmodulated GM Modulated GM Scalar Momentum 2 4 6 8 10 12 14 16 18 20 6 6.5 7 7.5 8 8.5

8−Fold Cross−Validation Smoothing (mm) RMS error (years)

Jacobians Divergences Rigid GM Unmodulated GM Modulated GM Scalar Momentum

Rasmussen, CE & Williams, CKI. Gaussian processes for machine learning. Springer (2006). John Ashburner Anatomical Features

slide-41
SLIDE 41

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Sex Classification

Linear Gaussian Process Classification (EP) to predict sexes.

2 4 6 8 10 12 14 16 18 20 −260 −240 −220 −200 −180 −160 −140 −120

Bayesian Model Evidence Smoothing (mm) Log Likelihood

Jacobians Divergences Rigid GM Unmodulated GM Modulated GM Scalar Momentum 2 4 6 8 10 12 14 16 18 20 0.91 0.92 0.93 0.94 0.95 0.96 0.97 0.98

Gaussian Process (EP) Smoothing (mm) AUC

Jacobians Divergences Rigid GM Unmodulated GM Modulated GM Scalar Momentum

Rasmussen, CE & Williams, CKI. Gaussian processes for machine learning. Springer (2006). John Ashburner Anatomical Features

slide-42
SLIDE 42

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Predictive Accuracies

Age

10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100

Scalar Momentum (10mm FWHM) Actual Age (years) Predicted Age (years)

Sex

0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1

ROC Curve (AUC=0.9769) Sensitivity Specificity John Ashburner Anatomical Features

slide-43
SLIDE 43

Introduction Geometric Variability Similarity Measures Real data Data Features Results

Conclusions

Scalar momentum (with about 10mm smoothing) appears to be a useful feature set. Jacobian-scaled warped GM is surprisingly poor. Amount of spatial smoothing makes a big difference. Further dependencies on the details of the registration still need exploring.

John Ashburner Anatomical Features

slide-44
SLIDE 44

Introduction Geometric Variability Similarity Measures Real data Data Features Results John Ashburner Anatomical Features

slide-45
SLIDE 45

Introduction Geometric Variability Similarity Measures Real data Data Features Results John Ashburner Anatomical Features

slide-46
SLIDE 46

Introduction Geometric Variability Similarity Measures Real data Data Features Results John Ashburner Anatomical Features