Automatic Classifiers as Scientific Instruments: One Step Further - - PowerPoint PPT Presentation

automatic classifiers as scientific instruments one step
SMART_READER_LITE
LIVE PREVIEW

Automatic Classifiers as Scientific Instruments: One Step Further - - PowerPoint PPT Presentation

Automatic Classifiers as Scientific Instruments: One Step Further Away from Ground-Truth Jacob Whitehill and Anand Ramakrishnan Worcester Polytechnic Institute (WPI), Massachusetts, USA ICML2019 Machine learning to advance basic


slide-1
SLIDE 1

ICML’2019

Automatic Classifiers as Scientific Instruments:
 One Step Further Away from Ground-Truth

Jacob Whitehill and Anand Ramakrishnan
 Worcester Polytechnic Institute (WPI), Massachusetts, USA

slide-2
SLIDE 2

ICML’2019

Machine learning to advance basic science

  • Machine perception can advance basic science in:
  • Psychology
  • Education
  • Medicine
  • …by providing automatic classifiers as new scientific

instruments, e.g.:

  • Automatic stress detectors from wrist monitors


instead of questionnaires.

  • Facial action unit detectors from video


instead of electromyography.

  • Student engagement detectors from video


instead of observational protocols.

Empatica E4 EDA Emotient/ iMotions Kaur et al. 2018

slide-3
SLIDE 3

ICML’2019

Correlation study

  • Suppose a researcher wishes to measure the relationship

between two constructs U and V, e.g.:

  • U = stress
  • V = academic performance.
  • Standard methodology:
  • Use a standard measurement tool (e.g., survey,
  • bservational protocol) to estimate the values of U and V

from a sample of n participants.

  • This produces two vectors , which we can

assume w.l.o.g. have 0-mean and 1-length.

  • Estimate the correlation between U and V as:

u, v ∈ Rn

Only the angle between the two vectors determines their correlation.

r = ρ(u, v) = u>v = cos ∠(u, v)

slide-4
SLIDE 4

ICML’2019

  • But what if the researcher instead uses an automatic

stress detector d whose correlation with ground-truth measurements is q (known from prior validation)?

  • Instead of , the researcher obtains a vector .
  • What kind of spurious deductions about the correlation

between U and V could result?

Correlation study

u ˆ u

slide-5
SLIDE 5

ICML’2019

Trivariate correlation

  • Suppose and are ground-truth values of U and V.
  • The correlation between and is r = cos(105°) = -.259.

−1.0 −0.5 0.0 0.5 1.0 −1.00 −0.75 −0.50 −0.25 0.00 0.25 0.50 0.75 1.00

u v u v u v

r

slide-6
SLIDE 6

ICML’2019

  • Using a detector d, the researcher might obtain , whose

correlation with is q.

  • The correlation between and is cos(135°)= -.707 —

much larger than, but same sign as, the ground-truth correlation.

Trivariate correlation

−1.0 −0.5 0.0 0.5 1.0 −1.00 −0.75 −0.50 −0.25 0.00 0.25 0.50 0.75 1.00

ˆ u v ˆ u u v ˆ u u

q

slide-7
SLIDE 7

ICML’2019

Trivariate correlation

−1.0 −0.5 0.0 0.5 1.0 −1.00 −0.75 −0.50 −0.25 0.00 0.25 0.50 0.75 1.00

  • But they might also obtain vector , whose correlation


with is also q.

  • The correlation between and is cos(75°)= +.259 —

this is the opposite sign as the ground-truth correlation.

ˆ u0 u v ˆ u0 ˆ u u

q

ˆ u0 v

We call this a false correlation.

slide-8
SLIDE 8

ICML’2019

Main results

1.The set of all vectors whose correlation with is q, is an (n-3)-sphere . 2.If the correlation between and is r, then the expected sample correlation between and , where is drawn uniformly at random from , is qr. 3.We derive a formula h(n,q,r) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n.

u T n ∈ Rn

slide-9
SLIDE 9

ICML’2019

Main results

1.The set of all vectors whose correlation with is q, is an (n-3)-sphere . 2.If the correlation between and is r, then the expected sample correlation between and , where is drawn uniformly at random from , is qr. 3.We derive a formula h(n,q,r) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n.

u u v v ˆ u ˆ u T n T n ∈ Rn

slide-10
SLIDE 10

ICML’2019

Main results

1.The set of all vectors whose correlation with is q, is an (n-3)-sphere . 2.If the correlation between and is r, then the expected sample correlation between and , where is drawn uniformly at random from , is qr. 3.We derive a formula h(n,q,r) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n.

u u v v ˆ u ˆ u T n T n ∈ Rn

slide-11
SLIDE 11

ICML’2019

Main results

1.The set of all vectors whose correlation with is q, is an (n-3)-sphere . 2.If the correlation between and is r, then the expected sample correlation between and , where is drawn uniformly at random from , is qr. 3.We derive a formula h(n,q,r) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n.

u u v v ˆ u ˆ u T n T n ∈ Rn

But it can still be non-negligible for values of n, q used in recent affective computing studies — despite a small p-value.

slide-12
SLIDE 12

ICML’2019

Case study: Student engagement

  • vs. cognitive task performance

U: Engagement V: Cognitive task performance

  • Whitehill et al. 2014 measured student engagement using (1)
  • bservational protocol and (2) automatic engagement

detector d (q=0.50).

  • Using hand-coded labels, corr(U, V) was estimated as r=0.37.
  • Given n, q, r, what is probability of false correlation from d?
slide-13
SLIDE 13

ICML’2019

Case study: Student engagement

  • vs. cognitive task performance

25 50 75 100 125 150 175 200 # participants (n) 0.0 0.1 0.2 0.3 0.4 0.5 3roEaEility

(ngagePent new: 3roEaEility of "false negative" correlation (q 0.5, r 0.37)

n

U: Engagement V: Cognitive task performance

  • Whitehill et al. 2014 measured student engagement using (1)
  • bservational protocol and (2) automatic engagement

detector d (q=0.50).

  • Using hand-coded labels, corr(U, V) was estimated as r=0.37.
  • Given n, q, r, what is probability of false correlation from d?