Defense Def ense and and Homeland S Homeland Secu ecurity rity - - PowerPoint PPT Presentation

defense def ense and and homeland s homeland secu ecurity
SMART_READER_LITE
LIVE PREVIEW

Defense Def ense and and Homeland S Homeland Secu ecurity rity - - PowerPoint PPT Presentation

U.S. Army Research, Development and Engineering Command Learning in Intelligent Tutoring Environments (LITE) Lab personnel at USMA, April 2011 (L-R): Dr. Robert Sottilare Dr. Heather Holden Mr. Keith Brawner Mr. Benjamin Goldberg


slide-1
SLIDE 1

U.S. Army Research, Development and Engineering Command

*UNCLASSIFIED – FOUO*

Def Defense ense and and Homeland S Homeland Secu ecurity rity Sim imula ulation tion

Realti ltime me Cl Clusterin stering Of Unla labell lled Se Senso sory y Da Data ta For r Traine ainee e Sta State te Assessment ssessment Se Septe temb mber r 2011

LITE TE Lab

Learning in Intelligent Tutoring Environments (LITE) Lab personnel at USMA, April 2011 (L-R):

  • Dr. Robert Sottilare
  • Dr. Heather Holden
  • Mr. Keith Brawner
  • Mr. Benjamin Goldberg
slide-2
SLIDE 2

2

*UNCLASSIFIED – FOUO*

2

Trainee State Assessment for ITS

  • Student Actions
  • Sensor Data
  • Assessment and Classification
  • Instructional Strategy Decision?

Sensors Sensors Sensors Trainee Trainee Assessment / Classification Instructional Strategy Decision

slide-3
SLIDE 3

3

*UNCLASSIFIED – FOUO*

3

Difficulties of Sensing

  • People are not

consistent

– Day to day – Baseline to Baseline

  • Unsupervised learning
  • Real-time processing
  • Deterministic Algorithms
  • Datastream problems
  • Infinite Length
  • Concept Detection
  • Concept Drift
  • Concept Evolution
slide-4
SLIDE 4

4

*UNCLASSIFIED – FOUO*

4

Potential Solutions

  • Incremental Clustering

– K-means – Agglomerate

  • Growing Neural Gas
  • Adaptive Resonance Theory
slide-5
SLIDE 5

5

*UNCLASSIFIED – FOUO*

5

Incremental K-Means Clustering

  • Strengths

– Benchmark approach – Well supported

  • Weaknesses

– Must know K – Responds to data frequency – Partitions poorly – NP-hard (general case) – Order sensitive (inc. case) Algorithm:

For each point Compare point to all known clusters If no cluster is within vigilance create new cluster here else move matched cluster up to <delta> in the direction

  • f the recent point
slide-6
SLIDE 6

6

*UNCLASSIFIED – FOUO*

6

Agglomerate Clustering

  • Strengths

– Modified inc. k-means – Accounts for cluster merging – Order insensitive – Do not have to know k

  • Weaknesses

– None (a priori) – Low coverage (a posteriori) Algorithm:

Move the closest centroid towards the datapoint Merge the two closest centroids, if appropriate Creates one redundant centroid Set redundant centroid equal to the datapoint

slide-7
SLIDE 7

7

*UNCLASSIFIED – FOUO*

7

Adaptive Resonance Theory

  • Strengths

– Order insensitive – Merges – Responds to new concepts

  • Weaknesses

– Box-shapes – Parameterization issues – NN issues (trending) Algorithm:

Apply new input pattern Compute activation of all neurons Select winning neuron Vigilence test If vigilence is relevant, add new pattern Else not, test next best neuron Else (no neurons), initialize new neuron

slide-8
SLIDE 8

8

*UNCLASSIFIED – FOUO*

8

Growing Neural Gas

  • Strengths

– Responds to new concepts – Order sensitive

  • Weaknesses

– Order sensitive – Data frequency response – Gradient Descent

  • Slows with additional data

Algorithm:

If appropriate (current point does not correspond to known information) create new reference arc store error Else, increment age of all arcs in this area move existing arcs towards new data, establish new ages for arcs Remove Aged arcs If any non-emanating arcs exist, remove If it is the time to add a new point (timing) Add a new reference point, halve the distances of the existing arcs to this point, scale the existing errors Compute path of all arcs For this point against each class: If there are few related nodes, compute the probability of the point belonging to the lowest error class Else determine the modified shape of the cluster it is most likely to belong to

slide-9
SLIDE 9

9

*UNCLASSIFIED – FOUO*

9

Raw Datasets

Clusters, Ordered Clusters, Unordered X: ECG Y: GSR X: STE Y: Engagement

slide-10
SLIDE 10

10

*UNCLASSIFIED – FOUO*

10

Performance – Ordered Shapes

Agglomerate

  • Inc. K-means

ART GNG

slide-11
SLIDE 11

11

*UNCLASSIFIED – FOUO*

11

Performance – Unordered Shapes

Agglomerate

  • Inc. K-means

ART GNG

slide-12
SLIDE 12

12

*UNCLASSIFIED – FOUO*

12

Performance – EEG (STE/Engagement)

Agglomerate

  • Inc. K-means

ART GNG

slide-13
SLIDE 13

13

*UNCLASSIFIED – FOUO*

13

Performance – ECG/GSR

Agglomerate (Failed)

  • Inc. K-means

ART GNG

slide-14
SLIDE 14

14

*UNCLASSIFIED – FOUO*

14

Performance – 4x speed movie

slide-15
SLIDE 15

15

*UNCLASSIFIED – FOUO*

15

Conclusions / Future Work

  • Use constraint-based approaches

– Semi-supervised clustering – Requires selection of initial algorithm

  • Associate performance data with state data

– More complete student picture

  • Evaluate against validated dataset

– Determine sensors to use

  • Evaluate in an ITS system

– Includes instructional strategy selection

  • Use clusters as states, forecast movement between

them