novelty detection from an ego centric perspective
play

Novelty Detection from an Ego- Centric Perspective Omid Aghazadeh, - PowerPoint PPT Presentation

Novelty Detection from an Ego- Centric Perspective Omid Aghazadeh, Josephine Sullivan, and Stefan Carlsson Presented by Randall Smith Friday, November 16, 12 1 Outline Introduction Sequence Alignment Appearance Based Cues


  1. Novelty Detection from an Ego- Centric Perspective Omid Aghazadeh, Josephine Sullivan, and Stefan Carlsson Presented by Randall Smith Friday, November 16, 12 1

  2. Outline • Introduction • Sequence Alignment • Appearance Based Cues • Geometric Similarity • Example • Dynamic Time Warping • Algorithm • Evaluation of Similarity Matching • Results • Conclusion Friday, November 16, 12 2

  3. Introduction • Problem: Select relevant visual input from worn, mobile camera. • Motivation: • Routine Recognition [Blanke & Schiele 2009] • Life Logging [Doherty & Smeaton 2010] [Schiele et. al. 2007] • Memory assistance [Hodges et. al. 2006] Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 3

  4. Introduction : Memory Selection • We must decide what visual inputs to remember. • How should this be done? • Novelty detection. • What is novelty detection? Friday, November 16, 12 4

  5. Introduction : Novelty Detection Known All Inputs Inputs • Novelty = All Inputs - Known Inputs • Novelty detection : identification of inputs that di ff er from previously seen inputs. • Novelty detection can help decide on what is worth remembering. Friday, November 16, 12 5

  6. Introduction : Setup • Heuristic: detect novelty as deviation from background. • Context: collect video sequences from from daily commute to work. • Equipment: 4cm camera + memory stick. Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 6

  7. Introduction : Dataset Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 7

  8. Sequence Alignment • Novelty is defined as a failure to register a sequence with a set of stored reference sequences (25 Hz videos sampled at 1 Hz.) • Accomplished by sequence alignment, via Dynamic Time Warping (DTW). Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 8

  9. Sequence Alignment : Discussion • Could we define or detect novelty in some other way? Friday, November 16, 12 9

  10. Sequence Alignment : Dynamic Time Warping M Time Series A Time Series B Friday, November 16, 12 10

  11. Sequence Alignment : Similarity • In order to use DTW, need to define some cost function • This can by defining a measure of similarity between each pair of frames. • Can use appearance based cues (SIFT, VLAD) to do this. Image: link Friday, November 16, 12 11

  12. Appearance Based Cues • Can compute a fixed length vector each frame and use a kernel in order to compare similarity. • Use SIFT or VLAD/SIFT to compute Bag of Features (BoF). • VLAD: Vector of Locally Aggregated Descriptors: • (1) get k-means code book, and • (2) for each codeword C • take the L2-normalized sum of all the vectors assigned to it. Image: link Friday, November 16, 12 12

  13. Geometric Similarity • Appearance based cues alone are not accurate enough. • Need to match local structures in a geometrically consistent way. • Need a transformation that will do this: fundamental matrix. • The measure of similarity will be the percentage of inliers in an initial set of putative matches, w.r.t to estimated fundamental matrix. • Match against homography mapping to assess correctness of hypothetical fundamental matrix Friday, November 16, 12 13

  14. Geometric Similarity : Discussion • Could we supplement or substitute some other measure of similarity? • How could di ff erent similarity measures a ff ect novelty? Friday, November 16, 12 14

  15. Example Friday, November 16, 12 15

  16. Example Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 16

  17. Dynamic Time Warping M • Define a path : Time Series A p = { ( i 1 , j 1 ) , . . . , ( i K , j K ) } • s.t. (1) , and ( i 1 , j 1 ) = (1 , 1) ( i K , j K ) = ( M, N ) (2) p k +1 − p k ∈ { (0 , 1) , (1 , 0) , (1 , 1) } • Define a cost function c ( i, j ) ≥ 0 K p N X • Let C p = c ( i k , j k ) Time Series B k =1 p ∗ = argmin p C p • Want . • Solved via dynamic programming. Friday, November 16, 12 17

  18. Algorithm • compute features , and nearest neighbor distance ratio F 1 F 2 • keep best matches based on this ordering N P • compute loose homography and inliers H L P H • compute 5 point fundamental matrix from and inliers P HE E P H f s = min(1 , α max(0 , | P HR | • compute similarity − β )) | P | Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 18

  19. Algorithm : Cost Matrix • Need to compute similarity matrix for sequences and . s 1 s 2 • Convert to cost matrix via zero-mean Gaussian with standard deviation . σ c • Why? Noise? • Use DTW to find optimal alignment! • Problem: this is expensive. Friday, November 16, 12 19

  20. Algorithm : Optimization • Optimization: for each frame in find the k nearest neighbors in . s 1 s 2 • Evaluate only the k nearest neighbors instead. Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 20

  21. Algorithm : Match Cost • Let correspond to frame indices in and to frame indices in . j i s 2 s 1 • Let be the minimum cost path from DTW. δ s 1 ,s 2 • The match cost for a frame in to is λ ( i, δ s 1 ,s 2 ) i s 1 s 2 ( if ∃ ( i k , j k ) ∈ δ s 1 ,s 2 s.t. i = i k C i k ,j k λ ( i, δ s 1 ,s 2 ) = 1 otherwise • where is the value of the cost matrix at . ( i k , j k ) C i k ,j k Friday, November 16, 12 21

  22. Algorithm : Novelty Detection • Compute the minimum match cost for each frame in the query sequence: E ( s ( i ) t ) = min s r ∈ S λ ( i, δ s q ,s r ) • where contains all reference sequences. S • Threshold the minimum match cost to find novelties. • Smoothing: Gaussian mask applied to prior to matching with and using σ N 1 threshold . − 23 σ 2 Θ N = e c Friday, November 16, 12 22

  23. Algorithm : Discussion • How else could we implement memory selection or novelty detection? • How does this scale with the number of stored sequences? Friday, November 16, 12 23

  24. Evaluation of Similarity Matching • minimum intersection kernel for BoF and degree one polynomial kernel for VLAD/SIFT • VLAD + BoF + Dense (gray + color) -> 88% = best Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 24

  25. Results : Detecting Novelty Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 25

  26. Results : Precision Recall Curves and Matches Image: CVPR 2011, Aghazadeh et. al., link Friday, November 16, 12 26

  27. Conclusion • The scalability of this algorithm seems to be an issue. • It would be interesting to explore alternative measures of similarity or novelty. • Could this be converted to purely use clustering and only store clips for reference (by the user). • The dataset is quite small, which is understandable given their technique, but perhaps an improved technique could make this work better? Friday, November 16, 12 27

  28. References • H. Jegou, M. Douze, C. Schmid, and P . Perez. Aggregating local descriptors into a compact image representation. In CVPR, 2010. • M. Muller. Information retrieval for music and motion. Springer-Verlag New York Inc, 2007. • Novelty Detection from an Egocentric Perspective. O. Aghazadeh, J. Sullivan, and S. Carlsson. CVPR 2011 Friday, November 16, 12 28

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend