Active Learning for Multimedia

ACM Multimedia 2007 Half Day Tutorial

Georges Quénot

Multimedia Information Retrieval Group
Laboratoire d'Informatique de Grenoble

September 24, 2007


Tutorial Outline

  • Introduction / example
  • TRECVID and evaluation
  • Active learning principles
  • Application categories
  • Implementation aspects
  • Some works in active learning
  • A case study in the context of TRECVID

    – Part 1: Evaluation of active learning strategies
    – Part 2: TRECVID 2007 collaborative annotation

  • Conclusion and perspectives

Introduction


Active learning

Two meanings:

  • Human active learning: the teacher requires active
    participation from the pupils, not just passive listening.
  • Machine active learning: supervised machine learning in which
    the learning system interacts with a teacher / annotator /
    oracle to get new samples to learn from.

We consider here only machine active learning.


Learning a concept from labeled examples

Raw data: need for a teacher / annotator / oracle / user → human intervention → high cost

Learning a concept from labeled examples

Full annotation: possibly optimal in quality, but with the highest
cost.

(Figure: example images labeled “Cats”, “Cats?” and “Non cats”.)


Learning a concept from labeled examples

Partial annotation: less costly, possibly of similar quality, but
“good” examples must be selected for annotation.

(Figure: example images labeled “Cats”, “Cats?” and “Non cats”.)

Learning a concept from labeled examples

Incremental partial annotation: samples for annotation are
selected on the basis of a class prediction using a learning
system → relevance feedback or query learning.

(Figure: example images labeled “Cats”, “Cats?” and “Non cats”.)


TRECVID and Evaluation

TRECVID “High Level Feature” detection task

From the NIST site:

  • Text Retrieval Conference (TREC): encourage research in
    information retrieval by providing a large test collection,
    uniform scoring procedures, and a forum for organizations
    interested in comparing their results.
  • TREC Video Retrieval Evaluation (TRECVID): promote progress
    in content-based retrieval from digital video via open,
    metrics-based evaluation:
    http://www-nlpir.nist.gov/projects/trecvid/
  • High Level Feature (HLF) detection task: contribute to work on
    a benchmark for evaluating the effectiveness of detection
    methods for semantic concepts.


TRECVID 2006 “High Level Feature” detection task

  • Find 39 concepts (High Level Features, LSCOM-lite) in 79484
    shots (160 hours of Arabic, Chinese and English TV news).
  • For each concept, propose a ranked list of 2000 shots.
  • Performance measure: Mean (Inferred) Average Precision on 20
    concepts.
  • Distinct fully annotated training set: TRECVID 2005
    collaborative annotation on the development collection: 39
    concepts on 74523 subshots, many of them annotated multiple
    times.
  • 30 participants. Best Average Precision: 0.192.

20 LSCOM-lite features evaluated

charts, military personnel, maps, police security, explosion fire,
corporate leader, people marching, waterscape/waterfront, truck,
mountain, car, desert, airplane, meeting, US flag, office,
computer TV screen, weather, animal, sports


Frequency of hits by features

[from Paul Over and Wessel Kraaij, 2006]


LSCOM

Large Scale Concept Ontology for Multimedia

  • LSCOM: 850 concepts, selected for:

    – What is realistic (developers),
    – What is useful (users),
    – What makes sense to humans (psychologists).

  • LSCOM-lite: 39 concepts, subset of LSCOM.
  • Annotation of 441 concepts on ~65K subshots of the TRECVID
    2005 development collection.
  • 33,508,141 concept × annotations → about 20,000 hours or 12
    man × years of effort at 2 seconds/annotation.
  • Possibly the same efficiency using active learning with only a
    2 or 3 man × years effort.
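A quick back-of-the-envelope check of these figures (my
arithmetic; the ~1,600 working hours per person-year is an
assumption, not from the slides):

```latex
% 33,508,141 annotations at 2 seconds each:
\[
33{,}508{,}141 \times 2\,\mathrm{s} \approx 6.7\times 10^{7}\,\mathrm{s}
\approx 18{,}600\,\mathrm{h} \approx 20{,}000\,\mathrm{h}
\]
% at an assumed ~1,600 working hours per person-year:
\[
\frac{18{,}600\,\mathrm{h}}{1{,}600\,\mathrm{h/year}}
\approx 12\ \text{man}\times\text{years}
\]
```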

Metrics: precision and recall

From relevant and non relevant sets

                  Relevant                      Non relevant
Retrieved         relevant and retrieved        non relevant but retrieved
                  → corrects                    → false positives
Not retrieved     relevant but not retrieved    non relevant and not retrieved
                  → false negatives

Metrics: precision and recall

From relevant and non relevant sets

Precision = (Retrieved and Relevant) / Retrieved = Corrects / Retrieved
Recall = (Retrieved and Relevant) / Relevant = Corrects / Relevant
F-measure = 2 × Corrects / (Retrieved + Relevant)
Error rate = (False positives + False negatives) / Relevant
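A minimal Python sketch of these set-based definitions (my
illustration, not part of the tutorial):

```python
# Set-based retrieval metrics from a retrieved set and a relevant set.
def set_metrics(retrieved: set, relevant: set) -> dict:
    corrects = len(retrieved & relevant)          # relevant and retrieved
    precision = corrects / len(retrieved) if retrieved else 0.0
    recall = corrects / len(relevant) if relevant else 0.0
    f_measure = 2 * corrects / (len(retrieved) + len(relevant))
    return {"precision": precision, "recall": recall, "f": f_measure}

print(set_metrics({1, 2, 3, 4}, {2, 4, 6}))
# → precision 0.5, recall ~0.67, F-measure ~0.57
```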


Metrics: Recall × Precision curves

From ranked lists

  • Results ranked from most probable to least probable: more
    informative than just “relevant / non relevant”.
  • For each k: set Retk of the k first retrieved items.
  • Fixed set Rel of the relevant items.
  • For each k: Recall(Retk, Rel), Precision(Retk, Rel).
  • Curve joining the (Recall, Precision) points with k varying
    from 1 to N = total number of documents.
  • Interpolation: Precision = f(Recall) → continuous curve.
  • “Standard” program: trec_eval
    (ranked lists, relevant sets) → RP curve, MAP, ... (a small
    average precision sketch follows).
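A minimal sketch of (non-interpolated) average precision, the
per-query quantity that MAP averages over topics or concepts;
this is an illustration, not the trec_eval implementation:

```python
# Average precision: mean of Precision@k taken at each rank k where a
# relevant item appears, normalized by the number of relevant items.
def average_precision(ranked_ids: list, relevant: set) -> float:
    hits, precision_sum = 0, 0.0
    for k, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / k        # Precision@k at this hit
    return precision_sum / len(relevant) if relevant else 0.0

# Relevant items ranked 1st and 3rd: (1/1 + 2/3) / 2 ≈ 0.833
print(average_precision(["a", "b", "c"], {"a", "c"}))
```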

Metrics: Recall × Precision curves

From ranked lists

Area under the curve: Mean Average Precision (MAP)

Active learning principles


Active learning

  • Machine learning:

– Learning from data.

  • Supervised learning:

– Learning from labeled data: human intervention.

  • Incremental learning:

    – Learning from training sets of increasing sizes,
    – Algorithms to avoid full retraining of the system at each step.

  • Active learning:

– Selective sampling: select the “most informative” samples for annotation: optimized human intervention.

  • Offline active learning: indexing (classification).
  • Online active learning: search (relevance feedback).

Supervised learning

  • A machine learning technique for creating a function from
    training data.
  • The training data consist of pairs of input objects (typically
    vectors) and desired outputs.
  • The output of the function can be a continuous value
    (regression) or a class label (classification) of the input
    object.
  • The task of the supervised learner is to predict the value of
    the function for any valid input object after having seen a
    number of training examples (i.e. pairs of input and target
    output).
  • To achieve this, the learner has to generalize from the
    presented data to unseen situations in a “reasonable” way.
  • The parallel task in human and animal psychology is often
    referred to as concept learning (in the case of classification).
  • Most commonly, supervised learning generates a global model
    that helps mapping input objects to desired outputs.

(http://en.wikipedia.org/wiki/Supervised_learning)

Supervised learning

  • Target function: f : X → Y, x → y = f(x)

    – x: input object (typically a vector),
    – y: desired output (continuous value or class label),
    – X: set of valid input objects,
    – Y: set of possible output values.

  • Training data: S = (xi,yi)(1 ≤ i ≤ I)

    – I: number of training samples.

  • Learning algorithm: L : (X×Y)* → Y^X, S → f = L(S)
    ( where (X×Y)* = ∪n∈N (X×Y)^n )
  • Regression or classification system: y = [L(S)](x) = g(S,x)


Model based supervised learning

  • Two functions, “train” and “predict”, cooperating via a model.
  • General regression or classification system:
    y = [L(S)](x) = g(S,x)
  • Building of a model (train): M = T(S)
  • Prediction using a model (predict):
    y = [L(S)](x) = g(S,x) = P(M,x) = P(T(S),x)
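A minimal sketch of this train/predict decomposition; the
nearest-centroid model and all names are illustrative assumptions,
not the tutorial's system:

```python
# Train/predict split: M = T(S), then y = P(M, x).
import numpy as np

def train(X: np.ndarray, y: np.ndarray) -> dict:
    """T(S): build a model M (here: one centroid per class)."""
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def predict(M: dict, x: np.ndarray):
    """P(M, x): label of the nearest class centroid."""
    return min(M, key=lambda c: np.linalg.norm(x - M[c]))

X = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
y = np.array([0, 0, 1, 1])
M = train(X, y)                            # M = T(S)
print(predict(M, np.array([0.8, 0.9])))    # y = P(T(S), x) → 1
```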

Supervised learning: classification problem

(Diagram: training samples → Train → Model → Predict →
predicted classes for testing samples x.)

S = (xi,yi)(1 ≤ i ≤ I)
M = T(S) = T((xi,yi)(1 ≤ i ≤ I))
y = P(M,x) = P(T(S),x)


Supervised learning: classification problem, with annotation

(Diagram: unlabeled samples → Annotate → class judgments →
Train → Model → Predict → predicted classes.)

U = (xi)(1 ≤ i ≤ I), C = (yi)(1 ≤ i ≤ I)
yi = A(xi) (A ⇔ f)
M = T(S) = T((xi,yi)(1 ≤ i ≤ I))
y = P(M,x) = P(T(S),x)

Incremental supervised learning: classification problem

  • Training sets of increasing sizes (Ik)(1 ≤ k ≤ K):
    Sk = (xi,yi)(1 ≤ i ≤ Ik)
    ( Uk = (xi)(1 ≤ i ≤ Ik), Ck = (yi)(1 ≤ i ≤ Ik) )
  • Model refinement: Mk = T(Sk)
  • Prediction refinement: yk = P(Mk,x), y = P(MK,x)
  • Possible incremental estimation (k > 1):
    Mk = T’(Mk−1, Sk − Sk−1)
  • Useful for large data sets, model adaptation (concept drift), …

Incremental supervised learning: classification problem

(Diagram: training samples of increasing sizes → Train → models
Mk → Predict → predicted classes.)

Sk = (xi,yi)(1 ≤ i ≤ Ik)
Mk = T(Sk) = T((xi,yi)(1 ≤ i ≤ Ik))
yk = P(Mk,x) = P(T(Sk),x)
y = P(MK,x) = P(T(SK),x)

Incremental supervised learning: classification problem, with annotation

(Diagram: unlabeled samples → Annotate → class judgments →
Train → models Mk → Predict → predicted classes.)

U = (xi)(1 ≤ i ≤ I), Ck = (yi)(1 ≤ i ≤ Ik)
yi = A(xi) (A ⇔ f)
Mk = T(Sk) = T((xi,yi)(1 ≤ i ≤ Ik))
y = P(MK,x) = P(T(SK),x)


Active learning basics

  • Concept classification → “semantic gap” problem.
  • How to improve classification performance?

    – Optimize the model and the train/predict algorithm,
    – Get a large training set: quantity, quality, …

  • Cost of corpus annotation:

    – Getting large corpora is (quite) easy and cheap (already there),
    – Getting annotations on them is costly (human intervention).

  • Active learning:

    – Use an existing system and heuristics for selecting the
      samples to annotate → needs a classification score,
    – Annotate first (or only) the samples expected to be the most
      informative for system training → various strategies,
    – Get the same performance with fewer annotations and/or better
      performance with the same annotation count.

Supervised learning: classical approach

(Diagram: all training samples are annotated → full class
judgments → Train → Model → Predict → predicted classes.)


Active learning classification

(Diagram: only a fraction of the training samples is annotated →
partial class judgments → Train → Model → Predict; sample scores
feed a Select step that chooses the next samples to annotate.)

Active learning basics

  • Incremental process:

    – Needs at least one classification system (several for some
      strategies),
    – Small increments are better → compromise with system
      re-training cost,
    – “Cold start” problem: needs at least a few samples for each
      class to bootstrap, or start with a “random” or cluster-based
      strategy,
    – True incremental learning (actual model adaptation) is
      possible but not necessary.

  • Use for classification system training (offline).
  • Use for corpus annotation (offline).
  • Use during search (relevance feedback, online).

Active learning strategies

  • Query by committee (Seung, 1992): choose the samples which
    maximize the disagreement amongst systems.
  • Uncertainty sampling (Lewis, 1994): choose the most uncertain
    samples; tries to increase the sample density in the
    neighborhood of the frontier between positives and negatives
    → improves the system's precision.
  • Relevance sampling: choose the most probable positive
    samples; tries to maximize the size of the set of positive
    samples (positive samples are most often sparse within the
    whole set and finding negative samples is easy).
  • Choose the farthest samples from the already evaluated ones;
    tries to maximize the variety of the evaluated samples →
    improves the system's recall.
  • Combinations of these, e.g. choose the samples amongst the
    most probable ones and amongst the farthest from the already
    evaluated ones.
  • Choose samples by groups which maximize the expected global
    knowledge gain (Souvannavong, 2004).
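The first strategies can be written as simple per-sample scores; a
minimal sketch (my illustration, with p the classifier's estimated
probability that a sample is positive):

```python
# Selection scores: higher = annotate earlier.
import numpy as np

def relevance_score(p: np.ndarray) -> np.ndarray:
    return p                                 # most probable positives first

def uncertainty_score(p: np.ndarray) -> np.ndarray:
    return 1.0 - 2.0 * np.abs(p - 0.5)       # closest to the frontier first

def committee_score(votes: np.ndarray) -> np.ndarray:
    """votes: (n_members, n_samples) array of {0,1} committee labels;
    score by disagreement (fraction of positive votes near 0.5)."""
    pos_frac = votes.mean(axis=0)
    return 1.0 - 2.0 * np.abs(pos_frac - 0.5)

p = np.array([0.05, 0.45, 0.90])
print(relevance_score(p).argmax())     # 2: most probable positive
print(uncertainty_score(p).argmax())   # 1: most uncertain
```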

Simulated active learning

  • How efficient is the active learning approach?
  • Experiment on strategies and problem parameters.
  • Simulated (artificial) active learning:

    – Use a fully annotated training set,
    – Simulate incremental annotation of the training set using
      various strategies,
    – Use a distinct testing set (if possible with posterior
      contents) for concept learning evaluation (not for corpus
      annotation evaluation),
    – Analyze the effect of various parameters.

  • Some reasonable assumptions, e.g. that the order in which the
    annotations are done by the evaluators does not significantly
    influence their judgments.
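A minimal sketch of such a simulation loop (my illustration, using
scikit-learn; the labels already exist but are revealed only when
a strategy "annotates" them):

```python
# Simulated active learning with uncertainty sampling; swap the
# ordering rule to simulate other strategies.
import numpy as np
from sklearn.linear_model import LogisticRegression

def simulate(X, y_true, step=100, n_steps=10):
    # cold start with a few positives and negatives (both classes needed)
    labeled = (np.flatnonzero(y_true == 1)[:10].tolist()
               + np.flatnonzero(y_true == 0)[:20].tolist())
    for _ in range(n_steps):
        clf = LogisticRegression(max_iter=1000).fit(X[labeled], y_true[labeled])
        p = clf.predict_proba(X)[:, 1]
        order = np.argsort(np.abs(p - 0.5))      # most uncertain first
        done = set(labeled)
        labeled += [i for i in order if i not in done][:step]
        yield len(labeled), clf   # evaluate clf on a distinct test set here
```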


Application categories

Application: concept learning

  • Most popular application.
  • Offline use: mainly used for classifier training, not for
    interaction with a user.
  • Goals:

    – Increase the learning performance for a given annotation cost, or
    – Reduce the annotation cost for a given learning performance, or
    – Seek the best annotation cost versus learning performance
      compromise.

  • Evaluation:

    – Simulated active learning,
    – Distinct development and test collections,
    – Mean Average Precision performance metrics,
    – MAP as a function of the annotated fraction of the development set,
    – Comparison with different AL strategies and/or parameter values.

  • It does work: huge effects reported in a variety of areas.

Application: corpus annotation

  • Growing application.
  • Offline use: used for corpus annotation, not for interaction
    with a user.
  • Principles:

    – A fraction of the corpus is manually annotated,
    – The remainder of the corpus is automatically annotated using a
      classifier trained on the manually annotated part,
    – The classifier is only temporarily used for the corpus
      annotation; it is not a goal in itself.

  • Goals:

    – Increase the full corpus annotation quality for a given manual
      annotation cost, or
    – Reduce the manual annotation cost for a given corpus
      annotation quality, or
    – Seek the best manual annotation cost versus full annotation
      quality compromise.

  • Evaluation:

    – Simulated active learning,
    – Same collection for development and test,
    – Error rate performance metrics,
    – Error rate as a function of the manually annotated fraction,
    – Comparison with different AL strategies and/or parameter values.

  • It does work: significant effects reported in several areas.

Application: search (relevance feedback)

  • Popular application.
  • Online use: used for interaction with a user.
  • Principles:

    – The user information need is considered as a concept to be learnt,
    – An incremental supervised learning system is trained using
      user feedback,
    – The classifier is only temporarily used for the search task;
      it is not a goal in itself.

  • Goals:

    – Increase the search result quality for a given number of
      feedback cycles, or
    – Reduce the number of feedback cycles for a given search
      result quality, or
    – A combination of both.

  • Evaluation:

    – Simulated active learning and user interaction,
    – Same collection for development and test,
    – MAP on the last result list as the performance metric,
    – MAP as a function of the number of feedback cycles,
    – Comparison with different AL strategies and/or parameter values.

  • No random baseline (or the baseline is no feedback): only
    comparison between strategies.


Implementation aspects

Online versus offline active learning

  • Relevance feedback: everything is online.
  • Concept learning and corpus annotation:

    – Offline relative to the final use, but online relative to the
      teacher,
    – The teacher may have to wait for the system to select new
      samples for annotation,
    – The system may have to wait for the teacher to do new
      annotations.

  • The AL iteration cycle has to be optimized:

    – Fast system training → possible use of a simplified learning
      system,
    – Efficient user work → annotate several samples in a series.


Active learning iteration cycle

  • Compromise about the step (or chunk) size:

    – The smaller, the better for AL efficiency,
    – The larger, the better for training time cost and teacher work
      efficiency.

  • Interlacing of training and annotating phases for different
    concepts:

    – Better for user work: possibly continuous activity,
    – Still needs a compromise about the step or chunk size,
    – Consider the relation between annotation time and training
      time → annotation driven classifier retraining,
    – Largely error prone because of frequent changes in the
      concepts to be annotated.

Parallel annotation of several concepts on one shot / key frame:
TRECVID 2003

Parallel annotation of one concept on several shots / key frames:
TRECVID 2005 and 2007

User effects

Annotation errors or ambiguities:

  • Inconsistencies of up to 3% between two different annotators
    have been reported for concept annotations in video shots,
    even in good conditions (TRECVID collaborative annotation
    2005). Even worse with more than two annotators.
  • Actual ambiguities (how many stairs make a stairway?).
  • True human errors: many possible causes (e.g. annotating a
    helicopter as an airplane, or failing to notice a change in the
    concept to annotate).
  • Significant impact: false positives and false negatives really
    hurt system performance.
  • Active learning makes things worse because of frequent
    context changes.


User effects

Active learning can help:

  • Avoid full double or triple annotation to remove errors or
    resolve easy ambiguities.
  • Ask for a second opinion only for those samples whose
    annotation contradicts a confident classification → active
    cleaning (a small sketch follows).
  • Ask for a third opinion only if the second opinion is
    inconsistent with the first one.
  • Use only consistent samples for retraining.
  • Could be evaluated using simulated active learning with a
    multiply fully annotated corpus (e.g. LSCOM – TRECVID 2005).
  • Need to arbitrate between new annotations and re-annotations.
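A minimal sketch of that second-opinion rule (my reading of the
slide; the confidence threshold is an assumption):

```python
# Active cleaning: re-annotate first the samples whose human label
# contradicts a high-confidence prediction of the current classifier.
import numpy as np

def cleaning_candidates(p_pos: np.ndarray, labels: np.ndarray,
                        threshold: float = 0.9) -> np.ndarray:
    """p_pos: predicted P(positive); labels: human labels in {0,1}.
    Returns indices to send for a second opinion, most suspect first."""
    conf_pos = (p_pos > threshold) & (labels == 0)
    conf_neg = (p_pos < 1 - threshold) & (labels == 1)
    suspects = np.flatnonzero(conf_pos | conf_neg)
    # strongest label/prediction disagreement first
    return suspects[np.argsort(-np.abs(p_pos[suspects] - labels[suspects]))]

p = np.array([0.95, 0.50, 0.02, 0.97])
lab = np.array([0, 1, 1, 1])
print(cleaning_candidates(p, lab))   # → [2 0]: the two conflicts
```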

Use of a fully featured search system

  • Beyond the use of a simple classifier.
  • Use of a general purpose content-based search system.
  • The concept to be annotated is the query: search by active
    learning, but not limited to relevance feedback.
  • Multi-criteria search (video example, not exhaustive):

    – Keywords from Automatic Speech Recognition,
    – Image examples,
    – Already trained and indexed (other) concepts,
    – Visual similarity to already found positive samples,
    – Temporal closeness to already found positive samples.

  • Mostly useful for sparse concepts (the general case).
  • Possible solution to the cold start problem (but may require
    some system training from other sources).
  • Not reported yet, but promising.

Use of a fully featured search system


Game-based collaborative annotation


Game-based collaborative annotation


Game-based collaborative annotation


Features (not exhaustive)

  • Low-level visual features:

    – Color: color histograms or color moments in various color spaces,
    – Texture: Gabor or wavelet transforms,
    – Motion: motion vectors or statistics on motion vectors,
    – Local and global features: Principal Component Analysis for
      data dimension reduction and noise cleaning.

  • Low and intermediate audio features:

    – Word vector representations from Automatic Speech
      Recognition (ASR),
    – Music / noise / gender detection,
    – Mel Frequency Cepstral Coefficients.

  • Intermediate features:

    – Output from further preprocessing: text categories, …

Some works in Active Learning

Queries and Concept Learning

[Dana Angluin, 1988]

  • Widely cited in the active learning literature, but refers to
    Shapiro's [1981,1982,1983] Algorithmic Debugging System,
    which uses queries to the user to pinpoint errors in Prolog
    programs, and to Sammut and Banerji's [1986] system, also for
    concept learning.
  • Queries to instructors for concept learning tasks.
  • Queries from the system to a human being (the opposite of
    queries in an information retrieval system).
  • Problem: identify an unknown set L* from a finite or countable
    hypothesis space L1, L2, … of subsets of a universal set U.
  • The system has access to oracles that can answer specific
    kinds of queries about the unknown concept L*: membership,
    equivalence, subset, superset, disjointness, exhaustiveness.
  • Majority vote strategy: identification of the target set in
    L1, …, LN in ⎣log2 N⎦ steps.
  • Not easily transposable to multimedia indexing because
    indexing at the sample level usually supports only the
    membership query type.

Query by Committee

[H.S. Seung et al, 1992]

  • Widely cited in the literature.
  • Committee of students (learning programs).
  • Queries from the system to a human being (≠ queries in IR).
  • The next query is chosen according to the principle of maximal
    disagreement.
  • Parametric models with continuously varying weights.
  • Teacher: σ0(X), with X the input vector (output space is {−1,+1}).
  • Student: σ(W;X), with W the weight vector of the student function.
  • The training set is built up one sample at a time:
    SP = (Xt,σt)(1 ≤ t ≤ P)
  • Version space: set of all W which are consistent with the
    training set: WP = { W : σ(W;Xt) = σt , 1 ≤ t ≤ P }


Query by Committee

[H.S. Seung et al, 1992]

  • Flat prior distribution P0(W):
    P(W | SP) = 1/VP if W ∈ WP, 0 otherwise, with VP = volume(WP)
  • Information gain: IP+1 = −log(VP+1/VP)
  • Choose the XP+1 that maximizes the information gain (not
    trivial).
  • Two test applications: the high-low game and perceptron
    learning of another perceptron.
  • Query by committee learning:

    – Asymptotically finite information gain: the volume of the
      parameter space consistent with the observations is divided
      by a fixed finite factor,
    – Generalization error decreases exponentially.

  • Random sampling:

    – Asymptotically null information gain,
    – Generalization error decreases with an inverse power law.

Query by Committee

[H.S. Seung et al, 1992]

  • Suggests a criterion for a good query algorithm:
    asymptotically finite information gain.
  • Closer to the multimedia indexing problem (membership-only
    queries), but assumes that:

    – The actual teacher function can be reached by a given W0,
    – The next sample can be chosen arbitrarily in the input space,
    – The parameter space does not vary with the number of samples
      → correct for a perceptron with a fixed architecture, but not
      for classifiers in which the number of parameters is adjusted
      to or depends upon the size of the training set (e.g. Support
      Vector Machines).


Uncertainty sampling

[David Lewis and William Gale, 1994]

  • “A Sequential Algorithm for Training Text Classifiers”.
  • Membership queries (from system to human, again).
  • Use of a probabilistic classifier.
  • Algorithm:

    1. Create an initial classifier.
    2. While the teacher is willing to label examples:
       (a) Apply the current classifier to each unlabeled example,
       (b) Find the b examples for which the classifier is least
           certain of class membership,
       (c) Have the teacher label the subsample of b examples,
       (d) Train a new classifier on all labeled examples.

  • Really close to the multimedia indexing/retrieval problem.

Uncertainty sampling

[David Lewis and William Gale, 1994]

  • Newswire classification task, use of simulated active learning.
  • 319,463 training documents, 51,991 test documents, 10 categories.
  • Cold start with 3 randomly chosen positive examples.
  • Comparison between:

    – Random sampling (3+7),
    – Relevance sampling (3+996), increment by 4,
    – Uncertainty sampling (3+996), increment by 4,
    – Full annotation (3+319,463).

  • Uncertainty sampling reduced by as much as 500-fold the
    amount of training data that would have to be manually
    classified to achieve a given level of effectiveness.
  • Uncertainty sampling performs better than relevance sampling.

    F1: uncertainty 0.453, full 0.409, relevance 0.248, random 0.107


SVM active learning

[Simon Tong and Edward Chang, 2001]

  • “Support Vector Machine Active Learning for Image Retrieval”.
  • Relevance feedback for learning a “query concept”.
  • Select the most informative images on which to query the user.
  • Quickly learn a boundary that separates the images that
    satisfy the user query concept from the rest of the dataset.
  • Algorithm (a code sketch follows):

    – Cold start with 20 randomly selected images,
    – Iterations with uncertainty sampling: display the 20 images
      that are the closest to the SVM boundary,
    – Final output with relevance sampling: display the 20 images
      that are the farthest from the SVM boundary (on the positive
      side).

  • Significantly higher search accuracy than traditional query
    refinement schemes after just three or four rounds of
    relevance feedback.
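A minimal sketch of the two selection rules using an SVM's signed
distance to the boundary (my illustration with scikit-learn;
assumes the labeled set already contains both classes):

```python
# One relevance-feedback round: uncertainty sampling for the next
# query, relevance sampling for the final result list.
import numpy as np
from sklearn.svm import SVC

def svm_active_round(X, labeled_idx, labels, n_show=20):
    clf = SVC(kernel="rbf").fit(X[labeled_idx], labels)
    margin = clf.decision_function(X)        # signed distance to boundary
    pool = np.setdiff1d(np.arange(len(X)), labeled_idx)
    # closest to the boundary → most informative, ask the user
    to_label = pool[np.argsort(np.abs(margin[pool]))][:n_show]
    # farthest on the positive side → most probable positives
    results = pool[np.argsort(-margin[pool])][:n_show]
    return to_label, results
```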

Active learning for CBIR

[Cha Zhang and Tsuhan Chen, 2002]

  • An active learning framework for Content Based Information
    Retrieval.
  • Indexing phase and retrieval phase.
  • Annotation of multiple attributes on each object.
  • Indexing via uncertainty sampling based active learning.
  • Uncertainty is estimated via the expected knowledge gain.
  • Each object (either in the database or from the query) receives
    a probability associated to each feature: 0 or 1 if annotated,
    a probability computed from the trained classifier otherwise.
  • Retrieval via semantic distance between query objects and
    objects in the database: attribute probabilities are used as a
    feature vector.
  • Weighted sum with low-level features.
  • Experiments on a database of 3D objects: discriminate aircraft
    from non-aircraft.
  • Performance increases with the number of annotated objects.
  • Active learning outperforms random sampling based learning.

Partition sampling

[Fabrice Souvannavong et al, 2004]

  • Partition sampling for active video database annotation.
  • Focus on the simultaneous selection of multiple samples.
  • Select samples such that their contributions to the knowledge
    gain are complementary and optimal.
  • Partition the pool of uncertain samples using the k-means
    clustering technique and select one sample in each cluster →
    the samples are both mostly uncertain and far from each other
    (a small sketch follows).
  • Practical implementation:

    – HS color histograms and Gabor energies on keyframes,
    – Latent Semantic Analysis (LSA) to capture local information,
    – k-Nearest Neighbors (kNN) classification.
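A minimal sketch of partition sampling as described above (my
illustration, not the paper's implementation):

```python
# Cluster the most uncertain samples with k-means and pick the sample
# closest to each centroid: the batch is both uncertain and diverse.
import numpy as np
from sklearn.cluster import KMeans

def partition_sample(X, p_pos, batch_size=10, pool_size=200):
    uncertainty = 1.0 - 2.0 * np.abs(p_pos - 0.5)
    pool = np.argsort(-uncertainty)[:pool_size]     # most uncertain pool
    km = KMeans(n_clusters=batch_size, n_init=10).fit(X[pool])
    batch = []
    for c in range(batch_size):
        members = pool[km.labels_ == c]
        if len(members) == 0:                       # rare empty cluster
            continue
        d = np.linalg.norm(X[members] - km.cluster_centers_[c], axis=1)
        batch.append(members[np.argmin(d)])         # one sample per cluster
    return np.array(batch)
```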

Partition sampling

[Fabrice Souvannavong et al, 2004]

  • Use of TRECVID 2003 development data and annotations.
  • The task is corpus annotation, not concept learning:
    → the development and the test sets are identical,
    → the performance measure is the error rate on the whole set.
  • Comparison between:

    – Random sampling,
    – Greedy maximization of the error reduction,
    – Partition sampling.

  • Partition sampling is significantly better (up to 30%) than the
    greedy AL strategy, but only when a small fraction of the
    corpus is annotated.
  • No significant difference after the annotation of about 1/6th of
    the corpus (no more “far” uncertain samples?).
  • 0.5% error rate after the annotation of half of the corpus,
    against 2% for random sampling: ~4-fold error reduction.


Partition sampling

[Fabrice Souvannavong et al, 2004]

(Figure: error rate versus training set size for the compared
strategies.)

History or instability sampling

[McCallum and Nigam, 1998, Davy and Luu, 2007]

  • “Active Learning with History-Based Query Selection for Text
    Categorization” [Davy and Luu, 2007].
  • Select the samples which have the most erratic label
    assignments.
  • Similar to query by committee, where the committee members
    are the classifiers of the k previous iterations.
  • History uncertainty sampling: average the uncertainty over the
    k previous iterations.
  • Use of class distributions: works with multiple classes; all
    possible classes are annotated at once when a sample is
    selected for annotation.
  • History Kullback-Leibler Divergence (KLD): average of the KLD
    between the average distribution and the committee member
    distributions.
  • Improvement over both uncertainty sampling and history
    uncertainty sampling.
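A minimal sketch of the history-KLD score (my reading of the
description above; the direction of the KL divergence is an
assumption):

```python
# Treat the classifiers of the k previous AL iterations as a committee;
# score each sample by the mean KL divergence between each member's
# predicted class distribution and the committee average.
import numpy as np

def history_kld_scores(history: np.ndarray, eps: float = 1e-12) -> np.ndarray:
    """history: (k_iters, n_samples, n_classes) predicted distributions."""
    mean_dist = history.mean(axis=0)                       # (n, c)
    kld = (history * np.log((history + eps) / (mean_dist + eps))).sum(axis=2)
    return kld.mean(axis=0)          # high score = erratic predictions

h = np.array([[[0.9, 0.1], [0.5, 0.5]],
              [[0.1, 0.9], [0.5, 0.5]]])   # sample 0 flips, sample 1 stable
print(history_kld_scores(h).argmax())      # → 0
```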


History or instability sampling

[Davy and Luu, 2007]

A case study in the context of TRECVID
Part 1: Evaluation of active learning strategies


TRECVID “High Level Feature” detection task

  • Fully annotated training set: TRECVID 2005 collaborative
    annotation on the development collection.
  • Distinct testing set: TRECVID 2006 test collection.
  • TRECVID 2006 HLF task and metrics:

    – Find 39 concepts (High Level Features, LSCOM-lite) in 79484
      shots (146328 subshots, 160 hours of Arabic, Chinese and
      English TV news),
    – For each concept, propose a ranked list of 2000 shots,
    – Performance measure: Mean (Inferred) Average Precision on 20
      concepts.

Classification system used

  • Networks of SVM classifiers for multimodal fusion [Ayache, 2006].
  • Combination of early and late fusion schemes.
  • Local low-level visual features: color, texture and motion on
    20×13 patches of 32×32 pixels.
  • Global low-level visual features: color, texture and motion.
  • Intermediate local visual features (percepts): 15 classes (sky,
    greenery, face, building, …).
  • Intermediate textual categories (percepts): 103 classes
    (Reuters categories on ASR transcriptions).
  • Global performance slightly above the median in TRECVID 2006.

Active learning evaluations

  • Use of simulated active learning.
  • The training set is restricted to the shots that contain speech
    → 36014 samples.
  • Default step size: 1/40th of the training set → 900 samples.
  • Cold start with 10 positive and 20 negative samples, all
    randomly selected.
  • Evaluation of:

    – Strategies: random, relevance and uncertainty sampling,
    – Relation with concept difficulty,
    – Effect of the step size,
    – Training set size,
    – Finding rates for positive samples,
    – Precision versus recall compromise.

Three evaluated strategies

  • Significant level of fluctuation, where a smooth increase would
    be expected.
  • Probably due to the progressive inclusion of particularly good
    or particularly bad positive or negative examples.
  • Observed in many other works → average over many different
    cold start random selections.

(Plot: MAP versus annotated fraction for relevance, uncertainty
and random sampling.)


Three evaluated strategies

  • Random sampling shows a continuous increase in performance
    with the size of the sample set, with a higher rate near the
    beginning.
  • The maximum performance is reached only when 100% of the
    sample set is annotated.

(Plot: MAP versus annotated fraction for the three strategies.)

Three evaluated strategies

  • Relevance sampling is the best one when a small fraction (less
    than 15%) of the dataset is annotated.
  • It gets very close to the best random sampling performance
    with the annotation of only about 12.5% of the whole sample
    set. The performance then increases very slowly.

(Plot: MAP versus annotated fraction for the three strategies.)


Three evaluated strategies

  • Uncertainty sampling is the best one when a medium to large
    fraction (15% or more) of the dataset is annotated.
  • It gets slightly above the best relevance and random sampling
    performances with the annotation of only about 15% of the
    whole sample set. The performance does not increase
    afterwards.

(Plot: MAP versus annotated fraction for the three strategies.)

Relation with concept difficulty

“Easy” concepts: Weather (0.454), Sport (0.301) and Maps (0.217).

(Plot: MAP versus annotated fraction for the three strategies.)


Relation with concept difficulty

“Moderately difficult” concepts: Military (0.0985), Car (0.0771),
Waterscape-Waterfront (0.0755), Charts (0.0708), Meeting (0.0671),
Flag-US (0.0634), Desert (0.0557) and Explosion-Fire (0.0548).

(Plot: MAP versus annotated fraction for the three strategies.)

Relation with concept difficulty

“Difficult” concepts: Computer-TV-screen (0.0411), Truck (0.0355),
Mountain (0.0329), People-Marching (0.0284), Police-Security
(0.0257), Airplane (0.0206), Animal (0.0058), Office (0.0027) and
Corporate-Leader (0.0000).

(Plot: MAP versus annotated fraction for the three strategies.)


Finding positive and negative samples

  • Number of positive samples found along the iterations.
  • Relevance sampling finds positives more rapidly, but this is
    not related to better performance, except close to the
    beginning.

(Plot: positive samples found versus annotated fraction for the
three strategies.)

Effect of the step size

  • Uncertainty sampling with different step sizes (relative to the
    training set size).
  • Not surprisingly: the smaller, the better.

Effect of the step size

  • Random sampling with different step sizes (relative to the
    training set size).
  • The step size should have no effect: only fluctuations are seen.

Effect of the corpus size

  • A single step size is considered: 1/20th of the corpus size.
  • Three corpus sizes: 36K, 18K and 9K samples.
  • Use of the first half and the first quarter of the full corpus,
    not a random sub-selection.
  • Asymptotic values for linear and random sampling:

    Corpus size        36K     18K     9K
    Random sampling    0.090   0.070   0.045
    Linear sampling    0.090   0.045   0.030

  • Linear sampling is significantly worse than random sampling.


The three strategies on the 36K corpus

  • Uncertainty sampling is the best strategy when a medium to
    large fraction (15% or more) of the dataset is annotated.
  • Relevance sampling is the best strategy when a small fraction
    (less than 15%) of the dataset is annotated.
  • Optimal annotation size: ~7K samples.

(Plot: MAP versus annotated fraction for the three strategies.)

The three strategies on the 18K corpus

  • Uncertainty sampling is the best strategy when a medium to
    large fraction (20% or more) of the dataset is annotated.
  • Relevance sampling is the best strategy when a small fraction
    (less than 20%) of the dataset is annotated.
  • Optimal annotation size: ~5K samples.

(Plot: MAP versus annotated fraction for the three strategies.)


The three strategies on the 9K corpus

  • Relevance sampling is always the best strategy (not enough
    samples for the uncertainty sampling strategy to eventually
    take over?).
  • Optimal annotation size: ~3.5K samples.

(Plot: MAP versus annotated fraction for the three strategies.)

Better values on the whole corpus?

  • Both uncertainty and relevance sampling often perform better
    than random and linear sampling even when the whole set is
    annotated: why?
  • Most concepts are sparse:

    – All positive samples are kept, but only a fraction of the
      negative samples are kept,
    – These are chosen first among those predicted as most relevant
      or most uncertain.

  • Simulated active learning can improve system performance
    even when the corpus is fully annotated, by improving the
    selection of the negative samples (some advanced learning
    algorithms already include something equivalent).

(Plot: MAP versus annotated fraction for the three strategies.)


Precision versus recall compromise

  • Mean Average Precision does not capture everything.
  • Precision @ N quite often has a more practical meaning.
  • Recall × precision curves for the three strategies when 20% of
    the corpus is annotated.
  • Relevance sampling is more “recall oriented”.
  • Uncertainty sampling is more “precision oriented”.
  • Statistical significance unsure.

Precision versus recall compromise

  • Relevance sampling is more “recall oriented”.
  • Uncertainty sampling is more “precision oriented”.

(Plot: recall × precision curves for the three strategies.)


Part 1 conclusion (1)

  • Evaluation of active learning strategies using simulated
    active learning.
  • Use of TRECVID 2005/2006 data and metrics.
  • Three strategies were compared: relevance sampling,
    uncertainty sampling and random sampling.
  • For easy concepts, relevance sampling is the best strategy
    when less than 15% of the dataset is annotated, and
    uncertainty sampling is the best one when 15% or more of the
    dataset is annotated (with 36K samples).
  • Relevance sampling and uncertainty sampling are roughly
    equivalent for moderately difficult and difficult concepts.

Part 1 conclusion (2)

  • The maximum performance is reached when 12 to 15% of the
    whole dataset is annotated (for 36K samples).
  • The optimal fraction to annotate depends upon the size of the
    training set: it roughly varies with the square root of the
    training set size (25 to 30% for 9K samples).
  • Random sampling is not the worst baseline; linear scan is even
    worse.
  • Simulated active learning can improve system performance
    even on fully annotated training sets.
  • Uncertainty sampling is more “precision oriented”.
  • Relevance sampling is more “recall oriented”.
  • “Cold start” not investigated yet.

A case study in the context of TRECVID
Part 2: TRECVID 2007 collaborative annotation

TRECVID 2007 Collaborative annotation

  • Follows the TRECVID 2003 and 2005 collaborative annotations.
  • Annotations are done by TRECVID participants.
  • A tool is provided to the annotators; it has been Web-based
    since 2005.
  • Images are displayed to the annotators one concept at a time,
    one or several images at a time.
  • The user marks each image as either positive, negative or
    unsure (default is negative).
  • Cold start using TRECVID 2005 annotations (same concepts
    but significantly different collection contents).
  • Annotation driven active learning.
  • Implementation of active cleaning.

TRECVID 2007 Collaborative annotation

Sequential annotation interface


TRECVID 2007 Collaborative annotation

Parallel annotation interface


Annotation driven active learning

  • Two engines running in parallel:

    – The web-based annotation engine,
    – The active learning sample ranking engine.

  • Active learning is continuously running, cycling over the 36
    concepts (in approximately 18 hours).
  • The next concept to retrain is the one that received the
    highest number of annotations since its last training.
  • When a user connects, he is asked to annotate the concept
    with the fewest annotations in total; once he has annotated at
    least 100 samples, another concept is proposed to him (both
    scheduling rules are sketched below).
  • Active learning is transparent to the user, except that he has
    to switch quite often from one concept to another.
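A minimal sketch of these two scheduling rules (my illustration;
all names are hypothetical):

```python
# Rule 1: retrain the concept with the most annotations since its
# last training. Rule 2: offer the user the least-annotated concept,
# moving on once the per-session quota is reached.
def next_concept_to_retrain(annotations_since_training: dict) -> str:
    return max(annotations_since_training, key=annotations_since_training.get)

def concept_for_user(total_annotations: dict, done_this_session: dict,
                     quota: int = 100) -> str:
    candidates = {c: n for c, n in total_annotations.items()
                  if done_this_session.get(c, 0) < quota}
    return min(candidates, key=candidates.get)

print(next_concept_to_retrain({"Animal": 40, "Maps": 120}))            # Maps
print(concept_for_user({"Animal": 500, "Maps": 200}, {"Maps": 100}))   # Animal
```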

TRECVID 2007 collaborative annotation

  • 21,532 subshots to annotate with 36 concepts.
  • 32 participating teams.
  • Each team was asked to annotate 3% of the subshots × concepts.
  • About 92% once-equivalent annotation.
  • Some subshots were annotated several times due to active cleaning.

TRECVID 2007 collaborative annotation

  • About a two-month effort.
  • Main effort during two weeks (the second and third weeks).
  • The first week was not open to the public, to guarantee a
    small step size during the first iterations.

(Figure: daily annotations in the collaborative annotation
project, GMT time, May 2007 days.)

Finding positive and negative samples

  • Similar to the experiments with simulated active learning on
    TRECVID 2005-2006 data.

(Figure: evolution of the fraction of positive samples found with
the fraction of annotated samples; comparison between active
learning and random annotation, all concepts. Marked events: end
of cold start, inclusion of text features, begin of neighbor
sampling.)


Finding positive and negative samples

  • Evolution by concept is very variable.
  • A few do worse than random close to the beginning.

(Figure: evolution of the fraction of positive samples found with
the fraction of annotated samples for the 36 concepts
individually. Marked events: end of cold start, inclusion of text
features.)

Annotation driven active learning

  • Small step size at the beginning: training driven.
  • Larger step size afterwards: annotation driven.

(Figure: evolution of the fraction of positive samples found with
the fraction of annotated samples for the “Animal” concept, with
the active learning iteration occurrences and the begin of
neighbor sampling marked.)


Part 2 conclusion

  • Active learning based collaborative annotation.
  • Actual active learning, not simulated active learning.
  • Heterogeneous cold start (TV 2005 → TV 2007).
  • Similar behavior in the finding of positive samples, though the
    strategies and conditions differ.
  • Neighbor sampling (selecting the shots just before and just
    after a positive shot) significantly improves the finding rate.
  • Difficult to quantify, but active cleaning significantly
    improves the annotation quality.
  • Significant global benefit compared to random sampling, but
    possibly not as high as it could have been, due to the small
    collection size.
  • Annotations used by TRECVID participants for the HLF
    detection task.


Global conclusion and perspectives


Global conclusion

  • Active learning greatly improves the annotation cost versus
    system performance compromise.
  • Moderate additional cost in complexity.
  • Main applications: classifier training, corpus annotation and
    relevance feedback during search.
  • Main strategies: relevance sampling, uncertainty sampling and
    sample clustering (partition sampling), plus combinations of
    them, including evolving strategies.
  • Integration with classification techniques: SVM active learning.
  • Other parameters: cold start, step size, user effects, concept
    difficulty, concept frequency, ...
  • Practical implementation: organization of the human-system
    iteration cycle.
  • Annotation driven active learning.

Perspectives (1)

  • More work on strategies:

    – New strategies: instability, hybrid relevance-uncertainty (e.g.
      target probability at 0.75 or with an evolving value), …
    – Classifier principle related strategies,
    – Characterization of strategy efficiency with the application
      and target concepts.

  • More work on features:

    – Not directly related to active learning but very relevant to
      system performance.

  • Adaptation of the strategies to the problem context, to concept
    frequency and difficulty:

    – Relevance sampling for sparse concepts,
    – Uncertainty sampling for frequent concepts,
    – Frequency and difficulty are not related.

  • Variable strategies and step sizes:

    – Switch from relevance sampling to uncertainty sampling,
    – Increasing step size.


Perspectives (2)

  • Active learning for annotation cleaning:

    – Human annotation is imperfect, and annotation errors really
      hurt system performance,
    – Optimize the cost of every annotation: compare the benefit of
      correcting an annotation error to the benefit of getting new
      annotations,
    – Obvious strategy: double-check the wrongly predicted samples,
      but this could certainly be improved.

  • Concept derivation and active learning:

    – Use of relations between concepts, e.g. women are humans,
    – Derive generics from specifics,
    – Look for specifics within generics,
    – Which to annotate first? Which relations to use?
    – Not specific to active learning, but possibly specific strategies.

  • Ontology annotation and active learning: similar to concept
    derivation but with full use of the ontology structure.

Perspectives (3)

  • Use of a fully featured search system instead of a simple
    classification system:

    – Possible solution for the cold start,
    – Improve the finding rate of positive samples,
    – Rely on previous work: capitalizing knowledge.

  • Application to local (frame by frame or region) annotation:

    – Need for annotation at the frame (versus shot) level,
    – Need for annotation at the region (versus image) level,
    – Need for better locating the concepts in the document,
    – Need for system training when locality is exploited,
    – Local annotation is very costly but can be very rewarding,
    – Active learning based prediction with manual correction,
    – Significant benefit can be expected.