classification and machine learning techniques for cbir
play

Classification and Machine Learning techniques for CBIR: - PowerPoint PPT Presentation

Classification and Machine Learning techniques for CBIR: introduction to the RETIN system Matthieu Cord ETIS CNRS UMR 8051 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system p.1/81 Content-Based Image


  1. Classification and Machine Learning techniques for CBIR: introduction to the RETIN system Matthieu Cord ETIS CNRS UMR 8051 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.1/81

  2. Content-Based Image Retrieval • Retrieve large categories of pictures in generalist image database • Vector-based description of images • User interaction • Statistical learning approach → Multimodality (category retrieval) → Efficient strategies in text retrieval → Interactive strategies (active learning) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.2/81

  3. Outline 1. Binary Classification for CBIR 2. Active learning: (a) Error Reduction and Uncertainly-Based strategies (b) RETIN scheme: Boundary Correction and diversity 3. Semi-supervised classification 4. Long Term Learning Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.3/81

  4. Supervised Classification for CBIR Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.4/81

  5. Introduction • Vector-based description of images; Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.5/81

  6. Introduction • binary classification Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.6/81

  7. Supervised Classification Three representative methods for CBIR: • Bayes Classifiers (Vasconcelos) • k-Nearest Neighbors • Support Vector Machines (Chapelle) Specific characteristics [Chang ICIP’03]: (c1) High dimension and non-linearity of input space (c2) Few training data (c3) Many unlabelled data (c4) Interactive learning (Relevance feedback) (c5) Unbalanced training data Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.7/81

  8. Support Vector Machines (1/4) Classification by an hyperplan: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.8/81

  9. Support Vector Machines (2/4) Choose the hyperplan which maximizes the margin: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.9/81

  10. Support Vector Machines (3/4) Quadratic problem: n n α i − 1 � � α ⋆ = argmax α i α j y i y j < x i , x j > 2 α i =1 i,j =1  n �  α i y i = 0  with i =1   ∀ i ∈ [1 , n ] 0 ≤ α i ≤ C Decision function: n � y i α ⋆ f ( x ) = i < x , x i > + b i =1 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.10/81

  11. Support Vector Machines (3/4) Support Vectors: Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.11/81

  12. "Kernelization" Kernelization of SVM: • SVM decision function: N � y i α ⋆ f ( x ) = i < x , x i > + b (1) i =1 • "Kernelized" version: N � y i α ⋆ f ( x ) = i k ( x , x i ) + b (2) i =1 Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.12/81

  13. Kernels Dealing with the class of kernels k corresponding to dot product in an induced space H via a map Φ : R p → H Φ : x �→ Φ( x ) that is k ( x , x ′ ) = < Φ( x ) , Φ( x ′ ) > Initial: X Induit: Φ( X ) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.13/81

  14. Kernels • Usual kernels: Lin., Polynomial, Sigmoid, RBF ... • Choice of a kernel depends on the database and its usage: → Different levels of performances for two different kernels; • In our experiments: Gaussian kernels give the best results → The most adapted to CBIR; → In the following experiments: Gaussian kernels with χ 2 distance, because feature vector are distributions. Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.14/81

  15. Spectral analysis of kernel matrices 1.2 1 0.8 Energie 0.6 Performances 0.4 Gaussien Chi2 Triangulaire 0.2 Polynomiale 3 Gaussien L2 Linéaire 0 0 200 400 600 800 1000 Valeur propre Large distribution ⇒ high performances; Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.15/81

  16. SVM and Kernels Deal with (c1) high dimension and non-linear input space: → Use of a kernel function to induce a feature space → Relevance function f using Kernel in SVM: N � y i α ⋆ f ( x ) = i k ( x , x i ) + b i =1 When a method cannot be directly "kernelized": KPCA. Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.16/81

  17. Experiments Protocol: • COREL Photo database (6,000 images); • 50 categories, 100-300 size; • Training set of 200 points (unbalanced). • Statistical measure: Mean Average Precision MAP Methods MAP(%) Time No learning 8 - Bayes/Parzen 18 0.09s k-NN 16 0.20s SVM 20 0.13s • SVM selected [Gosselin CVDB04] Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.17/81

  18. Experiments Training with 10 examples => poor top-similarity ranking results → User interaction (c4) to enhance the retrieval 2 components: the parameter tuning of f and the optimization of the set of examples Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.18/81

  19. Active learning for CBIR Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.19/81

  20. Active Learning Deal with the few training data (c2) and interactive learning (c4) characteristics → optimize training data to get the best classification with as few as possible user labeling Strategies of selective sampling : • Relevance-Based (RB): → Select the most relevant image • Uncertainly-Based (UB) • Error Reduction (ER) → Priority to the classification error minimization Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.20/81

  21. Labelling the most relevant (RB) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.21/81

  22. Labelling the most relevant (RB) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.22/81

  23. Labelling the most relevant (RB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.23/81

  24. Labelling the most relevant (RB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.24/81

  25. Labelling the most difficult to classify (UB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.25/81

  26. Labelling the most difficult to classify (UB) Revelant labelled picture Irrelevant labelled picture Relevant unlabelled picture Irrelevant unlabelled picture Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.26/81

  27. Active Learning The aim of an active learner is to select the most interesting picture x ⋆ → We propose to express the following methods as the minimization of a cost function g ( x ) : x ⋆ = argmin g ( x ) x For Relevance-Based active learning: g ( x ) = − f ( x ) where f ( x ) is the relevance function Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.27/81

  28. Uncertainly-based (UB) Active learner: x ⋆ = argmin g ( x ) x • UB strategy selects the picture which is the most difficult to classify: g ( x ) = | f ( x ) | • Method: • SVM active (Tong): → Works in the version space → Needs an accurate estimation of the boundary Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.28/81

  29. Error Reduction (ER) • ER strategy (Roy and McCallum): select the picture which will minimize the new expected test error: � P A +( x ,c ) ˆ g ( x ) = P A ( c | x ) E ˆ c ∈{− 1 , 1 } with: • ˆ P A ( c | x ) the estimation of the probability of class c given x , with the training set A • E ˆ P A +( x ,c ) the estimation of the expectation of the test error, with training set A + ( x , c ) • Require an accurate estimation of ˆ P A ( c | x ) Classification and Machine Learning techniques for CBIR: introduction to the RETIN system – p.29/81

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend