computer vision ii
play

Computer Vision II Bjoern Andres Machine Learning for Computer - PowerPoint PPT Presentation

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden Object recognition Object recognition is the task of finding any occurrences of an object in an image, given a model of the the geometry and appearance of the


  1. Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden

  2. Object recognition Object recognition is the task of finding any occurrences of an object in an image, given a model of the the geometry and appearance of the object.

  3. Object recognition 2 5 1 0 0 − 1 ǫ (not part of the object) − 5 − 2 − 5 0 5 − 2 − 1 0 1 2 Set D of points in the image Set V of object key points

  4. Object recognition 5 5 0 0 − 5 − 5 − 5 0 5 − 5 0 5 Set D of points in the image Recognition

  5. Object recognition Decisions at points ◮ For any point d ∈ D in the image and any key point v ∈ V of the object, let y dv ∈ { 0 , 1 } indicate whether the point d is an occurrence of the key point v in the image ◮ We constrain each point in the image to be an occurrence of precisely one key point, possibly ǫ . Hence, we consider the feasible set � � � � � Y DV = y : D × V → { 0 , 1 } � ∀ d ∈ D : y dv = 1 . � � v ∈ V Costs at points ◮ For any point d ∈ D and any key point v ∈ V , let c dv ∈ R a cost associated with the decision y dv = 1 ◮ This cost typically depends on the contents of the image at the point d .

  6. Object recognition Decisions for pairs of points ◮ For any pair { d, d ′ } ∈ � D � of points, let x { d,d ′ } ∈ { 0 , 1 } indicate 2 whether d and d ′ belong to the same occurrence of an object in the image ◮ We require these decisions to be transitive, i.e. ∀ d ∈ D ∀ d ′ ∈ D \ { d } ∀ d ′′ ∈ D \ { d, d ′ } : x { d,d ′ } + x { d ′ ,d ′′ } − 1 ≤ x { d,d ′′ } (1) Hence, we consider the feasible set � � � � D � X D = x : → { 0 , 1 } � (1) � 2

  7. Object recognition Costs for pairs of points ◮ For any pair ( d, d ′ ) ∈ D 2 of points such that d � = d ′ and any pair ( v, w ) ∈ V 2 of key points, let ◮ c ′ dd ′ vw ∈ R a cost associated with the decision y dv y d ′ w x { d,d ′ } = 1 ◮ c ′′ dd ′ vw ∈ R a cost associated with the decision y dv y d ′ w (1 − x { d,d ′ } ) = 1 ◮ These costs can depend, e.g., on the distance between d and d ′ in the image plane.

  8. Object recognition Optimization problem ◮ The task of object recognition can now be stated as the optimization problem � � min c dv y dv ( x,y ) ∈ X D × Y DV d ∈ D v ∈ V � � � c ′ + dd ′ vw y dv y d ′ w x { d,d ′ } d ∈ D d ′ ∈ D \{ d } ( v,w ) ∈ V 2 � � � c ′′ + dd ′ vw y dv y d ′ w (1 − x { d,d ′ } ) d ∈ D d ′ ∈ D \{ d } ( v,w ) ∈ V 2 ◮ This is a joint graph decomposition and node labeling problem ◮ The local search algorithm we have considered before (for the task of joint image decomposition and pixel labeling) can be applied!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend