Computer Vision II Bjoern Andres Machine Learning for Computer - PowerPoint PPT Presentation

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden April 27, 2020 1 / 9

Pixel classification We consider: ◮ n 0 , n 1 ∈ N called the height and width of a digital image, V = [ n 0 ] × [ n 1 ] called the set of pixels, and the grid graph G = ( V , E ) ◮ A non-empty set R whose elements are called colors ◮ A function x : V → R called a digital image The task of pixel classification is concerned with making decisions at the pixels, e.g., decisions y : V → { 0 , 1 } indicating whether a pixel v ∈ V is of interest ( y v = 1) or not of interest ( y v = 0). 2 / 9

Pixel classification Source: https://www.pexels.com/photo/nature-flowers-garden-plant-67857/ For instance, we may wish to map to 1 precisely those pixels of the above image that depict the yellow part of any of the flowers. 3 / 9

Pixel classification We begin with a trivial mathematical abstraction of the task of pixel classification: Definition. For any c : V → R , the instance of the trivial pixel classification problem w.r.t. c has the form � min c v y v (1) y ∈{ 0 , 1 } V v ∈ V In practice, we would seek to construct the function c w.r.t. the image in such a way that ◮ c v < 0 if we consider y v = 1 the right decision ◮ c v > 0 if we consider y v = 0 the right decision 4 / 9

Pixel classification Assuming the decision for a pixel v ∈ V depends on the color x v ∈ R of that pixel only, we can ◮ construct a function ξ : R → R ◮ define c v = ξ ( x v ) for any v ∈ V . In some practical applications, e.g. photo editing, a suitable function ξ can be constructed manually, typically with the help of carefully designed GUIs. 5 / 9

Pixel classification Assuming the decision for a pixel v ∈ V depends on the location v and on the colors of all pixels in a neighborhood V d ( v ) ⊆ V around v , e.g. V d ( v ) = { w ∈ V | � v − w � max ≤ d } , we can ◮ construct, for any pixel v , a function ξ v : R V d ( v ) → R that assigns a real number ξ v ( x ′ ) to any coloring x ′ : V d ( v ) → R of the d -neighborhood of v ◮ define c v = ξ ( x V d ( v ) ) for any v ∈ V . The task of constructing such functions ξ v is typically addressed by means of machine learning , e.g., logistic regression or a CNN. 6 / 9

Pixel classification In practice, solutions to the trivial pixel classification problem can be improved by exploiting prior knowledge about feasible combinations of decisions. Firstly, we consider prior knowledge saying that decisions at neighboring pixels v , w ∈ V are more likely to be equal ( y v = v w ) than unequal ( y v � = y w ). Definition. For any c : V → R and any c ′ : E → R + 0 , the instance of the smooth pixel classification problem w.r.t. c and c ′ has the form � � c ′ { v , w } | y v − y w | min c v y v + (2) y ∈{ 0 , 1 } V v ∈ V { v , w }∈ E � �� ϕ ( y ) 7 / 9

Pixel classification A na¨ ıve algorithm for this problem is local search with a transformation T v : { 0 , 1 } V → { 0 , 1 } V that changes the decision for a single pixel, i.e., for any y : V → { 0 , 1 } and any v , w ∈ V : � 1 − y w if w = v T v ( y )( w ) = otherwise . y w Initially, y : V → { 0 , 1 } and W = V while W � = ∅ W ′ := ∅ for each v ∈ W if ϕ ( T v ( y )) − ϕ ( y ) < 0 y := T v ( y ) W ′ := W ′ ∪ { w ∈ V | { v , w } ∈ E } W := W ′ 8 / 9

Pixel classification Suggested self-study: ◮ Construct a function ξ (Slide 5) for the task and image shown on Slide 3; visualize the output of ξ . ◮ Implement the local search algorithm (Slide 8) for the smooth pixel classification problem (2) such that ϕ ( T v ( y )) − ϕ ( y ) is computed in constant time. ◮ Apply your implementation to c v = ξ ( x v ) and various positive constants c ′ . ◮ Discuss your results and compare these to the solutions of the trivial pixel classification problem (1) that is solved by your implementation for c ′ = 0. Advanced self-study: ◮ Generalize your implementation to operate on classifications y : V → { 0 , 1 , 2 } . ◮ Use your implementation to separate also the white leaves of the flowers in the image shown on Slide 3. 9 / 9

Computer Vision II Bjoern Andres Machine Learning for Computer - PowerPoint PPT Presentation

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden April 27, 2020 1 / 9 Pixel classification We consider: n 0 , n 1 N called the height and width of a digital image, V = [ n 0 ] [ n 1 ] called the set

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

Computer Vision Introduction Historical context Connections to other disciplines Vision and

HIM Without Walls Realizing Our Vision! Realizing Our Vision Realize Our Vision Realizing Our

Deep Learning in Computer Vision Caner Hazrba Deep Learning in Action 24. June 15

J J R R Our Vision . . . Our Vision . . . Our Vision . . . Our Vision . . . TO BE THE BEST

Post- -trauma vision trauma vision Post Post- -trauma vision trauma vision Post syndrome

2017 Humana Vision 130 LOOK Whats NEW! NEW RETAIL FRAME BENEFIT 2 Humana Vision 100

Vision What is the Vision? The American Fork Canyon Vision (Vision) will ho- Few places in the

Building Our Vision St. Andrews Vision and Mission Our Vision: Our Vision: The Tree of Life is

FLITTER FLITTER The Foldable Litter Pink B Our Vision Our Vision Our Vision Our Vision A

CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques John Magee 1 Computer

CS 4495 Computer Vision 3D Perception Kelsey Hawkins Robotics 3D Perception CS 4495 Computer

Expanding Use of Drones in the Railroad Environment Community of Interest Webinar for

Library of Congress Classification: Module 2.1 1 Library of Congress Classification: Module 2.1

An Introduction to Kernel Methods for Classification, Regression and Structured Data atsch

Neural State Classification for Hybrid Systems Nicola Paoletti Royal Holloway, University of

Science C Curriculum Briefing riday, 29 th th Fri January ry 2016 Primary ry Science

Object Detection using R-CNN Experiments CS381V: Visual Recognition, Spring 2016 William Xie

A traveling salesman problem with quadratic cost structure Anja Fischer, Christoph Helmberg

the robotics design lab Peer- Review Easily with Confidence : A Look at Replication and

Computer Vision II Bjoern Andres Machine Learning for Computer - PowerPoint PPT Presentation

Computer Vision II Bjoern Andres Machine Learning for Computer Vision TU Dresden April 27, 2020 1 / 9 Pixel classification We consider: n 0 , n 1 N called the height and width of a digital image, V = [ n 0 ] [ n 1 ] called the set

Computer Vision Computer Vision How does vision work? What is vision for? Ela Claridge

CS262: Computer Vision (and Human-Computer Interaction) John Magee 1 Computer Vision How are

Branding Presentation VISION Mevushal VISION Muscat of Alexandria &amp; Viognier VISION

Vision Services Vision Services &amp; &amp; Vision Therapy Vision Therapy February 2, 2007

Vision Our National Church partners .. Vision Our National Network partners Vision Getting

Computer Vision Introduction Historical context Connections to other disciplines Vision and

HIM Without Walls Realizing Our Vision! Realizing Our Vision Realize Our Vision Realizing Our

Deep Learning in Computer Vision Caner Hazrba Deep Learning in Action 24. June 15

J J R R Our Vision . . . Our Vision . . . Our Vision . . . Our Vision . . . TO BE THE BEST

Post- -trauma vision trauma vision Post Post- -trauma vision trauma vision Post syndrome

2017 Humana Vision 130 LOOK Whats NEW! NEW RETAIL FRAME BENEFIT 2 Humana Vision 100

Vision What is the Vision? The American Fork Canyon Vision (Vision) will ho- Few places in the

Building Our Vision St. Andrews Vision and Mission Our Vision: Our Vision: The Tree of Life is

FLITTER FLITTER The Foldable Litter Pink B Our Vision Our Vision Our Vision Our Vision A

CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques John Magee 1 Computer

CS 4495 Computer Vision 3D Perception Kelsey Hawkins Robotics 3D Perception CS 4495 Computer

Expanding Use of Drones in the Railroad Environment Community of Interest Webinar for

Library of Congress Classification: Module 2.1 1 Library of Congress Classification: Module 2.1

An Introduction to Kernel Methods for Classification, Regression and Structured Data atsch

Neural State Classification for Hybrid Systems Nicola Paoletti Royal Holloway, University of

Science C Curriculum Briefing riday, 29 th th Fri January ry 2016 Primary ry Science

Object Detection using R-CNN Experiments CS381V: Visual Recognition, Spring 2016 William Xie

A traveling salesman problem with quadratic cost structure Anja Fischer, Christoph Helmberg

the robotics design lab Peer- Review Easily with Confidence : A Look at Replication and

Branding Presentation VISION Mevushal VISION Muscat of Alexandria & Viognier VISION

Vision Services Vision Services & & Vision Therapy Vision Therapy February 2, 2007