Image Segmentation
Perceptual and Sensory Augmented Computing, Computer Vision WS 08/09
Luc Van Gool, ETH Zurich
With important contributions by Vittorio Ferrari, Un. of Edinburgh
Slide credits: K. Grauman, B. Leibe, S. Lazebnik, S. Seitz, Y


  1. Mixture of Gaussians
     • One generative model is a mixture of Gaussians (MoG)
       – K Gaussian blobs with means $\mu_b$, covariance matrices $V_b$, dimension $d$
         – Blob $b$ is defined by: $P(x \mid \mu_b, V_b) = \frac{1}{\sqrt{(2\pi)^d \, |V_b|}} \exp\left(-\tfrac{1}{2}(x-\mu_b)^T V_b^{-1} (x-\mu_b)\right)$
       – Blob $b$ is selected with probability $\alpha_b$, where $\sum_{b=1}^{K} \alpha_b = 1$
       – The likelihood of observing $x$ is a weighted mixture of Gaussians: $P(x \mid \theta) = \sum_{b=1}^{K} \alpha_b \, P(x \mid \mu_b, V_b)$
     Slide adapted from Steve Seitz

  2. Expectation Maximization (EM)
     • Goal
       – Find blob parameters $\theta$ that maximize the likelihood function over all data points
     • Approach:
       1. E-step: given current guess of blobs, compute probabilistic ownership of each point
       2. M-step: given ownership probabilities, update blobs to maximize likelihood function
       3. Repeat until convergence
     Slide adapted from Steve Seitz

  3. EM Details
     • E-step
       – Compute probability that point $x$ is in blob $b$, given current guess of $\theta$
     • M-step
       – Compute overall probability that blob $b$ is selected ($N$ data points)
       – Mean of blob $b$
       – Covariance of blob $b$
     Slide adapted from Steve Seitz
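The E-step and M-step listed above can be sketched in a few lines of NumPy. This is a minimal illustration and not the lecture's own code: the function names `gaussian_pdf` and `em_mog`, the identity initial covariances, the fixed iteration count, and the small regularizer `reg` are all assumptions made for this example. Initial means are passed in explicitly, for instance from a k-means run.

```python
import numpy as np

def gaussian_pdf(X, mean, cov):
    """Density of one d-dimensional Gaussian blob at every row of X."""
    d = X.shape[1]
    diff = X - mean
    maha = np.einsum("ni,ij,nj->n", diff, np.linalg.inv(cov), diff)
    return np.exp(-0.5 * maha) / np.sqrt((2 * np.pi) ** d * np.linalg.det(cov))

def em_mog(X, init_means, n_iter=30, reg=1e-6):
    """EM for a mixture of Gaussians (hypothetical helper, fixed iteration count)."""
    X = np.asarray(X, dtype=float)
    N, d = X.shape
    means = np.asarray(init_means, dtype=float).copy()
    K = len(means)
    covs = np.stack([np.eye(d)] * K)   # assumption: features are roughly unit-scaled
    alphas = np.full(K, 1.0 / K)       # blob selection probabilities
    for _ in range(n_iter):
        # E-step: probabilistic ownership of each point by each blob
        resp = np.stack(
            [alphas[b] * gaussian_pdf(X, means[b], covs[b]) for b in range(K)], axis=1
        )
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update selection probability, mean and covariance of each blob
        Nb = resp.sum(axis=0)
        alphas = Nb / N
        means = (resp.T @ X) / Nb[:, None]
        for b in range(K):
            diff = X - means[b]
            covs[b] = (resp[:, b, None] * diff).T @ diff / Nb[b] + reg * np.eye(d)
    return alphas, means, covs, resp
```

The returned `resp` array holds the soft assignments; taking its row-wise argmax gives a hard segmentation if one is needed.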

  4. Segmentation with EM
     [Figure: segmentation result for K = 3]
     Slide credit: B. Leibe. Image source: Serge Belongie

  5. Summary: Mixtures of Gaussians, EM
     • Pros
       – Probabilistic interpretation
       – Soft assignments between data points and clusters
       – Generative model, can predict novel data points
       – Relatively compact storage
     • Cons
       – Initialization: often a good idea to start from the output of k-means
       – Local minima
       – Need to know the number of components K; one solution is to add a cost for model complexity
       – Need to choose the generative model (the mathematical form of a cluster)
     Slide adapted from B. Leibe

  6. Topics of This Lecture
     • Introduction
       – Gestalt principles
       – Image segmentation
     • Segmentation as clustering
       – k-Means
       – Feature spaces
       – Mixture of Gaussians, EM
     • Model-free clustering: Mean-Shift
     • Graph theoretic segmentation: Normalized Cuts
     • Interactive Segmentation with Graph Cuts

  7. Topics of This Lecture
     • Introduction
       – Gestalt principles
       – Image segmentation
     • Segmentation as clustering
       – k-Means
       – Feature spaces
       – Mixture of Gaussians, EM
     • Model-free clustering: Mean-Shift
     • Graph theoretic segmentation: Normalized Cuts
     • Interactive Segmentation with path search

  8. Finding Modes in a Histogram
     • How many modes are there?
       – Mode = local maximum of a given distribution
       – Easy to see, hard to compute
     Slide adapted from Steve Seitz

  9. Mean-Shift Segmentation
     • An advanced and versatile technique for clustering-based segmentation
     http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html
     D. Comaniciu and P. Meer, Mean Shift: A Robust Approach toward Feature Space Analysis, PAMI 2002.
     Slide credit: Svetlana Lazebnik

  10. Mean-Shift Algorithm
     • Iterative Mode Search
       1. Initialize random seed center and window W
       2. Calculate center of gravity (the "mean") of W
       3. Shift the search window to the mean
       4. Repeat steps 2+3 until convergence
     Slide adapted from Steve Seitz
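The four steps above can be sketched as a short mode search. This is an illustrative version only: it assumes a flat (uniform) circular window of radius `h`, and the function name `mean_shift_mode`, the tolerance, and the iteration cap are choices made for this example.

```python
import numpy as np

def mean_shift_mode(points, start, h, tol=1e-5, max_iter=500):
    """Follow the mean-shift vector from `start` until the window stops moving."""
    points = np.asarray(points, dtype=float)
    center = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        # Step 2: center of gravity of the points inside the window W
        in_window = np.linalg.norm(points - center, axis=1) <= h
        if not in_window.any():
            break  # empty window: nothing to shift toward
        new_center = points[in_window].mean(axis=0)
        # Step 4: stop once the shift is negligible
        if np.linalg.norm(new_center - center) < tol:
            return new_center
        # Step 3: shift the search window to the mean
        center = new_center
    return center
```

A weighted kernel (for example a Gaussian) can replace the flat window by swapping the plain mean for a weighted mean.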

  11-17. Mean-Shift (animation frames)
     [Figure: a region of interest, its center of mass, and the mean-shift vector, repeated over slides 11-16; slide 17 shows only the region of interest and the center of mass]
     Slide by Y. Ukrainitz & B. Sarel

  18. Real Modality Analysis
     • Tessellate the space
     • Run the procedure in parallel with windows
     Slide by Y. Ukrainitz & B. Sarel

  19. Real Modality Analysis
     The blue data points were traversed by the windows towards the mode.
     Slide by Y. Ukrainitz & B. Sarel

  20. Mean-Shift Clustering
     • Cluster: all data points in the attraction basin of a mode
     • Attraction basin: the region for which all trajectories lead to the same mode
     Slide by Y. Ukrainitz & B. Sarel

  21. Mean-Shift Clustering/Segmentation
     • Choose features (color, gradients, texture, etc.)
     • Initialize windows at individual pixel locations
     • Start mean-shift from each window until convergence
     • Merge windows that end up near the same "peak" or mode
     Slide adapted from Svetlana Lazebnik
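A compact sketch of this clustering procedure, under stated assumptions: one flat window of radius `h` is started at every feature vector, all windows are shifted for a fixed number of iterations, and windows whose end points lie within `merge_tol` of each other are merged. The name `mean_shift_cluster`, the fixed iteration count, and the default `merge_tol = h / 2` are hypothetical choices. The pairwise distance matrix is O(N²) in memory, so this form only suits small inputs.

```python
import numpy as np

def mean_shift_cluster(features, h, n_iter=100, merge_tol=None):
    """Label each feature vector by the mode its window converges to."""
    X = np.asarray(features, dtype=float)
    modes = X.copy()                       # one window per data point
    for _ in range(n_iter):
        # distances from every window center to every data point
        dist = np.linalg.norm(modes[:, None, :] - X[None, :, :], axis=2)
        inside = (dist <= h).astype(float)
        # shift every window to the mean of the points it contains
        modes = inside @ X / inside.sum(axis=1, keepdims=True)
    merge_tol = h / 2 if merge_tol is None else merge_tol
    labels = np.full(len(X), -1, dtype=int)
    centers = []
    for i, m in enumerate(modes):
        # merge with an existing mode if one is close enough
        for k, c in enumerate(centers):
            if np.linalg.norm(m - c) < merge_tol:
                labels[i] = k
                break
        else:
            centers.append(m)
            labels[i] = len(centers) - 1
    return labels, np.array(centers)
```

For image segmentation the rows of `features` would be per-pixel feature vectors (for example color plus position), and `labels` reshaped to the image size gives the segment map.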

  22. Mean-Shift Segmentation Results
     http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html
     Slide credit: Svetlana Lazebnik

  23. Summary Mean-Shift
     • Pros
       – General, application-independent tool
       – Model-free, does not assume any prior shape (spherical, elliptical, etc.) on data clusters
       – Just a single parameter (window size h)
         – h has a physical meaning (unlike k-means): the scale of clustering
       – Finds variable number of modes given the same h
       – Robust to outliers
     • Cons
       – Output depends on window size h
       – Window size (bandwidth) selection is not trivial
       – Computationally rather expensive
       – Does not scale well with dimension of feature space (sparsity problems in high-dimensional spaces…)
     Slide adapted from Svetlana Lazebnik

  24. Topics of This Lecture
     • Introduction
       – Gestalt principles
       – Image segmentation
     • Segmentation as clustering
       – k-Means
       – Feature spaces
       – Mixture of Gaussians, EM
     • Model-free clustering: Mean-Shift
     • Graph theoretic segmentation: Normalized Cuts
     • Interactive Segmentation with path search

  25. Images as Graphs
     [Figure: pixels p and q joined by an edge with weight $w_{pq}$]
     • Fully-connected graph
       – Node (vertex) for every pixel
       – Edge between every pair of pixels (p,q)
       – Affinity weight $w_{pq}$ for each edge
         – $w_{pq}$ measures similarity
         – Similarity is inversely proportional to difference (in color, texture, position, …)
     Slide adapted from Steve Seitz

  26. Measuring Affinity
     • Distance: $\mathrm{aff}(x, y) = \exp\left\{ -\frac{1}{2\sigma_d^2} \|x - y\|^2 \right\}$
     • Intensity: $\mathrm{aff}(x, y) = \exp\left\{ -\frac{1}{2\sigma_d^2} \|I(x) - I(y)\|^2 \right\}$
     • Color: $\mathrm{aff}(x, y) = \exp\left\{ -\frac{1}{2\sigma_d^2} \, \mathrm{dist}\big(c(x), c(y)\big)^2 \right\}$ (some suitable color space distance)
     • Texture: $\mathrm{aff}(x, y) = \exp\left\{ -\frac{1}{2\sigma_d^2} \|f(x) - f(y)\|^2 \right\}$ (vectors of filter outputs)
     Source: Forsyth & Ponce
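As an illustration of how such affinities turn an image into a weight matrix, the sketch below builds a dense matrix W for a small grayscale image. Multiplying the distance and intensity terms, and using separate scales `sigma_d` and `sigma_i` for them, are assumptions of this example; the function name `affinity_matrix` is hypothetical.

```python
import numpy as np

def affinity_matrix(intensity, sigma_i=0.1, sigma_d=2.0):
    """Dense affinity matrix combining position and intensity similarity."""
    h, w = intensity.shape
    ys, xs = np.mgrid[0:h, 0:w]
    pos = np.stack([ys.ravel(), xs.ravel()], axis=1).astype(float)
    val = intensity.ravel().astype(float)
    # squared differences between every pair of pixels (row-major pixel order)
    d_pos = ((pos[:, None, :] - pos[None, :, :]) ** 2).sum(axis=2)
    d_int = (val[:, None] - val[None, :]) ** 2
    # product of the distance affinity and the intensity affinity
    return np.exp(-d_pos / (2 * sigma_d ** 2)) * np.exp(-d_int / (2 * sigma_i ** 2))
```

The matrix has one row and column per pixel, so it grows quadratically with image size; practical systems usually keep only the entries for nearby pixel pairs.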

  27. Segmentation by Graph Cuts
     [Figure: graph split into segments A, B, C; edge weight w]
     • Break Graph into Segments
       – Delete edges crossing between segments
       – Easiest to break edges with low similarity (low weight)
         – Similar pixels should be in the same segments
         – Dissimilar pixels should be in different segments
     Slide adapted from Steve Seitz

  28. Graph Cut (GC)
     • GC = edges whose removal partitions a graph in two (A and B)
     • Cost of a cut
       – Sum of weights of cut edges: $\mathrm{cut}(A, B) = \sum_{p \in A,\, q \in B} w_{p,q}$
     • A graph cut gives us a segmentation
       – What is a "good" graph cut and how do we find one?
     Slide adapted from Steve Seitz

  29. Minimum Cut
     • We can do segmentation by finding the minimum cut in a graph
       – Efficient algorithms exist for doing this
     • Drawback:
       – Weight of cut proportional to number of edges in the cut
       – Minimum cut tends to cut off very small, isolated components
     [Figure: the ideal cut, and cuts with lesser weight than the ideal cut]
     Slide credit: Khurram Hassan-Shafique

  30. Normalized Cut (NCut)
     • Min-cut has bias toward partitioning out small segments
     • This can be fixed by normalizing for size of segments
     • The normalized cut cost is: $\mathrm{Ncut}(A, B) = \frac{\mathrm{cut}(A, B)}{\mathrm{assoc}(A, V)} + \frac{\mathrm{cut}(A, B)}{\mathrm{assoc}(B, V)}$
       – assoc(A, V) = sum of weights from A to all nodes in the graph
     • The exact solution is NP-hard but an approximation can be computed by solving a generalized eigenvalue problem.
     J. Shi and J. Malik. Normalized cuts and image segmentation. PAMI 2000
     Slide adapted from Svetlana Lazebnik
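The sketch below evaluates the normalized cut cost for a given partition and computes one spectral approximation. It assumes a dense, symmetric affinity matrix W with positive row sums, and the generalized eigenproblem commonly written as (D − W) y = λ D y, solved here through the equivalent symmetric form so that only NumPy is needed. Splitting the second eigenvector at zero is a simplification made for this example; the function names are hypothetical.

```python
import numpy as np

def ncut_value(W, mask):
    """Normalized cut cost of the partition A = mask, B = ~mask."""
    cut = W[mask][:, ~mask].sum()
    assoc_a = W[mask].sum()      # weights from A to all nodes in the graph
    assoc_b = W[~mask].sum()     # weights from B to all nodes in the graph
    return cut / assoc_a + cut / assoc_b

def ncut_bipartition(W):
    """Approximate two-way normalized cut from the second smallest eigenvector."""
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # symmetric form D^{-1/2} (D - W) D^{-1/2} of the generalized problem
    L_sym = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    vals, vecs = np.linalg.eigh(L_sym)    # eigenvalues in ascending order
    y = d_inv_sqrt * vecs[:, 1]           # map back to the generalized eigenvector
    return y > 0                          # assumption: threshold at zero
```

Because the sign of an eigenvector is arbitrary, which side of the partition comes out as `True` is not meaningful; only the grouping is.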

  31. Interpretation as a Dynamical System
     • Treat the edges as springs and 'shake' the system
       – Elasticity proportional to cost
       – Vibration "modes" correspond to segments
         – Can compute these by solving a generalized eigenvector problem
     Slide adapted from Steve Seitz

  32. NCuts Example
     [Figure: NCuts segments]
     Slide credit: B. Leibe. Image source: Shi & Malik

  33. Color Image Segmentation with NCuts
     Slide credit: Steve Seitz. Image source: Shi & Malik

  34. Results with Color & Texture

  35. Summary: Normalized Cuts
     • Pros:
       – Generic framework, flexible to choice of function that computes weights ("affinities") between nodes
       – Does not require any model of the data distribution
     • Cons:
       – Time and memory complexity can be high
         – Dense, highly connected graphs ⇒ many affinity computations
         – Solving eigenvalue problem
       – Preference for balanced partitions
         – If a region is uniform, NCuts will find the modes of vibration of the image dimensions
     Slide credit: Kristen Grauman

  36. Markov Random Fields
     • Allow rich probabilistic models for images
     • But built in a local, modular way
       – Learn local effects, get global effects out
     [Figure: observed evidence, hidden "true states", neighborhood relations]
     Slide credit: William Freeman

  37. MRF Nodes as Pixels (or Patches)
     [Figure: image pixels $y_i$ linked to image states $x_i$ (e.g. foreground/background) by $\Phi(x_i, y_i)$; neighboring states linked by $\Psi(x_i, x_j)$]
     Slide adapted from William Freeman

  38. Network Joint Probability
     $P(x, y) = \prod_i \Phi(x_i, y_i) \prod_{i,j} \Psi(x_i, x_j)$
     – $x$: states; $y$: image
     – $\Phi$: image-state compatibility function (local observations)
     – $\Psi$: state-state compatibility function (neighboring nodes)
     Slide adapted from William Freeman

  39. Energy Formulation
     • Joint probability: $P(x, y) = \prod_i \Phi(x_i, y_i) \prod_{i,j} \Psi(x_i, x_j)$
     • Maximizing the joint probability is the same as minimizing the negative log:
       $-\log P(x, y) = -\sum_i \log \Phi(x_i, y_i) - \sum_{i,j} \log \Psi(x_i, x_j)$
       $E(x, y) = \sum_i \phi(x_i, y_i) + \sum_{i,j} \psi(x_i, x_j)$, with $\phi = -\log \Phi$ and $\psi = -\log \Psi$
     • This is similar to free-energy problems in statistical mechanics (spin glass theory). We therefore draw the analogy and call E an energy function.
     • $\phi$ and $\psi$ are called potentials.
     Slide credit: B. Leibe

  40. Energy Formulation
     • Energy function: $E(x, y) = \sum_i \phi(x_i, y_i) + \sum_{i,j} \psi(x_i, x_j)$ (unary potentials plus pairwise potentials)
     • Unary potentials $\phi$
       – Encode local information about the given pixel/patch
       – How likely is a pixel/patch to be in a certain state (e.g. foreground/background)?
     • Pairwise potentials $\psi$
       – Encode neighborhood information
       – How different is a pixel/patch's label from that of its neighbor? (e.g. here independent of image data, but later based on intensity/color/texture difference)
     Slide adapted from B. Leibe
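To make the energy concrete, the sketch below evaluates E for a given labeling on a 4-connected pixel grid. The pairwise term used here is a simple Potts penalty (a constant cost whenever two neighbors disagree), which is one data-independent choice and an assumption of this example, as are the function name `mrf_energy` and the layout of the `unary` array.

```python
import numpy as np

def mrf_energy(labels, unary, pairwise_weight=1.0):
    """Energy of an integer labeling on a 4-connected grid.

    labels: (h, w) integer array of states x_i
    unary:  (h, w, n_labels) array, unary[r, c, l] = phi for pixel (r, c) in state l
    """
    h, w = labels.shape
    rows, cols = np.mgrid[0:h, 0:w]
    # sum of unary potentials phi(x_i, y_i) for the chosen states
    e_unary = unary[rows, cols, labels].sum()
    # Potts pairwise potentials: count disagreeing horizontal and vertical neighbors
    n_disagree = (labels[:, 1:] != labels[:, :-1]).sum() + (labels[1:, :] != labels[:-1, :]).sum()
    return e_unary + pairwise_weight * n_disagree
```

Comparing the energies of two candidate labelings shows the trade-off directly: a labeling that follows the unary costs exactly may pay more in pairwise penalties than a smoother one.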

  41. Energy Minimization
     • Goal:
       – Infer the optimal labeling of the MRF.
     • Many inference algorithms are available, e.g.
       – Gibbs sampling, simulated annealing
       – Iterated conditional modes (ICM)
       – Variational methods
       – Belief propagation
       – Graph cuts
     • Recently, Graph Cuts have become a popular tool
       – Only suitable for a certain class of energy functions
       – But the solution can be obtained very fast for typical vision problems (~1 MPixel/sec).
     Slide credit: B. Leibe

  42. Graph Cuts for Optimal Boundary Detection
     • Idea: convert MRF into source-sink graph
     [Figure: graph with terminals s and t, n-links between pixels, hard constraints, and a cut]
     Minimum cost cut can be computed in polynomial time (max-flow/min-cut algorithms)
     [Boykov & Jolly, ICCV '01]
     Slide adapted from Yuri Boykov

  43. Adding Regional Properties
     [Figure: graph with terminals s and t, n-links with weights $w_{pq}$, t-links with weights $D_p(s)$ and $D_p(t)$, and a cut]
     Regional bias example: suppose $I^s$ and $I^t$ are given "expected" intensities of object and background
       $D_p(s) \propto \exp\left(-\|I_p - I^s\|^2 / 2\sigma^2\right)$
       $D_p(t) \propto \exp\left(-\|I_p - I^t\|^2 / 2\sigma^2\right)$
     [Boykov & Jolly, ICCV '01]
     Slide credit: Yuri Boykov

  44. Adding Regional Properties
     • More generally, regional bias can be based on any intensity models of object and background
       $D_p(L_p) = -\log \Pr(I_p \mid L_p)$
       given object and background intensity histograms
     [Figure: histograms $\Pr(I_p \mid s)$ and $\Pr(I_p \mid t)$ over intensity $I$, and the graph with t-links $D_p(s)$, $D_p(t)$ and a cut]
     [Boykov & Jolly, ICCV '01]
     Slide credit: Yuri Boykov
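A minimal sketch of the regional term D_p(L_p) = −log Pr(I_p | L_p) computed from two intensity histograms. It assumes intensities scaled to [0, 1] and equally sized histogram bins; the function name `regional_costs` and the small constant `eps` (which avoids log 0 for empty bins) are choices made for this example.

```python
import numpy as np

def regional_costs(image, obj_hist, bkg_hist, eps=1e-10):
    """Per-pixel penalties for labeling a pixel as object or as background."""
    obj_hist = np.asarray(obj_hist, dtype=float)
    bkg_hist = np.asarray(bkg_hist, dtype=float)
    bins = len(obj_hist)
    # histogram bin of every pixel intensity (intensities assumed in [0, 1])
    idx = np.clip((np.asarray(image) * bins).astype(int), 0, bins - 1)
    p_obj = obj_hist / obj_hist.sum()     # Pr(I | object)
    p_bkg = bkg_hist / bkg_hist.sum()     # Pr(I | background)
    # negative log-likelihood under each intensity model
    return -np.log(p_obj[idx] + eps), -np.log(p_bkg[idx] + eps)
```

In an interactive setting the two histograms would typically be collected from the pixels the user has marked as object and background.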

  45. How Does it Work? The s-t-Mincut Problem
     Graph (V, E, C)
       Vertices V = {v_1, v_2 ... v_n}
       Edges E = {(v_1, v_2) ....}
       Costs C = {c_(1,2) ....}
     [Figure: example graph with Source, v_1, v_2 and Sink; edge costs 2, 9, 1, 2, 5, 4]
     Slide credit: Pushmeet Kohli

  46. The s-t-Mincut Problem
     • What is an st-cut?
       An st-cut (S,T) divides the nodes between source and sink.
     • What is the cost of an st-cut?
       Sum of cost of all edges going from S to T
     [Figure: example cut with cost 5 + 2 + 9 = 16]
     Slide credit: Pushmeet Kohli

  47. The s-t-Mincut Problem
     • What is an st-cut?
       An st-cut (S,T) divides the nodes between source and sink.
     • What is the cost of an st-cut?
       Sum of cost of all edges going from S to T
     • What is the st-mincut?
       st-cut with the minimum cost
     [Figure: minimum cut with cost 2 + 1 + 4 = 7]
     Slide credit: Pushmeet Kohli

  48. How to Compute the s-t-Mincut?
     • Solve the dual maximum flow problem
       Compute the maximum flow between Source and Sink
     • Constraints
       Edges: Flow ≤ Capacity
       Nodes: Flow in = Flow out
     • Min-cut/Max-flow Theorem
       In every network, the maximum flow equals the cost of the st-mincut
     Slide credit: Pushmeet Kohli

  49. Maxflow Algorithms
     Augmenting Path Based Algorithms
       1. Find path from source to sink with positive capacity
       2. Push maximum possible flow through this path
       3. Repeat until no path can be found
     Algorithms assume non-negative capacity
     [Figure: example graph, Flow = 0]
     Slide credit: Pushmeet Kohli

  50-60. Maxflow Algorithms (animation frames; the algorithm text of slide 49 repeats on each)
     • Flow = 0: initial graph (slide 50)
     • Flow = 0 + 2: 2 units are pushed along a path; the capacities on that path become 2−2 and 5−2 (slide 51)
     • Flow = 2 (slides 52-54)
     • Flow = 2 + 4: 4 units are pushed along a second path (slide 55)
     • Flow = 6 (slides 56-57)
     • Flow = 6 + 1: 1 unit is pushed along a third path; the edge of capacity 1 becomes 1−1 (slide 58)
     • Flow = 7, equal to the cost of the st-mincut on slide 47 (slides 59-60)
     Slide credit: Pushmeet Kohli
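The augmenting-path procedure can be sketched as follows. This version picks each path by breadth-first search (the Edmonds-Karp variant), which is one possible choice and not necessarily the one animated on the slides. The edge directions of the example graph are an assumption inferred from the cut sums quoted on slides 46 and 47 (5 + 2 + 9 = 16 and 2 + 1 + 4 = 7); the names `max_flow`, `s`, `t`, `v1`, `v2` are hypothetical.

```python
from collections import deque

def max_flow(capacity, source, sink):
    """Return (max flow value, set of nodes on the source side of a min cut)."""
    # residual capacities, with reverse edges initialised to 0
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0)
    flow = 0
    while True:
        # 1. find a path from source to sink with positive residual capacity
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            break  # 3. no path can be found
        # 2. push the maximum possible flow through this path
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck
    # nodes still reachable from the source form the S side of a minimum cut
    return flow, set(parent)

# Example graph; edge directions are inferred, see the note above.
graph = {
    "s": {"v1": 2, "v2": 9},
    "v1": {"v2": 2, "t": 5},
    "v2": {"v1": 1, "t": 4},
}
value, source_side = max_flow(graph, "s", "t")
```

For segmentation, the pixels left on the source side of the cut receive one label and the remaining pixels the other.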

  61. Dealing with Non-Binary Cases
     • For image segmentation, the limitation to binary energies is a nuisance.
       – Binary segmentation only
     • We would like to solve also multi-label problems.
       – NP-hard problem with 3 or more labels
     • There exist some approximation algorithms which extend graph cuts to the multi-label case
       – E.g. α-Expansion
     • They are no longer guaranteed to return the globally optimal result.
       – But α-Expansion has a guaranteed approximation quality and converges in a few iterations.
     Slide credit: B. Leibe

  62. Summary: Graph Cuts Segmentation
     • Pros
       – Powerful technique, based on probabilistic model (MRF).
       – Applicable for a wide range of problems.
       – Very efficient algorithms available for vision problems.
       – Becoming a de-facto standard for many segmentation tasks.
     • Cons/Issues
       – Graph cuts can only solve a limited class of models
         – Submodular energy functions
         – Can capture only part of the expressiveness of MRFs
       – Only approximate algorithms available for multi-label case
     Slide credit: B. Leibe

  63. Segmentation: Caveats
     • We've looked at bottom-up ways to segment an image into regions, yet finding meaningful segments is intertwined with the recognition problem.
     • Often want to avoid making hard decisions too soon
     • Difficult to evaluate; when is a segmentation successful?
     Slide credit: Kristen Grauman

  64. Speeding up 1: start from "superpixels"
     • Start from an over-segmentation in which similar-looking pixels have been grouped together quickly; this requires object boundaries to be preserved as part of superpixel edges!
     [Figure: "superpixels"]
     X. Ren and J. Malik. Learning a classification model for segmentation. ICCV 2003.
     Slide credit: Svetlana Lazebnik

  65. Speeding up 2: objectness
     • Focus on regions that an "objectness" score indicates as probably containing an object
     • Trying to draw bounding boxes around objects, without knowing what they are
     Figure 7: yellow: bounding box by computer / blue: by human

  66. Topics of This Lecture
     • Introduction
       – Gestalt principles
       – Image segmentation
     • Segmentation as clustering
       – k-Means
       – Feature spaces
       – Mixture of Gaussians, EM
     • Model-free clustering: Mean-Shift
     • Graph theoretic segmentation: Normalized Cuts
     • Interactive Segmentation with path search

  67. Dynamic path search: principle
     Guided by a user-supplied cost function that expresses expectations (e.g. good edges contain pixels with high gradients, edges are smooth, etc.), find the optimal path through the image:
       1. having lowest cost
       2. satisfying constraints (e.g. given endpoints)
     Useful in interactive applications (e.g. medical), or when the environment is constrained

  68. Dynamic path search: nomenclature
     • A graph consists of nodes (pixels) connected by arcs (steps)
     • Nodes connected by steps are parents and successors
     • Identifying a node's successors is expansion of that node
     • A tree is a graph in which each node has 1 parent
     (our arcs are undirected)

  69. Dynamic path search: nomenclature cont'd
     • Often the arcs are assigned a cost
     • A sequence of nodes $n_1, n_2, \ldots, n_k$ ($n_i$ = successor of $n_{i-1}$) is a path of length $k$
     • Usually path cost = Σ arc costs

  70. Dynamic path search: cost functions
     • Cost function incorporates problem-specific information
       – e.g. penalize changes in edge direction
       – e.g. penalize the inclusion of pixels with low intensity gradient
     • Problem is one of optimization:
       1. gradient descent
       2. path array methods
       3. best-first search

  71. Gradient descent
     Always choose the next pixel that adds the smallest cost
     Example: [Figure]
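A minimal sketch of this greedy rule, under assumptions chosen for illustration: the path runs left to right through a per-pixel cost image, advances one column per step, and may move at most one row up or down. The function name `greedy_path` and the three-neighbor successor set are hypothetical choices.

```python
import numpy as np

def greedy_path(cost, start_row):
    """Trace a left-to-right path by always taking the cheapest next pixel."""
    h, w = cost.shape
    row = start_row
    path = [(row, 0)]
    for col in range(1, w):
        # successors: the up to three pixels in the next column adjacent to the current one
        candidates = [r for r in (row - 1, row, row + 1) if 0 <= r < h]
        # always choose the next pixel that adds the smallest cost
        row = min(candidates, key=lambda r: cost[r, col])
        path.append((row, col))
    total = sum(cost[r, c] for r, c in path)
    return path, total

cost_image = np.array([[9, 9, 1, 9],
                       [1, 1, 9, 1],
                       [9, 9, 9, 9]])
path, total = greedy_path(cost_image, start_row=1)
```

Because each step looks only one pixel ahead, the result is not guaranteed to be the lowest-cost path overall, which is what motivates the path array and best-first alternatives listed on slide 70.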
