in the name of allah in the name of allah
play

In the name of Allah In the name of Allah the compassionate, the - PowerPoint PPT Presentation

In the name of Allah In the name of Allah the compassionate, the merciful Digital Video Processing S. Kasaei S. Kasaei R Room: CE 307 CE 307 Department of Computer Engineering Sharif University of Technology E M il E-Mail:


  1. In the name of Allah In the name of Allah the compassionate, the merciful

  2. Digital Video Processing S. Kasaei S. Kasaei R Room: CE 307 CE 307 Department of Computer Engineering Sharif University of Technology E M il E-Mail: skasaei@sharif.edu k i@ h if d Webpage: http://sharif.edu/~skasaei Lab. Website: http://ipl.ce.sharif.edu

  3. Acknowledgment Acknowledgment Most of the slides used in this course have been provided by: Prof. Yao Wang (Polytechnic University, Brooklyn) based on the book: based on the book: Video Processing & Communications written by: Yao Wang Jom Ostermann & Ya Oin Zhang written by: Yao Wang, Jom Ostermann, & Ya-Oin Zhang Prentice Hall, 1 st edition, 2001, ISBN: 0130175471. [SUT Code: TK 5105 .2 .W36 2001]

  4. Chapter 6 Chapter 6 2-D Motion Estimation 2 D Motion Estimation Part I: Fundamentals & Basic Techniques

  5. Outline � 2-D motion vs. optical flow � Optical flow equation & ambiguity in motion estimation estimation � General methodologies in motion estimation � Motion representation � Motion estimation criterion � Optimization methods � Gradient descent methods � Pixel-based motion estimation � Block-based motion estimation � EBMA algorithm 6 Kasaei

  6. 2 D Motion Estimation 2-D Motion Estimation � Motion estimation (ME) is an important part of many video processing tasks. � ME main applications are video compression, ME i li ti id i sampling rate conversion, filtering, … � For computer vision, motion vectors (MV) are used to � For computer vision motion vectors (MV) are used to deduce 3-D structure & motion parameters (sparse but accurate set of MVs are required). � For video coding MVs are used to produce motion � For video coding, MVs are used to produce motion- compensated predicted frame to reduce required bitrate for coding MVs & prediction errors (tense & accurate set of MVs are required) accurate set of MVs are required). 7 Kasaei

  7. 2-D Motion Estimation 2 D Motion Estimation � An ME problem is converted to an optimization problem that involves key p p y components of: � Parameterization of motion field. � Parameterization of motion field. � Formulation of optimization criterion. � Searching for optimal parameters � Searching for optimal parameters. Input Frames Optimal Motion Parameters Optimization Criteria Motion Field 8 Kasaei

  8. 2-D Motion vs Optical Flow 2 D Motion vs. Optical Flow � 2-D Motion: Projection of 3-D motion Depends on 3-D object motion & 2-D Motion: Projection of 3-D motion. Depends on 3-D object motion & projection operator (physical aspects). � Optical flow: “Perceived” 2-D motion based on changes in image pattern, also depends on illumination & object surface texture. also depends on illumination & object surface texture (a) A sphere is rotating under a ( ) A h i t ti d constant ambient illumination, but observed image does not change. g (b) A point light source is rotating around a stationary sphere, causing highlight point on causing highlight point on sphere to rotate. (a) (b) 9 Kasaei

  9. Correspondence & Optical Flow p & p � 2-D displacement & velocity fields are projections of respective 3-D fields into image plane. � Correspondence field & optical flow field are displacement & velocity functions “perceived” from the time-varying image intensity pattern. h i i i i i � Correspondence field & optical flow field are also � Correspondence field & optical flow field are also called “apparent 2-D displacement” field & “apparent 2-D velocity” field. 10 Kasaei

  10. Correspondence & Optical Flow p & p � Since we can only observe correspondence & optical flow fields, we assume that they are the same as the 2 D motion field 2-D motion field. � When illumination condition is unknown, the best one can do is to estimate the optical flow can do is to estimate the optical flow. � Constant intensity assumption : The image of the same object point at different time intervals have the same bj t i t t diff t ti i t l h th luminance value. � Constant intensity assumption (CIA) Optical flow C i i i (CIA) O i l fl (OF) equation. 11 Kasaei

  11. Optical Flow Equation p q Under " constant intensity assumption " : ψ + + + = ψ [(x,y,t) � (x+dx, y+dy, t+dt)] ( x d , y d , t d ) ( x , y , t ) x y t But, using Taylor' s expansion : ∂ ψ ∂ ψ ∂ ψ ψ + + + = ψ + + + ( x d , y d , t d ) ( x , y , t ) d d d x x y y t t x x y y t t ∂ ∂ ∂ ∂ ∂ ∂ x y y Compare the above two, we have the optical flow equation : ∂ ψ ∂ ψ ∂ ψ ∂ ψ ∂ ψ ∂ ψ ∂ ψ T + + = + + = ∇ ψ + = d d d 0 or v v 0 or v 0 x y t x y ∂ ∂ ∂ ∂ ∂ ∂ ∂ x y t x y t t spatial [The velocity vector ( flow vector ) , v, is the unknown parameter. gradient One equation with two unknowns.] vector 12 Kasaei

  12. Ambiguities in Motion Estimation Ambiguities in Motion Estimation � Optical flow equation only constrains the flow vector in aperture the gradient direction ( ) the gradient direction ( ). v v ? ? problem problem n � The flow vector in the tangent v direction ( ) is under- ( ) ? t t determined (aperture problem). � Also, in regions with constant v brightness ( brightness ( ), the flow is ∇ ψ ∇ ψ = ) the flow is 0 0 indeterminate Motion = + If: v v e v e n n t t estimation is unreliable in no ∂ ∂ ψ ψ vt ! t ! regions with flat texture, but ∇ ψ + = v n 0 gradient ∂ t more reliable near edges. vector magnitude 13 Kasaei

  13. Ambiguities in Motion Estimation Ambiguities in Motion Estimation � To solve the undetermined component problem ( ) of OFE, one must impose additional v t constraints constraints. � The most common constraints is that the flow vectors should vary smoothly spatially (to estimate the motion vector). ok ok ? 14 Kasaei

  14. General Considerations for ME General Considerations for ME � Two categories of approaches: � Feature-based: More often used in object tracking & 3-D reconstruction from 2-D (least-squares fitting of features, good for global motions). � Intensity-based: Based on CIA (no simple model) More � Intensity-based: Based on CIA (no simple model). More often used for motion compensated prediction (required in video coding), frame interpolation Our focus. � Three important questions: � How to represent (parameterize) the motion field? p (p ) � What criteria to use to estimate motion parameters? � How to search for optimal motion parameters? 15 Kasaei

  15. Motion Representation Motion Representation Pixel-based: Global: One MV at each pixel, Entire motion field is with some smoothness represented by a few constraint between global parameters adjacent MVs (very (global motion time consuming). time consuming) representation; camera representation; camera motion). Block-based: Region-based: E ti Entire frame is divided f i di id d Entire frame is divided into non-overlapping into regions, then each blocks, then motion in region corresponding to each block is an object (or sub- characterized by a few y object) with consistent object) with consistent parameters (good motion, is represented compromise between by a few parameters accuracy & complexity, (requires region discontinuous across segmentation map, blocks no multiple blocks, no multiple which pels have similar objects, scale, or motions?). rotation). 16 Kasaei

  16. Motion Representation Motion Representation � Other representation: mesh-based representation. � Underlying image frame is partitioned into non-overlapping polygonal elements. p yg � Mvs at the corners of polygonal elements determine the entire motion field. � Mvs at the interior points of an element are interpolated from the nodal MVs. h d l MV � Induced motion field is continuous everywhere. � Adaptive methods allow discontinuities when necessary (on object boundaries). 17 Kasaei

  17. Notations Notations ψ 1 x ( ) Anchor frame: ψ 2 x ( ) Target frame: Motion parameters: M ti t a Motion vector at a pixel in the anchor f frame: d d ( ( x ) ) Motion field: ∈ Λ d ( x ; a ), x Mapping function: = + ∈ Λ w ( x ; a ) x d ( x ; a ), x reference current frame frame [in video [in video coding] coding] 18 Kasaei

  18. Regularization Theory Regularization Theory Ill-posed problems. � Regularization methods Regularization methods. � � Stochastic regularization methods. � R l Relaxation labeling. ti l b li � � Discrete relaxation labeling. � Stochastic relaxation labeling. 19 Kasaei

  19. Well-Posed Problems Well Posed Problems A mathematical problem is well-posed � when its solution Exists, 1. is unique, and 2. is robust to noise. 3. Physical simulation problems are well- Physical simulation problems are well- � � posed, but “inverse” problems are usually ill-posed. p 20 Kasaei

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend