Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D VIDEO SYSTEMS 3D VIDEO SYSTEMS
Fernando Pereira Instituto Superior Técnico
3D VIDEO SYSTEMS 3D VIDEO SYSTEMS Fernando Pereira Instituto - - PowerPoint PPT Presentation
3D VIDEO SYSTEMS 3D VIDEO SYSTEMS Fernando Pereira Instituto Superior Tcnico Comunicao de udio e Vdeo, Fernando Pereira, 2012 Its a 3D World Its a 3D World, Stupid Its a 3D World Its a 3D World, Stupid , Stupid ! ,
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Fernando Pereira Instituto Superior Técnico
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
It’s a 3D World It’s a 3D World, Stupid , Stupid ! It’s a 3D World It’s a 3D World, Stupid , Stupid !
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Context and Motivation Context and Motivation Context and Motivation Context and Motivation
available to the consumer
cinema theaters with 3D capabilities
distribution, digital interfaces
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Displays Sales Forecast Stereoscopic Displays Sales Forecast Stereoscopic Displays Sales Forecast Stereoscopic Displays Sales Forecast
Source: DisplaySearch 3D Display Technology and Market Forecast Report
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Critical Success Factors Critical Success Factors Critical Success Factors Critical Success Factors
transition costs or turned off by viewing discomfort or fatigue
technology, e.g., glasses vs no glasses, Transition costs
providing interoperability through the delivery chain and taking into consideration the constraints imposed by each delivery channel
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Experiences … and 3D Video … 3D Experiences … and 3D Video … 3D Experiences … and 3D Video … 3D Experiences … and 3D Video …
illusion of depth for the scene
which allow navigating the 3D scene by changing the viewpoint and view direction within certain ranges (each view may be stereo)
representation AND its important stereo-view special case.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Human Human Visual Visual System System Human Human Visual Visual System System
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Screen Parallax … Screen Parallax … Screen Parallax … Screen Parallax …
Screen Parallax = distance between Pleft and Pright
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Vision Stereoscopic Vision Stereoscopic Vision Stereoscopic Vision
(focus) to the depth of the display
depth of the image
are not a natural situation as
and convergence occur at the same depth
eyestrain (especially in older viewers)
Disparity Positive Disparity
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopy Stereoscopy Stereoscopy Stereoscopy
Stereoscopy (also called stereoscopic or 3D imaging) regards the capability of recreating 3D visual information or creating the illusion of depth in an image based on two appropriate views. The basic requirement is to recreate offset images (with parallax) that are presented separately to the left and right eye. Both of these 2D offset images are then combined in the brain to give the perception of 3D depth.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopy: Better 3D Illusions … Stereoscopy: Better 3D Illusions … Stereoscopy: Better 3D Illusions … Stereoscopy: Better 3D Illusions …
available in 2D projections; this is why images on a television screen and at the cinema make sense. Perceptual cues for 3D perception include:
maintain a clear image (focus) on an object as its distance changes.
presenting a slightly different image to each eye. The movement parallax cue is still not satisfied with stereoscopy and, therefore, the illusion of depth is incomplete.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Free Viewpoint Systems Free Viewpoint Systems Free Viewpoint Systems Free Viewpoint Systems
Free viewpoint systems require the acquisition of multiple scene views taken from different angles, allowing the user to navigate around the scene.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Video Applications … 3D Video Applications … 3D Video Applications … 3D Video Applications …
The complete 3D video system is relevant for multiple applications such as broadcast TV, teleconference, surveillance, interactive video, cinema, gaming and other immersive video applications.
3D Home Master 3D Encoding & Video Compression 3D Video Distribution Channels Media Players & Set Top Boxes
Video Decompress 3D Format Decode
3D TV Left Eye Right Eye
Video Compress 3D Format Encode
Blu-ray Disc DVD Cable TV Satellite TV Terrestrial TV IPTV Internet
3D Home Package
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Video Content Chain … 3D Video Content Chain … 3D Video Content Chain … 3D Video Content Chain …
conventional 2D system but are quite different; they have all to evolve towards 3D regarding the 2D available solutions.
depth transitions.
formats; the best choice depends on distribution constraints, display capabilities, available equipment, target quality, etc.
displays, higher display resolutions, avoid uneasy feelings (headaches, nausea, eye strain, etc.).
Content acquisition and creation Content Representation Content Distribution Content Consumption
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
History of 3D Video … History of 3D Video … History of 3D Video … History of 3D Video …
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Content is Exploding … Again … 3D Content is Exploding … Again … 3D Content is Exploding … Again … 3D Content is Exploding … Again …
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Momentum … 3D Momentum … 3D Momentum … 3D Momentum …
unique, high-quality immersive 3D experiences in theaters
three times higher than traditional 2D screens
production and growing consumer appetite for 3D content
Avatar cost was around $500 million !!! Box office in Jan 2011 was $2,781,835,502 … Naturally, the sequel is coming .!
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Cameras … 3D Cameras … 3D Cameras … 3D Cameras …
sensor or film frame for each lens. This allows simulating human binocular vision, and gives it the ability to capture 3D images, a process known as stereo
pictures for movies.
distance) is about the distance between one's eyes (known as the intra-ocular distance); this is about 6.35 cm, although a longer base line (greater inter- camera distance) produces more extreme 3-dimensionality.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Video Formats/Standards … 3D Video Formats/Standards … 3D Video Formats/Standards … 3D Video Formats/Standards …
formats are closely coupled to 3D display types and application scenarios.
format/standard would be highly desirable to support any 3D video application in an efficient way, while decoupling content creation from display and application.
earlier this decade: a slow start, then a rapid ascent in sales once enough content exists to attract mainstream buyers.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Multi Multi-View Video Data View Video Data Multi Multi-View Video Data View Video Data
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Main 3D Video Format Requirements Main 3D Video Format Requirements Main 3D Video Format Requirements Main 3D Video Format Requirements
independent compression of each view.
displayed by starting the decoder at a random access point and decoding a relatively small quantity of data on which that image may depend.
reduced in quality to a degree commensurate with the quantity of data in the subset used for the decoding process – although accessing only a portion of a bitstream.
limited number subset of the set of encoded views.
view’ is decodable by an ordinary H.264/AVC decoder.
encoding quality of the various views.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Video Related Formats: the Menu … 3D Video Related Formats: the Menu … 3D Video Related Formats: the Menu … 3D Video Related Formats: the Menu …
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Display versus Distribution Formats Display versus Distribution Formats Display versus Distribution Formats Display versus Distribution Formats
converter box or TV itself
since both horizontal and vertical resolution are lost
STB Blu-ray Player HDMI Distribution format Display format
HDMI
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Multi Multi-View View Simulcasting Simulcasting Multi Multi-View View Simulcasting Simulcasting
(ignoring they are like ‘brothers’ due to the interview redundancy).
as H.264/AVC is more likely.
2010 World Cup games.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Frame Compatible Stereo Formats Frame Compatible Stereo Formats Frame Compatible Stereo Formats Frame Compatible Stereo Formats
to a class of formats in which the stereo signal is essentially a multiplex of the two views into a single frame or sequence of frames to be coded with 2D video coding solutions. They are also called stereo interleaving or spatial/temporal multiplexing formats.
frame-compatible formats has been standardized in H.264/AVC as supplemental enhancement information (SEI) messages.
initial phase of services.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Frame Compatible Stereo Formats Examples Frame Compatible Stereo Formats Examples Frame Compatible Stereo Formats Examples Frame Compatible Stereo Formats Examples
‘as usual’:
half represent the right view; thus, each coded view has half the resolution of the full coded frame.
Left Right Left Right time
Left Right
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Frame Compatible Formats: Pros and Cons Frame Compatible Formats: Pros and Cons Frame Compatible Formats: Pros and Cons Frame Compatible Formats: Pros and Cons
Advantages
compressed with existing encoders, transmitted through existing channels, and decoded by existing receivers)
(some rate increase expected)
consumer interfaces)
Drawbacks
native display formats (further quality degradation)
characteristics
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Preferred Frame Compatible Formats Preferred Frame Compatible Formats Preferred Frame Compatible Formats Preferred Frame Compatible Formats
primary formats: side-by-side and top-bottom
compression can be better maintained
introduce high frequency (higher bitrate, cross talk, color bleeding)
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Conventional Stereo Format Conventional Stereo Format Conventional Stereo Format Conventional Stereo Format
are coded exploiting their interview redundancy.
coding solutions with increased compression efficiency.
Combined temporal and interview prediction
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
2D+Depth Format 2D+Depth Format 2D+Depth Format 2D+Depth Format
range
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Multi Multi-View Video Coding Format View Video Coding Format Multi Multi-View Video Coding Format View Video Coding Format
Multi-view video (MVV) refers to a set of N temporally synchronized video streams coming from cameras capturing the same real scenery from different viewpoints.
in order to generate a depth impression
VIEW-1 VIEW-2 VIEW-3
TV/HDTV
3DTV
Stereo system
Channel
VIEW-1 VIEW-2 VIEW-3
TV/HDTV
3DTV
Stereo system
Channel
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MPEG MPEG-2 Multiview Profile 2 Multiview Profile MPEG MPEG-2 Multiview Profile 2 Multiview Profile
scalability for coding second view
picture from the base view or from within the enhancement view
accessible encoded video segment
which minimizes the memory storage, but reduces compression efficiency
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Multi Multi-View Video Coding (MVC) Standard View Video Coding (MVC) Standard Multi Multi-View Video Coding (MVC) Standard View Video Coding (MVC) Standard
changes of the slice layer syntax and below and
multi-view.
inter-camera prediction to reduce the required bitrate.
include a base view, which is independently coded from other non-base views.
view is reduced around 30%
are about 25%
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Interview Prediction: Basics Interview Prediction: Basics Interview Prediction: Basics Interview Prediction: Basics
Many prediction structures possible to exploit interview redundancy, trading-off differently memory, delay, computation and coding efficiency.
View
MPEG-2 Video Multi-view profile
Pictures in the second view are not only predicted from temporal references (in the same view), but also from interview references (in the other view). The prediction is adaptive, so the best predictor among temporal and interview references can be selected on a block basis in terms of rate-distortion cost.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Interview Prediction in MVC Interview Prediction in MVC Interview Prediction in MVC Interview Prediction in MVC
Time View
temporal and spatial prediction.
way that makes use of the flexible reference picture management capabilities that had already been designed into H.264/AVC.
structuring the MVC bitstream to include a compatible ‘base view’.
Base View with GOP size 6 For complexity reasons, the MVC design does not allow the prediction of a picture in one view at a given time using a picture from another view at a different time.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
View Random Access View Random Access View Random Access View Random Access
views, random access in the view dimension must also be considered; this regards controlled delay view switching.
accessed (called the target views), any views on which they depend for purposes of interview referencing also need to be accessed and decoded, which typically requires some additional decoding time or delay.
important, the prediction structure has to be designed to minimize access delay, and the MVC design provides a way for an encoder to describe the prediction structure.
determine an appropriate access point. In H.264/AVC, each IDR picture provides a clean random access point, since these pictures can be independently decoded and all the coded pictures that follow them in bitstream order can also be decoded without temporal prediction from any picture decoded prior to the IDR picture.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MVC: Technical Solution MVC: Technical Solution MVC: Technical Solution MVC: Technical Solution
The core macroblock-level and lower-level decoding modules of an MVC decoder are the same, regardless of whether a reference picture is a temporal or an interview
layer AVC hardware;
type)
view extension of the sequence parameter set (SPS) defined by H.264/AVC.
identification; ii) view dependency information; and iii) level index for operation points.
views to be inserted and removed from reference picture buffer
reference or multi-view reference
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
were also explored:
process; more efficient inter-view prediction when illumination mismatches exist
focus
Infer motion information from corresponding block in neighboring view; similar in concept to inter-layer motion prediction of SVC
not sufficient justification to change macroblock-level syntax of standard
market becomes more clear and needs are better understood
Proposed Proposed Tools Tools for MVC for MVC Proposed Proposed Tools Tools for MVC for MVC
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MVC: Profiles and Levels MVC: Profiles and Levels MVC: Profiles and Levels MVC: Profiles and Levels
There are two MVC profiles with support for more than
H.264/AVC High profile:
supports multiple views and does not support interlaced coding tools.
limited to two views, but does support interlaced coding tools. Levels impose constraints on the MVC bitstreams to establish bounds on the necessary decoder resources and complexity. The level limits include limits on the amount of frame memory required for the decoding of a bitstream, the maximum throughput in terms of macroblocks per second, maximum picture size, overall bit rate, etc.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MVC Compression Performance MVC Compression Performance MVC Compression Performance MVC Compression Performance
Simulcasting versus MVC comparison
8 views (with 640×480 resolution), and considering the rate for all views ~25% bit rate savings over all views
Ballroom
31 32 33 34 35 36 37 38 39 40 200 400 600 800 1000 1200 1400 1600 1800Bitrate (Kb/s) PSNR (db) Simulcast MVC Race1
32 33 34 35 36 37 38 39 40 41 42 200 400 600 800 1000 1200 1400 1600Bitrate (Kb/s) PSNR (db) Simulcast MVC
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MVC: Subjective Stereo Performance MVC: Subjective Stereo Performance MVC: Subjective Stereo Performance MVC: Subjective Stereo Performance
for the dependent view (75% gain); this rate may have to be higher for lower rates than 12 Mbit/s for the main view.
1.00 1.50 2.00 2.50 3.00 3.50 4.00 4.50
Original Simulcast (AVC+AVC) 12L_50Pct 12L_35Pct 12L_25Pct 12L_20Pct 12L_15Pct 12L_10Pct 12L_5Pct Mean Opinion Score
Base view fixed at 12 Mbit/s; dependent view at varying percentage of base view rate.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
uncommon
the number of views
MVC MVC Limitations Limitations MVC MVC Limitations Limitations
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Multi Multi-View Video plus Depth (MVD) View Video plus Depth (MVD) Multi Multi-View Video plus Depth (MVD) View Video plus Depth (MVD)
views) with MVC.
MPEG as part of the 3D Video coding activity.
depth (encoded with MVC).
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
view synthesis of the video data (we never see depth maps!)
with different scene depth
than current coding methods such as AVC/MVC
Depth Coding Depth Coding vs vs Natural Image Coding Natural Image Coding Depth Coding Depth Coding vs vs Natural Image Coding Natural Image Coding
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
After MVC: the MPEG 3DVC Approach After MVC: the MPEG 3DVC Approach After MVC: the MPEG 3DVC Approach After MVC: the MPEG 3DVC Approach
to be rendered
Arbitrarily Large Number
Data Format Data Format Constrained Rate (based on distribution) Limited Camera Inputs
Left Right Auto-stereoscopic N-view displays Stereoscopic displays
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Bitrate versus 3D Rendering Capability Bitrate versus 3D Rendering Capability Bitrate versus 3D Rendering Capability Bitrate versus 3D Rendering Capability
2D 2D 2D+Depth 2D+Depth MVC MVC Simulcast Simulcast
3D Rendering Capability Bit Rate
3DVC
3DVC should be compatible with:
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
MPEG 3DVC Framework MPEG 3DVC Framework MPEG 3DVC Framework MPEG 3DVC Framework
Depth Estimation Video/Depth Codec View Synthesis Limited Video Inputs (e.g., 2 or 3 views) Larger # Output Views
1010001010001
Binary Representation & Reconstruction Process
+
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Quality Metrics: an Even Bigger Challenge Quality Metrics: an Even Bigger Challenge Quality Metrics: an Even Bigger Challenge Quality Metrics: an Even Bigger Challenge
How to measure the quality of the ‘synthetic’ views for which no ‘real’ references exist ? How do we know/measure what is ‘good quality’ ? Subjective testing is mostly being used by MPEG …
+
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
(after this slide the topics are not for the exam)
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
TV Service Transitions … TV Service Transitions … TV Service Transitions … TV Service Transitions …
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Blu Blu-Ray Disc Ray Disc Blu Blu-Ray Disc Ray Disc
backward compatibility, the MVC Stereo High profile was selected by the Blu-Ray Disc Association as the coding format for 3D video with high-definition resolution.
compatibility
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Cable and Satellite Transmission Cable and Satellite Transmission Cable and Satellite Transmission Cable and Satellite Transmission
boxes?
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Internet Internet Internet Internet
2013
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Terrestrial Broadcast Terrestrial Broadcast Terrestrial Broadcast Terrestrial Broadcast
and legacy devices
in most regions
significant bandwidth
Alternative Bandwidth Allocations
0% 20% 40% 60% 80% 100%
SDTV & 3DTV SDTV, Mobile & 3DTV HDTV & 3DTV HDTV, Mobile & 3DTV
3DTV Mobile SDTV HDTV
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Games, Really Helping to Boost 3D … Games, Really Helping to Boost 3D … Games, Really Helping to Boost 3D … Games, Really Helping to Boost 3D …
Games create another important channel to bring 3D content and displays to the home, especially when there are young people around …
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Displays: a Major Driving Force … 3D Displays: a Major Driving Force … 3D Displays: a Major Driving Force … 3D Displays: a Major Driving Force …
investment in auto-stereoscopic
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
About 3D and 3D Displays … About 3D and 3D Displays … About 3D and 3D Displays … About 3D and 3D Displays …
stereoscopic perception of 3D depth to the viewer.
to the left and right eye. Both of these 2D offset images are then combined in the brain to give the perception of 3D depth.
is distinctly different from displaying an image in three full dimensions. The most notable difference is that the observer is lacking any/full freedom of head movement (movement parallax) to increase information about the 3D objects being displayed.
display’ fits more accurately for such technology.
sound field merely with two stereophonic speakers, it is likewise an
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Display Formats: a Taxonomy 3D Display Formats: a Taxonomy 3D Display Formats: a Taxonomy 3D Display Formats: a Taxonomy
the same fundamental problem: how to direct a different image to the left and right eyes.
way have resulted in many 3D display designs commercially available, falling into two broad categories: stereoscopic and autostereoscopic.
each display type increases from left to right, from two-view stereoscopic displays, to horizontal parallax multi-view displays, to full horizontal and vertical parallax volumetric displays.
a critical factor for display designers, content producers and users since it directly affects the whole imaging process from capture through to display.
Light field reconstruction Discrete number of views
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Display Formats: Some Main Differences … 3D Display Formats: Some Main Differences … 3D Display Formats: Some Main Differences … 3D Display Formats: Some Main Differences …
angles - similar to looking through a window at the scene being reproduced
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Anaglyphic Displays Stereoscopic Anaglyphic Displays Stereoscopic Anaglyphic Displays Stereoscopic Anaglyphic Displays
Anaglyph images are used to provide a stereoscopic 3D effect, when viewed with glasses where the two lenses are different (usually chromatically opposite) colors, such as red and cyan. Images are made up of two color layers, superimposed, but offset with respect to each other to produce a depth effect (significant problems with color representation).
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Wavelength Multiplexing Displays Stereoscopic Wavelength Multiplexing Displays Stereoscopic Wavelength Multiplexing Displays Stereoscopic Wavelength Multiplexing Displays
Specific wavelengths of red, green, and blue are used for the right eye, and different wavelengths of red, green, and blue for the left eye. Eyeglasses which filter out the very specific wavelengths allow the wearer to see a 3D image (problems with color representation). This technology eliminates the expensive silver screens required for polarized systems, which is the most common 3D display system in
glasses than the polarized systems.
Left eye: Red 629nm, Green 532nm, Blue 446nm Right eye: Red 615nm, Green 518nm, Blue 432nm
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Polarized Displays Stereoscopic Polarized Displays Stereoscopic Polarized Displays Stereoscopic Polarized Displays
reaches each eye, exploiting the polarization of light; referred as a passive system.
polarizing filters; the viewer wears low-cost eyeglasses which also contain a pair of different polarizing filters.
retarder.
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Polarized Stereo Display: Patterned Retarder Polarized Stereo Display: Patterned Retarder Polarized Stereo Display: Patterned Retarder Polarized Stereo Display: Patterned Retarder
Patterned Retarder (Micro-polarizer) Half vertical resolution to each eye Polarized 3D Glasses
(courtesy of Brad Hunt)
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Polarized Stereo Display: Active Retarder Polarized Stereo Display: Active Retarder Polarized Stereo Display: Active Retarder Polarized Stereo Display: Active Retarder
Active Retarder Full-resolution to each eye Polarized 3D Glasses
(courtesy of Brad Hunt)
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Stereoscopic Alternate Stereoscopic Alternate-Frame Sequencing Frame Sequencing Displays Displays Stereoscopic Alternate Stereoscopic Alternate-Frame Sequencing Frame Sequencing Displays Displays
Temporal 3D display technique creating a stereoscopic 3D effect by alternately displaying two different perspectives one for each eye. When viewed using active shutter glasses that alternate each eye between transparent and opaque in sync with the display at very high frame rate (e.g., 120 Hz), the brain merges the images into an integrated stereoscopic view; very used on PC systems to render 3D games. Frame rate has to be doubled to get an equivalent result or spatial resolution has to be halved if alternate field sequencing is used. Glass containing a liquid crystal and a polarizing filter becomes dark when a voltage is synchronously applied, but
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Displays: a Summary 3D Displays: a Summary 3D Displays: a Summary 3D Displays: a Summary
Many types of 3D displays are available
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
The Standardization Path … The Standardization Path … The Standardization Path … The Standardization Path …
JPEG JPEG-LS JPEG 2000 MJPEG 2000 JPEG XR AIC ? H.261 H.263 H.264/AVC/SVC/MVC MPEG-1 Video H.262/MPEG-2 Video MPEG-4 Visual HEVC RVC 3DV
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
3D Related Standards Landscape 3D Related Standards Landscape 3D Related Standards Landscape 3D Related Standards Landscape
Capture/Production Distribution Consumption Video Coding Formats
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Another Big Step Forward … Another Big Step Forward … Another Big Step Forward … Another Big Step Forward …
Blu-ray has decided on full ray has decided on full-resolution format resolution format
Broadcasters have embraced frame-compatible formats compatible formats
MVC/SVC or some hybrid)
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Video Coding Standards: a Summary Video Coding Standards: a Summary Video Coding Standards: a Summary Video Coding Standards: a Summary
Standard Year Main Applications Profiles Main Bitrates Frame Types Ref. Frames Transf
Number Motion Vectors (if any) Motion Vectors Precision Entropy Coding Deblocking Filter
H.261 1988 Videotelephony and videoconference No p× 64 kbit/s
DCT 1 per MB Integer pel Huffman based In loop MPEG
Video 1991 Digital storage in CD- ROM No Around 1- 1.2 Mbit/s I, P, B , and D 0-2 DCT 1 or 2 per MB (P and B) Half pel Huffman based Out of the loop H.262/MPEG- 2 Video 1994 Digital TV and DVD Yes, most used is Main Profile From 2 to 10 Mbit/s I, P and B 0-2 DCT 1 or 2 per MB (2 to 4 for interlaced video ) Half pel Huffman based Out of the loop H.263 1995 Videotelephony and videoconference and more Only in extensions From very low rates to around 1 Mbit/s I, P and B 0-2 DCT 1 or 2 per MB (4 in the optional modes) Half pel Huffman based Out of the loop MPEG
Visual 1998 Large range with
Yes, most used are Simple and Advanced Simple Very large range using levels I, P and B 0-2 DCT 1 or 2 per MB (4 in the optional modes); also global motion vectors 1/4 pel Huffman based; arithmetic coding for the shape Out of the loop H.264/AVC 2004 Large range, from mobile to Blu-ray Yes, most used are Baseline, Main and High Very large range using levels I, P, generalize d B, SP and SI Up to 16 Integer DCT 1 to 16 per MB (P slices) and 1to 32 (B slices) 1/4 pel CAVLC and CABAC In loop SVC 2007 Robust delivery, graceful deletion, broadcasting, Yes Very large range using layers I, P and generalize d B, Up to 16 Integer DCT 1 to 16 per MB (?) 1/4 pel CAVLC and CABAC In loop MVC 2009 Stereo TV, Free viewpoint TV Yes Very large range using levels I, P, B, Up to 16 Integer DCT 1 to 16 per MB (?) 1/4 pel CAVLC and CABAC In loop
Comunicação de Áudio e Vídeo, Fernando Pereira, 2012
Bibliography Bibliography Bibliography Bibliography
H.264/AVC Standard, A. Vetro, T. Wiegand, G. Sullivan, Proceedings of the IEEE, April 2011
Muller, T. Chen, IEEE Transactions on Broadcasting, June 2011
Transactions on Broadcasting, June 2011
Many of these slides have been inspired (and even more ;-) by materials provided by several friends with especial emphasis for Anthony Vetro, MERL,