PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Tcnico - PowerPoint PPT Presentation

PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Técnico Multimedia Communication, Fernando Pereira, 2016/2017

Many, Many Pictures ... Source: KPCB 2014 Internet Trends, estimates based on publicly disclosed company data Multimedia Communication, Fernando Pereira, 2016/2017

Multilevel Photographic Image Coding (gray and colour) OBJECTIVE Efficient representation of multilevel photographic images (still pictures) for storage and transmission. Multimedia Communication, Fernando Pereira, 2016/2017

Applications  Digital cameras  Image databases, e.g. museums, maps  Desktop publishing  Colour fax  Medical images  ...  and Digital cinema (!) Multimedia Communication, Fernando Pereira, 2016/2017

Typical Digital Transmission Chain ... Source Channel Digitalization Source Channel (sampling + Modulation Coding Coding quantization + PCM) Analog Compressed ‘Channel Modulated PCM bits signal bits Protected’ symbols bits Multimedia Communication, Fernando Pereira, 2016/2017

The Image Representation Problem ... A image is represented as a set of M  N luminance and chrominance samples ( spatial sampling and quantization ) with a certain number of bits per sample, P ( PCM coding ). Thus, the total number of bits (M  N  P) - and so the memory and bandwidth – necessary to PCM digitally represent an image is HUGE !!! This is the so-called RAW image ! Multimedia Communication, Fernando Pereira, 2016/2017

Image (Source) Coding Objective Image coding/compression deals with the efficient representation of images, satisfying the relevant requirements. And these requirements keep changing, e.g., coding efficiency, error resilience, random access, interaction, editing, to address new applications and functionalities ... Multimedia Communication, Fernando Pereira, 2016/2017

Where does Compression come from ?  REDUNDANCY – Regards the similarities, correlation and predictability of samples and symbols corresponding to the image/audio/video data. -> redundancy reduction does not involve any information loss, implying it is a reversible process – > lossless coding  IRRELEVANCY – Regards the part of the information which is imperceptible for the visual or auditory human systems. -> irrelevancy reduction involves removing non-redundant information, implying it is an irreversible process -> lossy coding Source coding exploits these two concepts: for this, it is necessary to know the source statistics and the human visual/auditory systems characteristics. Multimedia Communication, Fernando Pereira, 2016/2017

Source Coding: Original Data, Symbols and Bits Encoder Compressed Original data, Symbols bits e.g. PCM bits Data Model Entropy Coder Source Coding implies two main steps:  Data modeling – By adopting a more powerful data representation model the raw PCM symbols are converted into more efficient and ‘sophisticated’ symbols, notably exploiting spatial and temporal redundancies as well as irrelevancy, targeting the relevant representation requirements  Entropy coding – By exploiting the statistical characteristics of the symbols produced by the data modeling process, a set of bits is produced Multimedia Communication, Fernando Pereira, 2016/2017

Image Coding: Multiple Solutions  DCT-based transform coding, e.g. JPEG standard  Fractal-based coding  Vector quantization coding  Wavelet-based coding, e.g. JPEG 2000 standard  Lapped biorthogonal-based transform coding, e.g. JPEG XR standard  … Multimedia Communication, Fernando Pereira, 2016/2017

~1990 The JPEG Standard (Joint Photographic Experts Group, joint ISO & ITU-T) Multimedia Communication, Fernando Pereira, 2016/2017

Objective Definition of a generic compression standard for multilevel photographic images considering the requirements of most applications. Multimedia Communication, Fernando Pereira, 2016/2017

Interoperability, thus Standards !  Image coding is used in the context of many applications where interoperability is an essential requirement.  The interoperability requirement is satisfied through the specification of a coding standard which represents a voluntary agreement between multiple parties.  To foster evolution and competition, standards must offer interoperability through the specification of the minimum essential number of tools. Multimedia Communication, Fernando Pereira, 2016/2017

JPEG Standard Major Requirements ≈1985  Efficiency - The standard must be based on the most efficient compression techniques, notably for high quality.  Compression/Quality Tunable - The standard shall allow tuning the quality versus compression efficiency.  Generic - The standard must be applicable to any type of multilevel photographic images without restrictions in resolution, aspect ratio, color space, content, etc.  Low Complexity - The standard must be implementable with a reasonable complexity; notably, its software implementation on a large range of CPUs must be possible.  Functional Flexibility - The standard must provide various relevant operation modes, notably sequential, progressive, lossless and hierarchical. Multimedia Communication, Fernando Pereira, 2016/2017

JPEG Elements Encoder Coded bitstream v Tables Original image Decoder Coded bitstream v Tables Decoded image Multimedia Communication, Fernando Pereira, 2016/2017

What Images can JPEG Encode ?  Size between 1 × 1 and 65535 × 65535  1 to 255 colour components or spectral bands (typically YC R C B or RGB)  Each component, C i , consists of a matrix with x i columns and y i lines  8 or 12 bits per sample for (lossy) DCT based compression  2 to 16 bits per sample for lossless compression Multimedia Communication, Fernando Pereira, 2016/2017

Types of JPEG Compression  LOSSLESS - The image is reconstructed with no losses, this means it is mathematically equal to the original; compression factors of about 2-3 may be achieved, depending on the image content.  LOSSY - The image is reconstructed with losses but, if desired, with a very high fidelity to the original (transparent coding); this type of coding allows achieving higher compression factors, e.g. 10, 20 or more; in the JPEG standard, this type of coding is based on the Discrete Cosine Transform (DCT). Multimedia Communication, Fernando Pereira, 2016/2017

JPEG Baseline Process The most used JPEG coding solution is DCT based (lossy), called BASELINE SEQUENTIAL PROCESS and it is appropriate to inumerous applications. This process is mandatory for all systems claiming JPEG compliance. Multimedia Communication, Fernando Pereira, 2016/2017

DCT Based Coding Data Entropy Model Coder The joint action of the various JPEG Baseline encoder modules targets the reduction of the redundancy and irrelevancy contained in the images. The first encoder part (data modeling) targets the generation of a signal without memory (elimination of spatial redundancy) and without irrelevancy. The final entropy coding module targets the generation of equiprobable symbols in order to minimize the data to transmit (elimination of statistical redundancy). Multimedia Communication, Fernando Pereira, 2016/2017

DCT Based Image Coding Statistical Redundancy Spatial Quantization Redundancy Coding tables tables Block Entropy DCT Quantization splitting coder ≠ Transmission or storage Irrelevancy Quantization Coding tables tables Inverse Entropy Block IDCT quantization decoder assembling Multimedia Communication, Fernando Pereira, 2016/2017

What is Really a 8×8 Block ... Imagine a block where all the samples are similar, this means have the same value ... Multimedia Communication, Fernando Pereira, 2016/2017

Why do we Transform Blocks ? Basically, the transform represents the original signal in another domain where there is less spatial redundancy.  The full exploitation of the spatial redundancy in the image would require applying the transform to blocks as big as possible, ideally to the full image; however, the redundancy is rather ‘regional’ ...  The computational effort associated to the transform grows quickly with the size of the block used … and the added spatial redundancy decreases … So some trade -off is needed ... Applying the transform to blocks, typically of 8 × 8 samples, was a good trade-off between the exploitation of the spatial redundancy and the associated computational effort. Multimedia Communication, Fernando Pereira, 2016/2017

What is Transformed ? Transform is applied block after block in the image ...   87 89 101 106 118 130 142 155   85 91 101 105 116 129 135 149     86 92 96 105 112 128 131 144   92 88 102 101 116 129 135 147   Same process (in Y =   88 94 94 98 113 122 130 139   parallel) for luminance   88 95 98 97 113 119 133 141 and the chrominances !   92 99 98 106 107 118 135 145       89 95 98 107 104 112 130 144 Multimedia Communication, Fernando Pereira, 2016/2017

JPEG Block Coding Sequence Multimedia Communication, Fernando Pereira, 2016/2017

The Block Effect … Multimedia Communication, Fernando Pereira, 2016/2017

PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Tcnico - PowerPoint PPT Presentation

PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Tcnico Multimedia Communication, Fernando Pereira, 2016/2017 Many, Many Pictures ... Source: KPCB 2014 Internet Trends, estimates based on publicly disclosed company data Multimedia

Protect and Enjoy Your Photographic Memories by Jerry Hug, APSA and Jim Bodkin Saving the

Topic 7: Photographic Process Aim: To cover the basics of the photographic process, and the prop-

PHOTOGRAPHIC PHOTOGRAPHIC IMAGING IMAGING Fernando Pereira Fernando Pereira Instituto

PENELOPE SLINGER B ACK COVER : EAT MY WORDS F RONT COVER : I HEAR WHAT YOU SAY photographic

PRESENTATION PRICES HIGH QUALITY PRINTS CARD MOUNTED PRINTS High quality photographic prints

N3TC 2019 Photographic Competition OVERALL WINNER 1 1 Overall Winner: Leon Heyes Arum Mono

PENELOPE SLINGER B ACK COVER : EAT MY WORDS F RONT COVER : I HEAR WHAT YOU SAY photographic

Photographic Image Synthesis with Cascaded Refinement Networks Qifeng Chen and Vladlen Koltun

ARIADNE: A Photographic 1-ton Two-Phase Liquid Argon Detector Jared Vann (on behalf of the

El Limon-Guajes Mine Photographic Report First Ounces December 2015 2 First Shipment January

Yvette Harvey Keeper of the RHS Herbarium Welcome to WSY specimens: 83,000 plant portraits:

pler Sulfide Expansion Project Photographic Update February 2017 TSX: ASR / ASX: AQG / 1

- when and how the Royalty-Free JPEG patent policy got lost JPEG = JOINT PHOTOGRAPHIC EXPERTS

A photographic essay exploring human suffering and persecution through the journeys and lives of

Creating Digital Photo Books Step-By-Step by Jim and Diane Bodkin Previously We Showed: Saving

Physics-Motivated Features for Distinguishing Photographic I mages and Computer Graphics

Quantum decoupling via efficient classical operations and the entanglement cost of one-shot

Integer-Forcing for Channels, Sources and ADCs Or Ordentlich Tel Aviv University Or Ordentlich

Katalin Marton Abbas El Gamal Stanford University Withits 2010 A. El Gamal (Stanford

PriSec Research Group Datavetenskap, Karlstads universitet Christer Andersson , Reine Lundin On

From Analogue to Digital: From Analogue to Digital: From Analogue to Digital: Concepts and

Natural Language Processing (CSE 490U): Generation: Translation & Summarization Noah Smith c

Machine Translation 5LN426 and 5LN711 Sara Stymne Uppsala University Slides mainly from Jrg

VERY LARGE DEVELOPMENT HOW TO RUN CODE REVIEW FOR 1000+ OPEN SOURCE DEVELOPERS Joe Gordon

Sambuz

Useful Links

Newsletter

Mail Us

PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Tcnico - PowerPoint PPT Presentation

PHOTOGRAPHIC IMAGING Fernando Pereira Instituto Superior Tcnico Multimedia Communication, Fernando Pereira, 2016/2017 Many, Many Pictures ... Source: KPCB 2014 Internet Trends, estimates based on publicly disclosed company data Multimedia

Protect and Enjoy Your Photographic Memories by Jerry Hug, APSA and Jim Bodkin Saving the

Topic 7: Photographic Process Aim: To cover the basics of the photographic process, and the prop-

PHOTOGRAPHIC PHOTOGRAPHIC IMAGING IMAGING Fernando Pereira Fernando Pereira Instituto

PENELOPE SLINGER B ACK COVER : EAT MY WORDS F RONT COVER : I HEAR WHAT YOU SAY photographic

PRESENTATION PRICES HIGH QUALITY PRINTS CARD MOUNTED PRINTS High quality photographic prints

N3TC 2019 Photographic Competition OVERALL WINNER 1 1 Overall Winner: Leon Heyes Arum Mono

PENELOPE SLINGER B ACK COVER : EAT MY WORDS F RONT COVER : I HEAR WHAT YOU SAY photographic

Photographic Image Synthesis with Cascaded Refinement Networks Qifeng Chen and Vladlen Koltun

ARIADNE: A Photographic 1-ton Two-Phase Liquid Argon Detector Jared Vann (on behalf of the

El Limon-Guajes Mine Photographic Report First Ounces December 2015 2 First Shipment January

Yvette Harvey Keeper of the RHS Herbarium Welcome to WSY specimens: 83,000 plant portraits:

pler Sulfide Expansion Project Photographic Update February 2017 TSX: ASR / ASX: AQG / 1

- when and how the Royalty-Free JPEG patent policy got lost JPEG = JOINT PHOTOGRAPHIC EXPERTS

A photographic essay exploring human suffering and persecution through the journeys and lives of

Creating Digital Photo Books Step-By-Step by Jim and Diane Bodkin Previously We Showed: Saving

Physics-Motivated Features for Distinguishing Photographic I mages and Computer Graphics

Quantum decoupling via efficient classical operations and the entanglement cost of one-shot

Integer-Forcing for Channels, Sources and ADCs Or Ordentlich Tel Aviv University Or Ordentlich

Katalin Marton Abbas El Gamal Stanford University Withits 2010 A. El Gamal (Stanford

PriSec Research Group Datavetenskap, Karlstads universitet Christer Andersson , Reine Lundin On

From Analogue to Digital: From Analogue to Digital: From Analogue to Digital: Concepts and

Natural Language Processing (CSE 490U): Generation: Translation &amp; Summarization Noah Smith c

Machine Translation 5LN426 and 5LN711 Sara Stymne Uppsala University Slides mainly from Jrg

VERY LARGE DEVELOPMENT HOW TO RUN CODE REVIEW FOR 1000+ OPEN SOURCE DEVELOPERS Joe Gordon

Sambuz

Useful Links

Newsletter

Mail Us

Natural Language Processing (CSE 490U): Generation: Translation & Summarization Noah Smith c