Keeping Video Quality Pristine throughout the Production Process: - - PowerPoint PPT Presentation

keeping video quality pristine throughout the production
SMART_READER_LITE
LIVE PREVIEW

Keeping Video Quality Pristine throughout the Production Process: - - PowerPoint PPT Presentation

Keeping Video Quality Pristine throughout the Production Process: 4:2:2 10-bit AVC/H.264 encoding Pierre Larbier ATEME (Burbank, USA & Bievres, France) Agenda Current States Talk about the AVC/H.264 4:2:2 10-bit profile


slide-1
SLIDE 1

Keeping Video Quality Pristine throughout the Production Process: 4:2:2 10-bit AVC/H.264 encoding

Pierre Larbier ATEME (Burbank, USA & Bievres, France)

slide-2
SLIDE 2

Agenda

  • Current States
  • Talk about the AVC/H.264 4:2:2 10-bit profile
  • Empirical comparisons against 4:2:0 8-bit &

4:2:2 8-bit

  • Multi-Generation Encoding
  • Production Cases

2

slide-3
SLIDE 3

Production & Transmission Processes

  • Production uses 4:2:2 10-bit over SDI

– Used in video editing, tape decks, file storage, etc – Video needs to be transmitted by internal system or across networks

  • Today’s transmission mostly uses

– 4:2:0 8-bit (MPEG-2 or AVC/H.264) for consumer applications – 4:2:2 8-bit(MPEG-2) for contribution applications

  • 20-60 Mbps with latency from 1 sec-250ms

3

slide-4
SLIDE 4

AVC/H.264 Studio Profile VS. Current Implementations

  • Most AVC/H.264 products are currently geared

towards consumer video market

– 4:2:0 8-bit and <30Mbps encode rates @ High Profile – Using this equipment will necessitate resampling & incur mismatching if used for production transmission applications

  • High 4:2:2 Profile

– Can handle 4:2:2 processing @ 10-bit and is ideal for production transmission applications – Can handle upto 200 Mbps(HD) and 40 Mbps (SD)

4

slide-5
SLIDE 5

About 4:2:0 Compression

  • Professional video sources are 4:2:2
  • 4:2:0 compression requires resampling filters

5

  • Progressive and Interlace chroma location schemes are different
  • Challenges for existing chroma resampling architecture:
  • Possible mismatch between the 2 resampling filters
  • Resampling filters quality not guaranteed
  • Choosing between Progressive and Interlace filters is not obvious

4:2:2 4:2:2 → 4:2:0 Downscaling 4:2:0 Encoding 4:2:2 4:2:0 → 4:2:2 Upscaling 4:2:0 Decoding SDI SDI Encoder Decoder

slide-6
SLIDE 6

4:2:0 Chroma Artifacts

  • Mainly caused by a mismatch between downscale/upscale filters
  • Color bleeding
  • Chroma detail loss
  • Interlace/Progressive artifacts
  • Worsen with each generation
  • Difficult to control in a Production chain

6

After 5 4:2:2 ↔ 4:2:0 conversions Source picture

Detail Loss Color Smearing Interlace artifacts

slide-7
SLIDE 7

About 4:2:2 Compression

  • At the 1st generation, with perfect resampling filters, 4:2:2 and 4:2:0
  • verall quality is roughly the same:

7

  • But in multi-generation and/or with mismatched sampling filters,

4:2:2 quality is much better than 4:2:0

CrowdRun, 1080i25 33 35 37 39 41 43 45 47 49 20 40 60 80 100 120 140 160 180 200 220 Bitrate (Mbps) Combined PSNR (dB) 4:2:0 4:2:2

4:2:2 vs 4:2:0 with perfect resampling filters

slide-8
SLIDE 8

Current 8-bit Compression

  • Professional video sources are 10-bit
  • 8-bit compression requires rounding before compression

8

  • Motion compensation and loop-filtering performed using 8-bit samples
  • Challenges for 8-bit architecture:
  • Scalers match and quality not guaranteed
  • Reduced accuracy in the encoding process reduces coding efficiency
  • Luma shift caused by lack of rounding control

10-bit 10-bit → 8-bit Downscaling 8-bit Encoding 10-bit 8-bit → 10-bit Upscaling 8-bit Decoding SDI SDI Encoder Decoder

slide-9
SLIDE 9

10-bit compression: banding removal

  • Processing video with 8-bit sample depth may create banding artifacts in

shallow changing light scenes:

  • Blue skies
  • Underwater scenes
  • Sunsets
  • These defects are not visible at 10-bit or more

9

10-bit compression 8-bit compression

Banding artifacts

slide-10
SLIDE 10

10-bit compression: increased coding efficiency

  • Coding efficiency is increased: less bit-rate for the same quality
  • Rate saving between 5% and 20% on most sources
  • Most of the gain is provided with 10-bit coding. With 12-bit and 14-bit,

coding efficiency gain is there but much smaller

10

Woman with a Bird Cage, 1080i30 28 30 32 34 36 38 40 42 44 46 48 50 15 30 45 60 75 90 105 120 135 150 Bitrate (Mbps) PSNR Y (dB) 8 bit 10 bit 12 bit Woman with a Bird Cage, 1080i30 0.00 0.10 0.20 0.30 0.40 0.50 0.60 20 40 60 80 100 120 140 160 180 200 220 Bitrate (Mbps) PSNR Y gain relative to 8-bit (dB) 10 bit 12 bit

RD curve at 8,10 and 12-bit Distortion gain vs 8-bit compression

slide-11
SLIDE 11

10-bit compression: artifacts reduction

  • 10-bit compression reduces 3 kind of artifacts:
  • Contouring
  • Smearing
  • Mosquito noise
  • As a side-effect, 10-bit compression also reduces random distortion of

shallow textured objects (clouds for instance)

11

Contouring Mosquito Noise

8-bit compression 10-bit compression

slide-12
SLIDE 12

AVC H422P vs MPEG-2 422P

  • H.264 H422P outperforms MPEG-2 422P in Contribution applications:
  • 10-bit support
  • More than 20Mbps saved at Contribution rates!

12

CrowdRun, 1080i25 28 30 32 34 36 38 40 42 44 46 20 40 60 80 100 120 140 160 Bitrate (Mbps)

Combined PSNR (dB)

MPEG-2 AVC/H.264

Woman with a Bird Cage, 1080i30 38 39 40 41 42 43 44 45 46 47 48 49 50 51 20 40 60 80 100 120 140 160 Bitrate (Mbps)

Combined PSNR (dB)

MPEG-2 AVC/H.264

Example 2 Example 1

slide-13
SLIDE 13

GOP Structure Types for Contribution

  • GOP structure needs to be carefully considered
  • Possible Contribution GOP Structures

– Intra-only, AVC-I, Short GOP, Adaptive Long GOP

13

32 34 36 38 40 42 44 46 48 50 20 40 60 80 100 120 140 160 180 200 Average PSNR Y Bitrate (Mbps) Into Tree, 1080i25 AVC-I 100 Intra Only IbBbI Long GOP

Importance of GOP structure on coding efficiency

  • Adaptive Long GOP and Short GOP give

better performance at same bit rate

  • Guide: let encoder make picture type

decision wherever possible

slide-14
SLIDE 14

Role of GOP in Multi-Generation Encoding

  • 3-4 Re-encodes can happen in a production transmission workflow path
  • Aligning Encodes at GOP Boundaries can retain quality

– PSNR Drop 3dB (Aligned) Vs 5dB (non-aligned) @ 7th GEN

14

27 28 29 30 31 32 33 34 35 36 1 2 3 4 5 6 7 Average PSNR Y (dB) Generation ParkJoy 1080i25 @ 60Mbps - Aligned GOP I IbBbI Long GOP 27 28 29 30 31 32 33 34 35 36 1 2 3 4 5 6 7 Average PSNR Y (dB) Generation ParkJoy 1080i25 @ 60Mbps - Unaligned GOP I IbBbI Long GOP

Multiple generations with aligned GOPS Multiple generations without aligned GOPS

slide-15
SLIDE 15

Production Uses

  • Remote Truck Uplinks

– Transmit in AVC 422 10-bit and at lower bit rates than MPEG-2 422P

  • Playout to Tape Infrastructures

– Easily integrate transmission back into an SDI infrastructure (international applications)

  • Remote Collaboration

– Realtime 422 10-bit at lower bit rates

  • Multiple Generations Re-encodes

– Typical Streams can re-encode 3-4 times before it gets to consumer, so transmission in 422 10-bit aligned GOP will preserve quality

15

slide-16
SLIDE 16

Advantages

  • Significantly lower transmission costs
  • Improve video quality on existing transmission links

16

slide-17
SLIDE 17

Summary

Using 4:2:2 10-bit allows:

  • Process original source video in as close to an original form
  • Enables most demanding application in terms of quality & rate
  • Offers significant gain in quality and/or rate over existing solutions
  • Reduces banding, smearing, and mosquito noise
  • Retains quality in production processes
  • Integrates into an SDI infrastructure at highest allowable quality

17

slide-18
SLIDE 18

Thank-You

p.larbier@ateme.com