Measuring Video Quality with VMAF: Why You Should Care Christos - - PowerPoint PPT Presentation

measuring video quality with vmaf why you should care
SMART_READER_LITE
LIVE PREVIEW

Measuring Video Quality with VMAF: Why You Should Care Christos - - PowerPoint PPT Presentation

Measuring Video Quality with VMAF: Why You Should Care Christos Bampis Encoding Technologies, Netflix AOMedia Research Symposium San Francisco, October 15, 2019 Overview history and introduction to VMAF adoption challenges


slide-1
SLIDE 1

Measuring Video Quality with VMAF: Why You Should Care

AOMedia Research Symposium San Francisco, October 15, 2019 Christos Bampis Encoding Technologies, Netflix

slide-2
SLIDE 2
  • history and introduction to VMAF
  • adoption
  • challenges
  • why is VMAF becoming more useful?

Overview

slide-3
SLIDE 3

Need a better perceptual metric

PSNR 29.1 dB PSNR 29.3 dB Humans 19 69

slide-4
SLIDE 4
  • accurately measures human

perception of video quality

  • consistent across content
  • works well for picture artifacts relevant

to adaptive streaming

○ compression artifacts ○ scaling artifacts

  • pen-source!

VMAF: Video Multimethod Assessment Fusion

VMAF

slide-5
SLIDE 5

The VMAF chronicle

2014 2015 2016 2017 2018

Started collaboration with USC Started collaboration with U. Nantes First VMAF running in prod @ Netflix Started collaboration with UT Austin VMAF went live on Github; first VMAF techblog published VMAF 0.6.1 published; added a phone model libvmaf published; VMAF supported by FFmpeg Speed optimization; added a 4K model; added confidence interval First public showing at ICIP VMAF-enabled video optimization in prod @ Netflix

2019

Speed

  • ptimization
slide-6
SLIDE 6

human visual system (HVS) modeling: simulate low-level neuro-circuits

VMAF framework

spatial feature extraction (VIF, DLM) Pixel Neighborhood within-frame spatial pooling Frame Level SVM prediction training with subjective data per-frame score trained model temporal pooling temporal feature extraction (TI) “Fusion”

slide-7
SLIDE 7

HVS modeling: contrast masking

  • One signal (e.g. compression artifacts) becomes more difficult to be detected

by human eye when it is superimposed on another masker signal (e.g. the pristine source) of similar spatial frequency and orientation

masking

[Source: HDR-VDP2, Mantiuk et al. 2011]

slide-8
SLIDE 8

VMAF framework

spatial feature extraction (VIF, DLM) Pixel Neighborhood within-frame spatial pooling Frame Level SVM prediction training with subjective data per-frame score trained model temporal pooling temporal feature extraction (TI) “Fusion” Machine learning: align features with subjective scores

slide-9
SLIDE 9

Lab test: collect subjective scores

Bad Poor Fair Good Excellent

Absolute Category Rating (ACR) Scale

slide-10
SLIDE 10

Map ACR scale to VMAF scale

VMAF Scale

100 20 40 60 80 Bad Poor Fair Good Excellent

Absolute Category Rating (ACR) Scale

True Score (ACR Scale) VMAF Score

slide-11
SLIDE 11

Demo time!

slide-12
SLIDE 12
  • industry
  • research community

VMAF adoption examples

slide-13
SLIDE 13

Integration in 3rd-party tools

slide-14
SLIDE 14

VMAF in codec comparisons

[Source: JVET-O0451 Subjective Comparison of VVC and HEVC, JVET 15th meeting: Gothenburg, SE, 3–12 July 2019]

Resolution BD-rate (PSNR) BD-rate (VMAF) BD-rate (MOS) HD

  • 31.24%
  • 35.18%
  • 36%

UHD

  • 34.42%
  • 40.44%
  • 40%
slide-15
SLIDE 15

VMAF in research papers

slide-16
SLIDE 16
  • design dimensionality
  • dealing with noise

What are the challenges?

SDR/HDR codecs resolution

slide-17
SLIDE 17

increased number of dimensions:

  • different encoders: H.264/AVC, HEVC, VP9, AV1
  • SDR vs. HDR, dark vs. bright scenes
  • different viewing conditions (phone vs. TV, 1080 vs. 4K)
  • key question: how to design a model that is extensible

and consistent?

Design dimensionality

slide-18
SLIDE 18

Dealing with noise

  • VMAF underpredicts under noisy source
  • assess film-grain synthesis tools (e.g. AV1)

source noise model denoise encode decode add noise

slide-19
SLIDE 19

Why is VMAF becoming more useful?

  • newer codecs (e.g. AV1) add more perceptual tools to their

arsenal and PSNR is not enough to evaluate them

  • pen-source and well-adopted: problems are easier to find
  • we are committed to further improving VMAF’s accuracy and

speed

slide-20
SLIDE 20

Summary

  • VMAF aims to fill the gap in perceptual video quality metrics
  • adopted by industry and academia, but there is room for

improvement

  • becomes more relevant for new and future codecs (AV1, AV2),

e.g., for codec comparison, encoding optimization

slide-21
SLIDE 21

Questions?