Fraunhofer FOKUS Institut für Offene Kommunikationssysteme
Deep Encode: Machine Learning for Per-Title Encoding
Daniel Silhavy| IBC20|
Deep Encode: Machine Learning for Per-Title Encoding Daniel - - PowerPoint PPT Presentation
Fraunhofer FOKUS Institut fr Offene Kommunikationssysteme Deep Encode: Machine Learning for Per-Title Encoding Daniel Silhavy| IBC20| Per-Title Encoding What & Why & 1 How? Benefits of Per-Title Encoding Storage and bitrate
Fraunhofer FOKUS Institut für Offene Kommunikationssysteme
Daniel Silhavy| IBC20|
Per-Title Encoding – What & Why & How?
Benefits of Per-Title Encoding
3 Deep Encode –Machine Learning for Per-Title Encoding
Conventional/Static Encoding Ladder Per-Title Encoding Ladder
ü Quality increase at identical bitrates ü Storage and bitrate savings while preserving
Benefits of Per-Title Encoding: the Numbers
5 Deep Encode –Machine Learning for Per-Title Encoding
Average values
Bitrate (kbit/s) VMAF PSNR (dB) Storage (MB)
Conventional
7648.18 94.92 44.37 1397.7
Per-Title
4941.75 93.06 42.41 675.2
Difference Abs
+2706.43
+722.5 MB
Difference (%)
+36%
+52% * Based on a streaming session with a 100Mbit/s connection in dash.js 3.1.1
How does Per-Title Encoding work?
Test Encodes
different settings and calculate corresponding VMAF values.
Convex Hull Estimation
that are close to the convex hull.
Production encoding
encoding using the optimal encoding ladder.
6 Deep Encode –Machine Learning for Per-Title Encoding
A large amount of test encodes is required to derive a sufficient amount of data points.
C
v e x H u l l
Machine Learning for Per-Title Encoding
How to avoid the computationally heavy test encodes
8
ML-based predictions
/ VMAF pairs using machine learning.
Convex Hull
pairs that are close to the convex hull
Production encoding
Two-pass constrained VBR encoding using the
Deep Encode –Machine Learning for Per-Title Encoding
Use ML-based predictions to avoid test encodes and still derive a sufficient amount data points.
Deep Encode – ML for Video Metric Prediction
Content Type 1 Content Type 2 Content Type 3
9
Training Data
Regression
Predicted [Bitrate, Resolution, Quality]
Metadata extraction
spatial complexities
Classification
Categories & Labels Deep Encode –Machine Learning for Per-Title Encoding
Convex Hull Estimation Encoding Ladder
[Bitrate,Resolution,Quality]
Test Encodes Database
Deep Encode: Hands-on UI
| Deep Encode –Machine Learning for Per-Title Encoding 10
Summary, Outlook & Next Steps
Deep Encode ü No computationally heavy test encodes ü Metadata extraction and AI-based image processing for content analysis
ü Deep Learning for
Bitrate] pairs
encoding ladder ü Enhancements
Encoding
Deep Encode: Towards Context-Aware Encoding
Per-Scene Encoding Conventional & Per-Title Encoding
Conventional Static Encoding
content
costs
content
| Deep Encode –Machine Learning for Per-Title Encoding 12
Conventional Per-Title Encoding
test encodes
complex scenes within a movie
Daniel Silhavy
Project Manager Future Applications and Media (FAME) daniel.silhavy@fokus.fraunhofer.de Fraunhofer FOKUS Berlin, Germany FAME Video Development Blog: https://websites.fraunhofer.de/video-dev/