SLIDE 1
22 June 2016
- VRT STON project: Subtitling of TV shows by using speech technology
- Subtitle generation is a time-consuming process that can be (partially) automated
Speaker Diarization for Automatic Subtitle Generation
Why solve the “who-spoke-when?” problem?
- Subtitles with color codes
- Enable the use of speaker-adapted models for speech recognition (SR)
- Extra information for the SR language model through detected sentence boundaries
[Figure: timeline of consecutive speech segments labeled Speaker 1, Speaker 1, Speaker 2, Speaker 2, Speaker 2]
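As a rough illustration of the color-coded subtitle use case, the sketch below maps diarization output (segments with a start time, end time, and speaker label) to one subtitle color per speaker. The `Segment` class, the `PALETTE` list, the timings, and the placeholder texts are all assumptions made for this example; they are not part of the STON system.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float   # segment start time in seconds
    end: float     # segment end time in seconds
    speaker: str   # label produced by the diarization step
    text: str      # transcript of the segment (placeholder here)


# Hypothetical palette; a real broadcaster would follow its own style guide.
PALETTE = ["white", "yellow", "cyan", "green"]


def assign_colors(segments):
    """Give each speaker a color, in order of first appearance."""
    colors = {}
    for seg in segments:
        if seg.speaker not in colors:
            # Wrap around if there are more speakers than colors.
            colors[seg.speaker] = PALETTE[len(colors) % len(PALETTE)]
    return colors


def colorize(segments):
    """Return (start, end, color, text) tuples ready for a subtitle writer."""
    colors = assign_colors(segments)
    return [(s.start, s.end, colors[s.speaker], s.text) for s in segments]


# Illustrative input mirroring the speaker sequence in the figure above.
segments = [
    Segment(0.0, 2.0, "Speaker 1", "first utterance"),
    Segment(2.0, 4.5, "Speaker 1", "second utterance"),
    Segment(4.5, 6.0, "Speaker 2", "third utterance"),
    Segment(6.0, 8.0, "Speaker 2", "fourth utterance"),
    Segment(8.0, 9.5, "Speaker 2", "fifth utterance"),
]

for start, end, color, text in colorize(segments):
    print(f"{start:5.1f} - {end:5.1f}  [{color}]  {text}")
```

Assigning colors by order of first appearance keeps the mapping stable within one show; keeping a speaker's color consistent across episodes would require speaker identification, which this sketch does not attempt.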