muscle wp5 showcase
play

MUSCLE WP5 Showcase: M. Perakakis E. Sanchez-Soto Real-Time - PowerPoint PPT Presentation

Groups and Researchers Involved TSI-TUC A. Potamianos (showcase leader) MUSCLE WP5 Showcase: M. Perakakis E. Sanchez-Soto Real-Time Audio-Visual ICCS-NTUA Automatic Speech Recognition P. Maragos (group leader)


  1. Groups and Researchers Involved � TSI-TUC � A. Potamianos (showcase leader) MUSCLE WP5 Showcase: � M. Perakakis � E. Sanchez-Soto Real-Time Audio-Visual � ICCS-NTUA Automatic Speech Recognition � P. Maragos (group leader) Demonstrator � G. Papandreou (visual/fusion) � A. Katsamanis (audio/fusion) � V. Pitsikalis (audio/fusion) TSI-TUC (leader) � INRIA-TEXMEX: ICCS-NTUA � P. Gros (group leader) INRIA-TEXMEX � G. Gravier (fusion) MUSCLE MUSCLE April 2007 ICCS-NTUA Showcase Main Points Audio-Visual Automatic Speech Recognition � Shortcomings of current AV-ASR systems � Research-level set-ups � videos shot under carefully controlled conditions � processing is performed off-line � Goal: build a proof-of-concept practically deployable laptop-based AV-ASR prototype which: � uses low-end consumer microphone and camera to Audio Recognized capture the speaker Video Speech � performs visual/audio feature extraction, as well as � Audio-only Automatic Speech Recognition (ASR) degrades speech recognition on the laptop in real-time under noise � is robust to failures of a single modality, such as visual � Use video for lip-reading to boost ASR performance occlusion of the speaker's face MUSCLE MUSCLE MUSCLE MUSCLE April 2007 April 2007 ICCS-NTUA ICCS-NTUA

  2. Tasks Visual Front-End � T1: Visual Front-end � Analyze face expression and appearance � Face detector (DONE) � Face tracking and feature extraction (DONE) � Real-time feature extraction algorithms � Optimization for real-time performance (IN PROGRESS) � Excellent performance in AV-ASR experiments � T2: Audio-Visual Recognition Model and Fusion � Advanced baseline audio front-end (DONE) � HMM-based recognition back-end (DONE) � Model training on audio-visual corpora (DONE) + p 1 + p 2 = � Adaptive audio-visual fusion (IN PROGRESS) � T3: System Integration � Laptop-based system (IN PROGRESS) + λ + λ � Usable for live AV-ASR demonstrations (IN PROGRESS) = 1 2 � Project duration: December 2006 – June 2007 MUSCLE MUSCLE MUSCLE MUSCLE April 2007 April 2007 ICCS-NTUA ICCS-NTUA Feature Fusion � Goal: � Adaptive fusion heterogeneous information streams � Stream weights improve recognition performance � Test alternative techniques for stream weight computation � Minimum classification error � Feature measurement uncertainty compensation � Previous work by all three partners � Stream weight adaptation � Depending on auditory SNR � Either static or fully dynamic MUSCLE MUSCLE MUSCLE MUSCLE April 2007 April 2007 ICCS-NTUA ICCS-NTUA

  3. Tasks Audio-Only ASR Live Demo � T1: Visual Front-end � Real-Time continuous digits ASR � Face detector (DONE) � Model Training on the WSJ database � Face tracking and feature extraction (DONE) � Optimization for real-time performance (IN PROGRESS) � T2: Audio-Visual Recognition Model and Fusion � Advanced baseline audio front-end (DONE) � HMM-based recognition back-end (DONE) � Model training on audio-visual corpora (DONE) � Adaptive audio-visual fusion (IN PROGRESS) � T3: System Integration � Laptop-based system (IN PROGRESS) � Usable for live AV-ASR demonstrations (IN PROGRESS) � Project duration: December 2006 – June 2007 MUSCLE MUSCLE MUSCLE MUSCLE April 2007 April 2007 ICCS-NTUA ICCS-NTUA Audio-Visual Speech Recognition Demo AV A MUSCLE MUSCLE April 2007 ICCS-NTUA

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend