

SLIDE 1

Chair of Network Architectures and Services
Department of Informatics
Technical University of Munich

Florian Gratzer
Advisor: Marc-Oliver Pahl
Garching, June 26th 2015

Voice Controlled Smart Spaces

SLIDE 2

Motivation

  • Controlling the environment via voice commands
  • Using
      • DS2OS as middleware
      • Devices with low energy consumption (Raspberry Pi, Arduino, …)
      • Pocketsphinx for voice recognition
      • Smartphones for voice input
  • Mapping of voice input to actions within DS2OS

2 Florian Gratzer | Voice Controlled Smart Spaces

SLIDE 3

Voice recognition

  • Human speech waveform contains a large amount of information
  • Dependent on
      • Speaker
      • Speaking rate
      • Acoustic conditions
  • Hardly possible to match samples directly
  • Multiple processing steps are required
  • Using “Features” for matching

SLIDE 4

Processing steps in voice recognition

  • Start and end time detection
      • Manual
      • (Semi-)automatic
  • Feature extraction
      • Filtering
      • Windowing
      • Extracting features
  • Feature matching
      • Change features before matching
      • Calculate distance between the spoken utterance and known samples

[B. Pfister – Sprachverarbeitung ISBN: 9788578110796]
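The processing steps listed on this slide can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the pipeline used in the thesis or in Pocketsphinx: the energy threshold, the pre-emphasis filter, the Hamming window, the one-dimensional log-energy feature, and dynamic time warping (DTW) as the distance are all choices made here for the example. Real recognizers typically use richer features. All function names are hypothetical.

```python
import math


def detect_endpoints(samples, frame_len=160, threshold=0.01):
    """Automatic start/end detection: span of frames whose mean energy exceeds a threshold."""
    active = [
        i for i in range(0, len(samples) - frame_len + 1, frame_len)
        if sum(x * x for x in samples[i:i + frame_len]) / frame_len > threshold
    ]
    if not active:
        return None  # no frame was loud enough to count as speech
    return active[0], active[-1] + frame_len


def pre_emphasis(samples, alpha=0.97):
    """Filtering: first-order high-pass, y[n] = x[n] - alpha * x[n-1]."""
    if not samples:
        return []
    return [samples[0]] + [
        samples[n] - alpha * samples[n - 1] for n in range(1, len(samples))
    ]


def windowed_frames(samples, frame_len=160, hop=80):
    """Windowing: overlapping frames, each multiplied by a Hamming window."""
    window = [
        0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
        for n in range(frame_len)
    ]
    return [
        [s * w for s, w in zip(samples[i:i + frame_len], window)]
        for i in range(0, len(samples) - frame_len + 1, hop)
    ]


def log_energy_features(samples):
    """Extracting features: one log-energy value per frame (toy feature, assumed here)."""
    return [
        math.log(sum(x * x for x in frame) + 1e-10)
        for frame in windowed_frames(pre_emphasis(samples))
    ]


def dtw_distance(a, b):
    """Feature matching: DTW distance between two feature sequences.

    DTW tolerates different speaking rates by letting one sequence
    stretch or compress in time relative to the other.
    """
    inf = float("inf")
    cost = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[len(a)][len(b)]


def recognize(features, templates):
    """Return the label of the known sample closest to the utterance."""
    return min(templates, key=lambda label: dtw_distance(features, templates[label]))
```

A call such as `recognize(log_energy_features(samples), templates)` would then pick the stored template with the smallest distance, where `templates` maps a command label to the feature sequence of a recorded sample.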

SLIDE 5

Interface of DS2OS VSL


[http://pahl.de/?site=__publications#pahlDissertation_ds2os_2014]

SLIDE 6

Requirements

  • Usability
  • Offline functionality
  • Runnable on low power hardware
  • Low error rate
  • Voice feedback

SLIDE 7

Related work

  • Cloud-based solutions

[https://lh3.googleusercontent.com/E4UjLg_CtHlcAHKQlRX044ZtRERwWPqgD0VLL2TLo3LAGH8hZssK84SEwyNSsLFsfdO2=w300]
[http://www.windowsphonearea.com/wp-content/uploads/2014/04/cortana-face-logo.jpg]
[https://govivace.com/images/logo_govivac.png]
[https://upload.wikimedia.org/wikipedia/de/archive/8/86/20120916104811!Siri-Logo.png]
[http://byticious.com/wp-content/uploads/2016/03/Google_Cloud_Speech_API.jpg]
[http://img.scoop.it/4nryVafz-j2Vj_x-SlxB-Tl72eJkfbmt4t8yenImKBVvK0kTmF0xjctABnaLJIm9]

SLIDE 8

Related work

  • MOVI Arduino Shield
      • 150 full-sentence voice commands
      • CMUSphinx for voice recognition
      • Debian based
  • System by c’t
      • Controlling Philips Hue lamps
      • REST API
  • Jasper

[http://www.audeme.com/movi.html] [https://cdn-cx-images.dynamite.myharmony.com/mh/ssv2/devices/philips-hue-glamour-v2.png]


SLIDE 9

Design

[http://img2.jalousiescout.de/products/20000020/20000020_i2_weiss.jpg] [http://cdn.idealo.com/folder/Product/1941/4/1941409/s1_produktbild_mid/falcon-appliances-classic-90-induktion.jpg] [http://d2rormqr1qwzpz.cloudfront.net/photos/2013/04/03/47185-arduino.jpg] [https://upload.wikimedia.org/wikipedia/commons/3/31/Raspberry_Pi_2_Model_B_v1.1_top_new_(bg_cut_out).jpg] [http://icons.iconarchive.com/icons/iconsmind/outline/128/Microphone-icon.png] [http://icons.iconarchive.com/icons/iconsmind/outline/128/Speaker-2-icon.png]


SLIDE 10

Implementation

  • 5 types of services
      • Voice recognition services


SLIDE 11

Implementation

  • 5 types of services
      • Voice recognition services
      • Voice output services


SLIDE 12

Implementation

  • 5 types of services
      • Voice recognition services
      • Voice output services
      • Adaptation services


SLIDE 13

Implementation

  • 5 types of services
      • Command mapping service
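A command mapping service of this kind can be pictured as a lookup from recognized phrases to actions on addressable values. The sketch below is a minimal illustration, not the author's implementation: `InMemoryStore` is a hypothetical stand-in for the middleware and does not reproduce the DS2OS interface, and the phrases, addresses, and values in `COMMAND_MAP` are invented for the example.

```python
class InMemoryStore:
    """Hypothetical stand-in for the middleware; NOT the DS2OS API."""

    def __init__(self):
        self.values = {}

    def set(self, address, value):
        self.values[address] = value

    def get(self, address):
        return self.values.get(address)


# Assumed example mapping: recognized phrase -> (address, value to write).
COMMAND_MAP = {
    "open the blinds": ("/room/blinds/position", "open"),
    "close the blinds": ("/room/blinds/position", "closed"),
    "turn on the stove": ("/kitchen/stove/isOn", "1"),
    "turn off the stove": ("/kitchen/stove/isOn", "0"),
}


def normalize(text):
    """Lower-case the recognizer output and collapse whitespace."""
    return " ".join(text.lower().split())


def handle_utterance(text, store, command_map=COMMAND_MAP):
    """Map recognized text to an action and return a sentence for voice feedback."""
    phrase = normalize(text)
    action = command_map.get(phrase)
    if action is None:
        # Unknown phrase: change nothing and tell the user.
        return "Sorry, I did not understand that."
    address, value = action
    store.set(address, value)
    return f"OK, {phrase}."
```

Keeping the mapping in a plain dictionary means a configuration interface only has to add or remove entries; the lookup code stays unchanged.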


SLIDE 14

Implementation

  • 5 types of services
      • Configuration interface for command mapping


SLIDE 15

Evaluation

  • Test the system in different scenarios
      • Model house with different scenarios
      • Real room at university
  • Does the system fulfill the requirements?
      • Usability
      • Error rate

[http://pahl.de/?site=projects/experimentalS2oSpaces]
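The slide does not say how the error rate is measured. Two common options are sketched below as assumptions: a command error rate (the share of test commands that were recognized as something else) and a word error rate (word-level edit distance divided by the reference length). The function names are hypothetical and no measurement data from the evaluation is implied.

```python
def command_error_rate(expected, recognized):
    """Fraction of test commands whose recognized text differs from the expected text."""
    if not expected or len(expected) != len(recognized):
        raise ValueError("need two non-empty lists of equal length")
    errors = sum(1 for e, r in zip(expected, recognized) if e != r)
    return errors / len(expected)


def word_error_rate(reference, hypothesis):
    """Word-level edit distance (substitutions, insertions, deletions) over reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hyp) + 1))  # distances for the empty reference prefix
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,             # deletion of a reference word
                cur[j - 1] + 1,          # insertion of an extra word
                prev[j - 1] + (r != h),  # match or substitution
            ))
        prev = cur
    return prev[-1] / len(ref)
```

The command error rate suits a system with a fixed command set, since a command either triggers the intended action or it does not; the word error rate is the finer-grained measure usually reported for the recognizer itself.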

SLIDE 16

Thank you for your attention!
