Effective Open Source Speech Recognition in Your Application - - PowerPoint PPT Presentation

effective open source speech recognition in your
SMART_READER_LITE
LIVE PREVIEW

Effective Open Source Speech Recognition in Your Application - - PowerPoint PPT Presentation

Effective Open Source Speech Recognition in Your Application #kde-speech Peter Grasch peter@grasch.net The Basics Speech model Decoder Acoustic model Language model Sounds Vocabulary Grammar Open Source Speech Recognition


slide-1
SLIDE 1

Effective Open Source Speech Recognition in Your Application

#kde-speech Peter Grasch peter@grasch.net

slide-2
SLIDE 2

The Basics

Speech model Acoustic model Language model Decoder

  • Sounds
  • Vocabulary
  • Grammar
slide-3
SLIDE 3

Open Source Speech Recognition

Decoder Trainer UI CMU SPHINX

(PocketSphinx, SphinxTrain)

✓ ✓ Julius ✓ KALDI ✓ ✓ Simon ✓ ✓ ✓

slide-4
SLIDE 4

Standard Architecture

Simon Acoustic model Language model Simond Commands

?

Your application

slide-5
SLIDE 5

Standard Architecture

Simon Acoustic model Language model Your application Scenario Scenario Scenario Simond Commands

slide-6
SLIDE 6

Headless Architecture

Simon Acoustic model Language model Simond Commands Your application

slide-7
SLIDE 7

Embedded Architecture

Simon Acoustic model Language model Commands Your application Simond Decoder

slide-8
SLIDE 8

Standard Architecture

Simon Acoustic model Language model Your application Scenario Scenario Scenario Simond Commands

slide-9
SLIDE 9

Writing your Scenario

  • Lay out the commands you want to support
  • Create:

– Vocabulary – Grammar – Commands

slide-10
SLIDE 10

Writing your Scenario

Demonstration

slide-11
SLIDE 11

Tighter Integration: Write a Custom Command Plug-In

  • Full, programmatic control of the scenario
  • Meta information of recognition results:

– Phonetic transcriptions – Confidence scores* – Alternative results*

slide-12
SLIDE 12

Tighter Integration: Write a Custom Command Plug-In

Demonstration

slide-13
SLIDE 13

Q & A

#kde-speech Peter Grasch peter@grasch.net

slide-14
SLIDE 14

Thank you for your attention