SLIDE 1

Detecting Intoxicated Speech

Daniel Wilkey, John Graham
CS6998

SLIDE 2

Background

• Given speech, was the speaker intoxicated?
• Interspeech 2011 Intoxication Challenge
• Applications: field sobriety testing, ignition guards

SLIDE 3

The Corpus

• ALC – Alcohol Language Corpus
• 162 total participants: 84 male, 78 female
• Participants reached a BAC of 0.28–1.75 (per mille)
• Read 15 minutes of intoxicated speech
• Returned 2 weeks later
• Read 30 minutes of sober speech

SLIDE 4

The Corpus, part 2

• 5400 samples in total, 75 per person
• Divided into 3 sets: development, training, test
• Development & training sets are labeled
• Each sample is described by 4368 features
• Used cross-validation to obtain results
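Since the test set is unlabeled, results come from cross-validation over the labeled portion. A minimal sketch of k-fold splitting in plain Python (the fold count and sample IDs are illustrative, not from the slides):

```python
import random

def k_fold_splits(samples, k=5, seed=0):
    """Shuffle sample indices, then yield (train, held_out) lists for k folds."""
    idx = list(range(len(samples)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]          # round-robin fold assignment
    for i in range(k):
        held_out = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, held_out

# Every sample is held out exactly once across the k folds.
samples = list(range(20))
held = sorted(j for _, h in k_fold_splits(samples, k=5) for j in h)
```

Each fold's score is averaged to estimate performance without touching the unlabeled test set.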

SLIDE 5

Prior Research

• Shrikanth Narayanan of USC
  • Global speaker normalization
  • Normalizing by the sober class
  • Relative improvement of 7.04% overall
• Professor Hirschberg
  • Phonotactic and phonetic cues
• Experiment tests unweighted average recall… why?
• We chose F-measure
  • Includes both recall and precision
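The slides prefer F-measure over the challenge's unweighted average recall (UAR). Both can be computed from the same binary confusion matrix; the counts below are made-up for illustration:

```python
def scores(tp, fp, fn, tn):
    """F-measure and UAR for a binary intoxicated/sober decision."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)                  # recall of the positive class
    specificity = tn / (tn + fp)             # recall of the negative class
    f_measure = 2 * precision * recall / (precision + recall)
    uar = (recall + specificity) / 2         # unweighted average recall
    return f_measure, uar

# Hypothetical counts: 40 intoxicated correctly flagged, 10 false alarms,
# 20 missed, 130 sober samples correctly passed.
f, uar = scores(tp=40, fp=10, fn=20, tn=130)
```

UAR ignores precision entirely, which is why a metric combining precision and recall can rank systems differently.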

SLIDE 6

Experiment Preparation

• Remove extraneous features with WEKA
  • Info-gain ratio algorithm
• MFCC features performed well
• No F0-based features near the top
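WEKA supplies this ranking out of the box (its `GainRatioAttributeEval` evaluator); a minimal pure-Python sketch of the gain-ratio computation for one discrete feature, with toy labels of our own invention:

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy of a label sequence, in bits."""
    n = len(labels)
    return -sum(c / n * log2(c / n) for c in Counter(labels).values())

def gain_ratio(feature_values, labels):
    """Info-gain ratio of a discrete feature, as used for feature ranking."""
    n = len(labels)
    # Expected label entropy after splitting the data on the feature's value.
    remainder = 0.0
    for v in set(feature_values):
        subset = [l for f, l in zip(feature_values, labels) if f == v]
        remainder += len(subset) / n * entropy(subset)
    gain = entropy(labels) - remainder
    split_info = entropy(feature_values)     # penalizes many-valued features
    return gain / split_info if split_info else 0.0

# A feature that perfectly separates the classes gets ratio 1.0.
labels  = ["alc", "alc", "sober", "sober"]
perfect = ["hi", "hi", "lo", "lo"]
```

Real acoustic features are continuous, so WEKA discretizes them before ranking; the ranking itself works as above.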

SLIDE 7

Experiment Preparation

• Ignore the test set
  • Unlabeled
• Down-sample the training set
  • Achieved a 50/50 ratio of alcoholised to non-alcoholised speech
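Down-sampling to a 50/50 class balance just discards majority-class samples at random. A sketch, with a hypothetical label function and toy data:

```python
import random

def downsample(samples, label_of, seed=0):
    """Drop majority-class samples until all classes are equally frequent."""
    by_class = {}
    for s in samples:
        by_class.setdefault(label_of(s), []).append(s)
    n = min(len(v) for v in by_class.values())   # size of the smallest class
    rng = random.Random(seed)
    balanced = []
    for v in by_class.values():
        balanced.extend(rng.sample(v, n))        # keep n random samples per class
    return balanced

# Hypothetical imbalanced training set: 6 sober ("s") vs. 3 alcoholised ("a").
data = [("s", i) for i in range(6)] + [("a", i) for i in range(3)]
balanced = downsample(data, label_of=lambda s: s[0])
```

Balancing keeps the classifier from trivially favoring the majority (sober) class during training.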

SLIDE 8

Normalization Attempts

• Global speaker normalization (Narayanan)
  • Insignificant negative change
• Sober-class normalization (Narayanan)
  • Insignificant negative change
• Gender-class normalization
  • Insignificant positive change
• Combining global speaker with gender normalization
  • 10.75% relative improvement in F-measure
• Poor performance potentially related to some F0 features being filtered out
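All of these schemes are variations on z-scoring a feature within some group. A sketch assuming per-speaker grouping (per-gender or per-sober-class normalization is the same idea with a different grouping key; the speaker IDs and values are made up):

```python
from math import sqrt

def zscore_normalize(values_by_group):
    """Z-score each group's feature values: subtract the group mean,
    divide by the group standard deviation."""
    normalized = {}
    for group, values in values_by_group.items():
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        std = sqrt(var) or 1.0                   # guard against constant features
        normalized[group] = [(v - mean) / std for v in values]
    return normalized

# Toy single-feature values for two hypothetical speakers.
out = zscore_normalize({"spk1": [1.0, 2.0, 3.0], "spk2": [10.0, 10.0, 10.0]})
```

After normalization each group's feature is centered at zero, removing speaker- or gender-specific offsets before classification.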

SLIDE 9

On the Fringe

• Tried retesting with fringe cases omitted
• Fringe case of BAC between 0.08% and 0.16% proposed by Batliner
• We tried 0.02% to 0.08%
  • Difference due to data set and threshold
• Relative decrease of F-measure by 3.25%
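Omitting fringe cases is a simple filter on BAC. A sketch using the 0.02%–0.08% band the slides report trying (Batliner's proposal used 0.08%–0.16%); the sample tuples and accessor are hypothetical:

```python
def drop_fringe(samples, bac_of, low=0.02, high=0.08):
    """Remove samples whose BAC falls in the ambiguous fringe band."""
    return [s for s in samples if not (low <= bac_of(s) <= high)]

# Hypothetical (sample_id, BAC%) pairs.
data = [(1, 0.00), (2, 0.05), (3, 0.12)]
kept = drop_fringe(data, bac_of=lambda s: s[1])
```

The idea is that borderline-BAC speech is ambiguous and may only add label noise, though here the filter hurt F-measure.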

SLIDE 10

Machine Learning Optimizations

SLIDE 11

Optimizing the SVM

• Varied polynomial kernels
• Radial basis function (RBF)
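The two kernel families being varied are standard; in practice an off-the-shelf SVM (e.g. WEKA's SMO or libsvm) would be configured with them, but the kernel functions themselves are small enough to sketch (degree, coef0, and gamma values here are illustrative):

```python
from math import exp

def poly_kernel(x, y, degree=3, coef0=1.0):
    """Polynomial kernel (x . y + c)^d; the slides vary the degree d."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + coef0) ** degree

def rbf_kernel(x, y, gamma=0.5):
    """Radial basis function kernel exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return exp(-gamma * sq_dist)

k_poly = poly_kernel([1.0, 0.0], [1.0, 1.0])   # (1*1 + 0*1 + 1)^3
k_rbf  = rbf_kernel([0.0, 0.0], [0.0, 0.0])    # identical points
```

Varying the polynomial degree trades off decision-boundary flexibility against overfitting; RBF gives a non-parametric alternative.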

SLIDE 12

Optimization Techniques

• Varying the number of:
  • Folds
  • Iterations

SLIDE 13

Final Results

• Configuration:
  • SVM polynomial kernel, n = 3
  • 10-fold cross validation
  • Gender normalization
  • Sober-class normalization
• Difficult to compare!
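For reference, the final configuration from the slide can be collected in one place; this is just a plain dict whose key names are our own labels, not parameters from the original experiment:

```python
# The best-performing configuration reported on the slide (key names are
# our own hypothetical labels, not from the authors' setup).
final_config = {
    "classifier": "SVM",
    "kernel": "polynomial",
    "degree": 3,                      # "SVM kernel n=3"
    "cross_validation_folds": 10,
    "normalization": ["gender", "sober_class"],
}
```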

SLIDE 14

Conclusions / Extensions

• Difficult to compare results
• Need a better corpus
• Extend with GMM supervectors