Configuration and Management of Speaker Verification Systems


  1. Configuration and Management of Speaker Verification Systems. W3C Workshop on Speaker Biometrics and VoiceXML 3.0. Chuck Johnson, Architect, iBiometrics, Inc.

  2. Introduction. For peak performance of a Speaker Verification solution, the VoiceXML client (voice application) needs to be able to query and set the necessary initialization and configuration (setup) parameters, control the operation of Speaker Verification resources (engines), and interpret the verification results.
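
Those three capabilities (query/set setup parameters, control the engine, read back results) could look roughly like the minimal sketch below. The class and method names (SIVEngine, set_param, start_verification, and so on) are hypothetical illustrations, not part of the VoiceXML 3.0 specification or any vendor API.

```python
# Hypothetical sketch of a client-side SIV engine control interface.
class SIVEngine:
    def __init__(self):
        # Illustrative setup parameters; real engines expose vendor-specific keys.
        self._params = {"score.format": "normalized", "decision.mode": "pass_fail"}
        self._last_result = None

    def get_param(self, name):
        return self._params[name]

    def set_param(self, name, value):
        self._params[name] = value

    def start_verification(self, voice_model_id, audio):
        # A real engine would score the audio against the named voice model;
        # here we only illustrate the control flow and the shape of the result.
        self._last_result = {"decision": "pass", "score": 0.87}

    def result(self):
        return self._last_result

engine = SIVEngine()
print(engine.get_param("score.format"))                  # query a setup parameter
engine.set_param("decision.mode", "pass_fail_inconclusive")
engine.start_verification("model-1234", audio=b"...")
print(engine.result())                                   # interpret the outcome
```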

  3. Interpretation of Verification Results. Speaker Verification engines, depending on the vendor, are configured to return raw (numeric) verification scores, normalized verification scores, verification decisions, or some combination of scores and decisions - or error results. In addition, some engines return confidence scores. Some standardization of return data/info is necessary, e.g. a consistent range for normalized scores. The engine should return a basic (minimum) set of error results. The engine should return a pass/fail or a pass/fail/inconclusive decision.
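
As a concrete illustration, a standardized return structure might look like the sketch below, assuming a normalized score range of [0.0, 1.0] and a pass/fail/inconclusive decision. The field names and the minimum error set are assumptions for illustration, not taken from any engine.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional

class Decision(Enum):
    PASS = "pass"
    FAIL = "fail"
    INCONCLUSIVE = "inconclusive"

class VerificationError(Enum):
    # One possible basic (minimum) set of error results.
    NO_AUDIO = "no_audio"
    TOO_NOISY = "too_noisy"
    MODEL_NOT_FOUND = "model_not_found"
    ENGINE_UNAVAILABLE = "engine_unavailable"

@dataclass
class VerificationResult:
    decision: Decision
    normalized_score: float                   # agreed range, e.g. 0.0 to 1.0
    raw_score: Optional[float] = None         # vendor-specific, optional
    confidence: Optional[float] = None        # only some engines return this
    error: Optional[VerificationError] = None

result = VerificationResult(Decision.PASS, normalized_score=0.91, confidence=0.88)
```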

  4. Enrollment – Voice Model Creation. Traditionally (in voice model enrollment scenarios) the client application implements the enrollment dialog - manages the voice dialog, the error handling and the associated call flow. Within the context of an enterprise security framework, the client application should manage the enrollment process: start/stop/resume/abort enrollment, query enrollment status (in progress, aborted, etc.), and retrieve the enrollment outcome. Some engines support multiple modes of operation. The client should be able to query and set the mode of operation [for enrollment].
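
The enrollment controls listed above (start/stop/resume/abort, status query, outcome retrieval, and an operating-mode setting) could be modelled as in the sketch below. The names (EnrollmentSession, the status values, the mode labels) are hypothetical.

```python
# Hypothetical sketch of client-managed enrollment control.
class EnrollmentSession:
    def __init__(self, speaker_id, mode="text_dependent"):
        self.speaker_id = speaker_id
        self.mode = mode              # engines may support several modes of operation
        self.status = "not_started"   # not_started / in_progress / paused / aborted / complete
        self.outcome = None

    def start(self):
        self.status = "in_progress"

    def pause(self):
        self.status = "paused"

    def resume(self):
        self.status = "in_progress"

    def abort(self):
        self.status = "aborted"

    def add_utterance(self, audio):
        # A real engine accumulates enrollment audio until the model is trainable.
        pass

    def finish(self):
        self.status = "complete"
        self.outcome = {"voice_model_id": f"vm-{self.speaker_id}", "utterances_used": 3}

session = EnrollmentSession("user-42", mode="text_independent")
session.start()
session.add_utterance(b"...")
session.finish()
print(session.status, session.outcome)   # query status and retrieve the outcome
```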

  5. Voice Model Database Management. Set-up of the voice model (voiceprint) database entails: creation of the schema (tables), creation of database users, and establishment of rights and access privileges - tasks that are governed by enterprise guidelines and security policies. Those tasks are usually performed by a system administrator or DBA – not by the client application. Client applications may be able to manage the voice models: copy the voice model, delete the voice model and/or rename the voice model identifier.
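
The client-visible portion of that split (copy, delete, rename; everything else stays with the DBA) might reduce to an interface like the sketch below, where an in-memory dictionary stands in for the voiceprint store. All names are illustrative.

```python
# Hypothetical sketch of the voice model operations a client may be permitted to perform.
class VoiceModelStore:
    def __init__(self):
        self._models = {}                       # model_id -> voiceprint blob

    def add(self, model_id, voiceprint):
        self._models[model_id] = voiceprint

    def copy(self, src_id, dst_id):
        self._models[dst_id] = self._models[src_id]

    def delete(self, model_id):
        del self._models[model_id]

    def rename(self, old_id, new_id):
        self._models[new_id] = self._models.pop(old_id)

store = VoiceModelStore()
store.add("vm-user-42", b"<voiceprint>")
store.copy("vm-user-42", "vm-user-42-backup")
store.rename("vm-user-42", "vm-account-1001")
```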

  6. Distinct User Populations. Many SIV applications have distinct user populations, e.g. Financial Services, Community Corrections, and Social Services. These populations (groups) include children, females, ethnic groups, regional speakers, and application-specific groups. Some world [background] models are not optimized for distinct user populations. Custom or group-specific background models can improve verification performance (accuracy). Client applications should be able to utilize custom or group-specific background models and, optionally, update (adapt) group-specific background models.
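
Selecting a group-specific background model instead of a single world model could be as simple as the lookup sketched below; the group labels, model identifiers, and fallback rule are assumptions for illustration.

```python
# Hypothetical mapping from user population (group) to background model.
BACKGROUND_MODELS = {
    "default": "world_model_v1",
    "children": "ubm_children_v2",
    "regional_southern": "ubm_regional_southern_v1",
}

def background_model_for(group: str) -> str:
    # Fall back to the general world model when no group-specific model exists.
    return BACKGROUND_MODELS.get(group, BACKGROUND_MODELS["default"])

print(background_model_for("children"))            # ubm_children_v2
print(background_model_for("financial_services"))  # world_model_v1 (fallback)
```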

  7. Different Classes of Users. In Financial Services Applications, access to different features of the service may have a higher (or lower) security setting. In Corrections Applications, different classes of users will have different security settings or levels. The users are often put into classes based on risk – typically high, medium, and low. Client applications should be able to: query and set the current operating point (security level), query and set unsupervised adaptation thresholds, and manage and control supervised adaptation.
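
A per-class security configuration could look like the sketch below, where each risk class maps to an operating point (decision threshold) and an unsupervised-adaptation threshold. The numeric values are placeholders, not recommendations.

```python
# Hypothetical per-risk-class operating points and adaptation thresholds.
OPERATING_POINTS = {
    "high":   {"decision_threshold": 0.90, "adaptation_threshold": 0.97},
    "medium": {"decision_threshold": 0.80, "adaptation_threshold": 0.93},
    "low":    {"decision_threshold": 0.70, "adaptation_threshold": 0.90},
}

def decide(risk_class: str, normalized_score: float) -> str:
    op = OPERATING_POINTS[risk_class]
    if normalized_score >= op["decision_threshold"]:
        return "pass"
    return "fail"

print(decide("high", 0.85))    # fail at the high-security operating point
print(decide("low", 0.85))     # pass at the low-security operating point
```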

  8. Voice Model Adaptation. The idea of voice model adaptation is not intuitive! In the ‘not so distant past’ there were articles saying that voice model adaptation was not always needed, or questioning the efficacy of the adaptation process. Numerous articles/reports from vendors and industry experts have since clearly demonstrated the need for, and effectiveness of, voice model adaptation. The client application should be able to query unsupervised adaptation settings, enable/disable adaptation, query the adaptation outcome (result) and, optionally, set the adaptation threshold and roll back adaptation [from the last turn]. The client should manage supervised adaptation: control audio buffers, make adaptation requests, and, optionally, roll back adaptation [from the last turn].
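
The adaptation controls listed above (enable/disable unsupervised adaptation, set its threshold, buffer audio for supervised requests, query the outcome, and roll back the last turn) are sketched below. The class, the list-append stand-in for real model adaptation, and all names are hypothetical.

```python
# Hypothetical sketch of client-controlled voice model adaptation.
class AdaptationController:
    def __init__(self):
        self.unsupervised_enabled = True
        self.threshold = 0.95            # only adapt on very confident verifications
        self.audio_buffer = []           # audio held back for supervised adaptation requests
        self._previous_model = None
        self.last_outcome = None

    def maybe_adapt(self, model, score, audio):
        if self.unsupervised_enabled and score >= self.threshold:
            self._previous_model = model         # keep the last turn so it can be rolled back
            model = model + [audio]              # stand-in for real model adaptation
            self.last_outcome = "adapted"
        else:
            self.last_outcome = "skipped"
        return model

    def rollback(self):
        # Undo the adaptation from the last turn, if any.
        return self._previous_model

ctrl = AdaptationController()
model = ctrl.maybe_adapt(model=[b"base"], score=0.97, audio=b"utt-1")
print(ctrl.last_outcome)       # adapted
model = ctrl.rollback()        # restore the pre-adaptation model
```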

  9. Solution Architecture Issues ** Optional Slide **. Time permitting, I may give a brief presentation of interface, security and architectural issues associated with ‘loosely coupled’ SIV systems. These are systems where most or all of the SIV components/resources (app server, data store, voice interpreter and/or SIV engine) are distributed across multiple systems, across the enterprise, or across multiple enterprises.

  10. Summary and Wrap up
