SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, - PowerPoint PPT Presentation

SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, 2007

Overview • SSML 1.1 Charter • Goals of SSML 1.1 • Development process • Requirements (so far) • Specification changes (so far)

SSML 1.1 Charter • Extend SSML 1.0 to – Provide enhanced language support – Fix incompatibilities with other VBWG specs • Out of scope are – VCR- like audio controls – < say- as> changes (but not requirements) (See Requirements Sec. 1.2 for details)

My goals for SSML 1.1 • Satisfy the charter – Making only minimal changes to SSML 1.0 – While satisfying the subgroup members • Defer out- of- scope changes/ requests to SSML 2.0 (potentially a re- write)

Development process • Review topics from Workshops • Categories & Categorization • Write concise problem statements • Develop requirements from problem statements • Agree on major points of proposals to address requirements • Write proposals

Topic review • Review all the topics from the workshops • Assign each to a category: – Short- term (will work on these now) – Long- term (will revisit when short- term is done) – Experts needed (we will only work on this if experts in the related languages join the working group) – Other SSML work (say- as, SSML 2.0 items, etc.)

Categories and categorization Experts needed Long- term (after (in order to Short- term (group short- term work Other SSML work (SSML make decision agrees to work on will revisit to 2.0 or later, < say- to work on this) determine if as> Note, etc. this in this belongs in group) subgroup) Providing number, Background sound (may Token/ word Expand Part- Of- Speech case, gender be handled best by boundaries support agreement info VoiceXML3 work) Ruby Tones Syllable markup Tone sandhi Enhance prosody rate to Verify that RFC3066 Text with multiple include "speech units language languages per time unit" where categories are (changing xml:lang speech units would complete enough without changing Diacritics, SMS text, be syllable, mora, that we do not voice; separately simplified/ alter phoneme, foot, etc. need anything new specifying nate text and time unit would beyond xml:lang to language of be seconds, ms, identify languages content and minutes, etc.(would and dialects language to speak) address mora/ sec request) Sub- word unit Chinese names (say- as demarcation Special words requirements) and annotation Phonetic alphabets Expressive elements Sentence structure

Problem statements • Statement for “xml:lang” topic – The xml:lang attribute in SSML is the only way to identify the language. It represents both the natural (human) language of the text content and the natural (human) language the synthesis processor is to produce. For languages whose scripts are ideographs rather than pronunciation- related, we are not sure that the permitted values for xml:lang, as specified by RFC3066, are detailed enough to distinguish among languages (and their dialects) that use the same ideographs.

Requirements • Requirements for “xml:lang” problem statement: – SSML 1.1 must ensure the use of a version of xml:lang that uses the successor specification to [RFC3066] (for example, [BCP47]). – SSML 1.1 must clearly state that the 'xml:lang' attribute identifies the language of the content. – SSML 1.1 must clearly state that processors are expected to determine how to render the content based on the value of the 'xml:lang' attribute and must document expected rendering behavior for the xml:lang values they support. – SSML 1.1 must specify that selection of xml:lang and voice are independent. It is the responsibility of the TTS vendor to decide and document which languages are supported by which voices and in what way.

Major points • Approach – We will modify the descriptions of xml:lang and < voice> to clarify that language and voice values may be set separately. Synthesis processors should document supported combinations of voice and language and the behavior for unsupported combinations. • Proposal 1 – create/ use a new element that can be used to set xml:lang for text sizes between sentence and word. Its sole function is language annotation. Modify existing voice element description to clearly separate language setting and voice setting and require one of name, age, gender, or variant to be set. – < s xml:lang= "zh- cmn- HK"> foo < lang xml:lang= "en- GB"> bar < / lang> < / s> – Major points: • In addition to what it says in the approach, update voice selection algorithm appropriately. – Volunteer: Paul has lead. J erry Carter and Dan Burnett can help. • Proposal 2 – modify existing voice element description to clearly separate language setting and voice setting but still permit both to occur with this element. – < s xml:lang= "zh- cmn- HK"> foo < voice xml:lang= "en- GB"> bar < / voice> < / s> – Major points: • In addition to what it says in the approach, update voice selection algorithm appropriately. – Volunteer: Lou Xiaoyan

Proposals • Detailed text proposals are folded into the specification and reviewed. • Outstanding issues are noted in the specification for later discussion.

Requirements so far • General requirements • Speech Interface Framework requirements – Requirements needed to interoperate with other Voice Browser Working Group specifications • Language- related requirements – These have been the primary focus of the subgroup

Speech Interface Framework requirements • Caching • error messages • type attribute on < audio> • VoiceXML VCR control support • Lexicon activation control • prefetching support • external reference to < p> , < s> , < w>

Language- related requirements • Token/ word boundary requirements – Mechanism to disambiguate word boundaries – Mechanism to indicate the language of the word – Mechanism to indicate lexicon entry to use for the word • Phonetic Alphabet and Pronunciation Script Requirements – Registry for alternative pronunciation alphabets • Language Category requirements – Successor to RFC3066 – xml:lang requirements • Name/ proper noun id requirements (future) – Identify content as proper noun – Identify content as name – Identify name sub- contant as surname

Specification changes so far • < w> element • < lang> element and lang- voice attribute • pronunciation alphabet registry • < lookup> element • < role> attribute

SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, - PowerPoint PPT Presentation

SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, 2007 Overview SSML 1.1 Charter Goals of SSML 1.1 Development process Requirements (so far) Specification changes (so far) SSML 1.1 Charter Extend SSML 1.0 to

SSML for Indian Languages Text to Speech Synthesis Presented by: Vibhu Agarwal President and co-

SSML for Urdu Speech Synthesis Sarmad Hussain Professor and Head Center for Research in Urdu

Pronunciation Lexicon Background Outline Brief Introduction on Pronunciation Lexicon

Statistical Analysis for M edical and Public Health Data Qazvin University of M edical Sciences

Spatio-temporal correlations across the melting of 2 D Wigner molecules Amit Ghosal IISER

Relational Learning Expressive Background Knowledge can be incorporated easily

Early Action in GHG Mitigation and Role of Information Disclosure Mechanisms Donna Ramirez

Skeletons Animated characters are usually built on top of an underlying skeleton The

2. Cross products Lets suppose you want to calculate the area of a polygon in the plane.

Projections MCV4U: Calculus & Vectors In the real world, a projection occurs when an object

Scalar and vector fields What is a field in mathematics? Roughly speaking, a field defines how a

Review: integrating vector fields along curves Fix r > 0 and let C r = { x 2 + y 2 = r 2 } ,

Today Johnson-Lindenstrass Random subspace. Points: x 1 ,..., x n R d . Method 1: Random k =

MA 105: Calculus Lecture 20 . Prof. B.V. Limaye IIT Bombay Friday, 17 March 2017 B.V. Limaye

Spatial Data: 3D Scalar Fields CSC444 Recap: 2D contouring

Last time: vector fields Let C be the line segment from (0 , 0) to (1 , 2). Consider the vector

Morse-Conley theory for combinatorial vector fields P Q L M N O G H I J K C D E F A

Some results on GAN dynamics Ioannis Mitliagkas Game dynamics are weird fascinating Start with

Math 5490 11/10/2014 Dynamical Systems Math 5490 Nonlinear Systems November 10, 2014 dx dt

HAMILTONIAN VECTOR FIELDS, OBSERVABLES AND LIE SERIES Bronis law Jakubczyk Institute of

Homotopy Poisson actions Rajan Mehta November 8, 2010 Conventional perspectives Definition A

Effective Field Theory of Anisotroic Inflation Asieh Karami IPM - School of Astronomy GC2018

Complex manifolds of dimension 1 lecture 17: Riemann-Hilbert correspondence Misha Verbitsky

Dynamical vector fields on the manifold of quantum states G. Marmo Universit di Napoli

Sambuz

Useful Links

Newsletter

Mail Us

SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, - PowerPoint PPT Presentation

SSML 1.1 Daniel C. Burnett Nuance Communications J anuary 13, 2007 Overview SSML 1.1 Charter Goals of SSML 1.1 Development process Requirements (so far) Specification changes (so far) SSML 1.1 Charter Extend SSML 1.0 to

SSML for Indian Languages Text to Speech Synthesis Presented by: Vibhu Agarwal President and co-

SSML for Urdu Speech Synthesis Sarmad Hussain Professor and Head Center for Research in Urdu

Pronunciation Lexicon Background Outline Brief Introduction on Pronunciation Lexicon

Statistical Analysis for M edical and Public Health Data Qazvin University of M edical Sciences

Spatio-temporal correlations across the melting of 2 D Wigner molecules Amit Ghosal IISER

Relational Learning Expressive Background Knowledge can be incorporated easily

Early Action in GHG Mitigation and Role of Information Disclosure Mechanisms Donna Ramirez

Skeletons Animated characters are usually built on top of an underlying skeleton The

2. Cross products Lets suppose you want to calculate the area of a polygon in the plane.

Projections MCV4U: Calculus &amp; Vectors In the real world, a projection occurs when an object

Scalar and vector fields What is a field in mathematics? Roughly speaking, a field defines how a

Review: integrating vector fields along curves Fix r &gt; 0 and let C r = { x 2 + y 2 = r 2 } ,

Today Johnson-Lindenstrass Random subspace. Points: x 1 ,..., x n R d . Method 1: Random k =

MA 105: Calculus Lecture 20 . Prof. B.V. Limaye IIT Bombay Friday, 17 March 2017 B.V. Limaye

Spatial Data: 3D Scalar Fields CSC444 Recap: 2D contouring

Last time: vector fields Let C be the line segment from (0 , 0) to (1 , 2). Consider the vector

Morse-Conley theory for combinatorial vector fields P Q L M N O G H I J K C D E F A

Some results on GAN dynamics Ioannis Mitliagkas Game dynamics are weird fascinating Start with

Math 5490 11/10/2014 Dynamical Systems Math 5490 Nonlinear Systems November 10, 2014 dx dt

HAMILTONIAN VECTOR FIELDS, OBSERVABLES AND LIE SERIES Bronis law Jakubczyk Institute of

Homotopy Poisson actions Rajan Mehta November 8, 2010 Conventional perspectives Definition A

Effective Field Theory of Anisotroic Inflation Asieh Karami IPM - School of Astronomy GC2018

Complex manifolds of dimension 1 lecture 17: Riemann-Hilbert correspondence Misha Verbitsky

Dynamical vector fields on the manifold of quantum states G. Marmo Universit di Napoli

Sambuz

Useful Links

Newsletter

Mail Us

Projections MCV4U: Calculus & Vectors In the real world, a projection occurs when an object

Review: integrating vector fields along curves Fix r > 0 and let C r = { x 2 + y 2 = r 2 } ,