SLIDE 1

Fast Two–Level HMM Decoding Algorithm for Large Vocabulary Handwriting Recognition

Alessandro L. Koerich, Robert Sabourin & Ching Y. Suen

Pontifical Catholic University of Paraná (PUCPR), Brazil École de Technologie Supérieure, Université du Québec, Canada CENPARMI, Concordia University, Canada

9th International Workshop on Frontiers in Handwriting Recognition, Tokyo, Japan October 2004

SLIDE 2

Outline

  • Motivation & Challenge
  • Background on LVHR
  • Goal
  • Methodology
  • Handwriting Recognition System
  • Fast Two–Level HMM Decoding Algorithm
  • Experimental Results
  • Summary, Conclusion & Future Work
SLIDE 3

Motivation

  • A baseline off-line handwriting recognition system developed by A. El–Yacoubi in 1998 at the SRTP had the following performance:

100–word vocabulary
  – Recognition rate: 95.89% (4,481 out of 4,674 words)
  – Speed: 2 sec/word

30,000–word vocabulary
  – Recognition rate: 73.70% (3,445 out of 4,674 words)
  – Speed: 8.2 min/word

26 days for the whole test set!

SLIDE 4

Large Vocabulary Handwriting Recognition (LVHR)

  • Most of the research in handwriting recognition has focused on relatively simple problems → less than 100 classes:

  – digits (10 classes)
  – characters (26 to 52 classes)
  – words (up to 100 words)

  • Moving from a few classes to a large number of classes (> 1,000) is a real challenge.

SLIDE 5

Large Vocabulary Handwriting Recognition (LVHR)

  • Most of the classification algorithms currently used in handwriting recognition are not suitable for a large number of classes.

  • Few large datasets exist to allow training and performance evaluation.

  • Few results have been reported in the literature.
SLIDE 6

Large Vocabulary Handwriting Recognition (LVHR)

SLIDE 7

Current Methods for LVHR [speed]

  • Lexicon pruning (prior to the recognition)
  – Application environment
  – Word length and shape

  • Organization of the search space
  – Lexical tree vs. flat lexicon

  • Search strategy
  – Viterbi beam search
  – A*
  – Multi–pass

Most of these methods are not very efficient and/or they introduce errors that affect the recognition accuracy.

SLIDE 8

Current Methods for LVHR [accuracy]

  • Improvements in accuracy are associated with:
  – Feature set
  – Modeling of reference patterns
  – More than one model for each character class
  – Combination of different feature sets / classifiers

The complexity of the recognition process has been steadily increasing with the recognition accuracy.

SLIDE 9

Challenge

  • We have to account for two aspects that are in mutual conflict: recognition speed and recognition accuracy!

  • Is it possible to overcome the accuracy and speed problems to make large vocabulary off-line handwriting recognition feasible?

SLIDE 10

Challenge

  • It is relatively easy to improve the recognition speed while trading away some accuracy.

  • But it is much harder to improve the recognition speed while preserving (or even improving) the original accuracy.
SLIDE 11

Goal

  • To address the problems related to accuracy and speed

  • Build an off–line handwritten word recognition system which has the following characteristics:
  – Omniwriter (writer independent)
  – Very–large vocabulary (80,000 words)
  – Unconstrained handwriting (cursive, handprinted, mixed)
  – Acceptable recognition accuracy
  – Acceptable recognition speed

SLIDE 12

Methodology

  • Build a lexicon-driven LV handwritten word recognition system based on HMMs to generate a list of N–best word hypotheses as well as the segmentation of such word hypotheses into characters.

  • Problem: Current decoding algorithms are not efficient enough to deal with large vocabularies.

  • Solution: Speed up the recognition process using a novel decoding strategy that reduces repeated computation and preserves the recognition accuracy.

SLIDE 13

Methodology

  • The idea is to take into account particular aspects of the handwriting recognition system:
  – Architecture of the hidden Markov models (characters)
  – Feature extraction and segmentation (perceptual features)
  – Lexicon-driven approach

SLIDE 14

Handwriting Recognition System

  • Segmentation–recognition approach

  • Lexicon–driven approach where character HMMs are concatenated to build up words according to the lexicon

  • Global recognition approach to account for unconstrained handwriting

[Figure: character HMMs (e.g. P/p, A/a, R/r, I/i, S/s, B, E) concatenated into word models]

SLIDE 15

Handwriting Recognition System

SLIDE 16

Conventional Approach

  • Given:
  – An input word
  – A lexicon with V words
  – Character HMMs (a-z, A-Z, 0-9, symbols)

  1. Extract features from the input word.
  2. Build up a word HMM for a word in the lexicon.
  3. Align the sequence of features (observation sequence) with the word HMM.
  4. Decode the word HMM (estimate a confidence score).
  5. Repeat from Step 2 until all words in the lexicon are decoded.
  6. Select the words that provide the highest confidence scores.
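The steps above amount to one independent alignment per lexicon entry. A minimal sketch of that loop, where `build_word_hmm` and `viterbi_score` are hypothetical placeholders for the system's actual model-concatenation and HMM-alignment routines (not functions from the paper):

```python
def conventional_decode(obs, lexicon, build_word_hmm, viterbi_score, n_best=10):
    """Score every lexicon word independently against the observation
    sequence obs, then rank the words by confidence score."""
    scored = []
    for word in lexicon:                       # step 5: loop over the lexicon
        word_hmm = build_word_hmm(word)        # step 2: concatenate char HMMs
        score = viterbi_score(obs, word_hmm)   # steps 3-4: align and decode
        scored.append((score, word))
    scored.sort(reverse=True)                  # step 6: rank by confidence
    return scored[:n_best]                     # N-best word hypotheses
```

Note that every call to `viterbi_score` re-decodes the full word HMM from scratch, which is exactly the redundancy the fast two-level algorithm later removes.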

SLIDE 17

Conventional Approach

[Figure: the observation sequence extracted from the input word image ("Es-sCu|") is matched against the lexicon entry "BYE", built by concatenating the character HMMs B, Y, E]

SLIDE 18

Conventional Approach

[Figure: alignment of the observation sequence "Es-sCu|" with the word HMM for "BYE", yielding the likelihood P(O|w) = P("Es-sCu|" | "BYE")]

SLIDE 19

Conventional Approach (Shortcomings)

  • We have observed that there is a great amount of repeated computation during the decoding of words in the lexicon.

  • The current algorithms decode an observation sequence in a time–synchronous fashion.

  • The probability scores of a character within a word depend on the probability scores of the immediately preceding character.

SLIDE 20

Character HMMs

[Figure: the set of character HMMs (e.g. P/p, A/a, R/r, I/i, S/s, B, E) shared across all words in the lexicon]

SLIDE 21

Fast Two–Level HMM Decoding Algorithm

  • Main ideas:
  – Avoid repeated computation of state sequences
  – Reusability of character likelihoods
  – Context independent (lexicon)

SLIDE 22

Fast Two–Level HMM Decoding Algorithm

During recognition, is it possible to decode the character "a" only once, since it is always represented by the same character model?

SLIDE 23

Fast Two–Level HMM Decoding Algorithm

  • To solve this problem of repeated computation, a novel algorithm that breaks up the decoding of words into two levels is proposed:

  – First Level: Character HMMs are decoded considering each possible entry and exit point in the trellis, and the results are stored into arrays.
  – Second Level: Words from the lexicon are decoded reusing the results of the first level. Only character boundaries are decoded.

SLIDE 24

FTLDA: First Level

  • The idea is to avoid repeated computation.
  • We evaluate the matching between O and each λ.
  • Assume that each λ has a single initial state (entry) and a single final state (exit).
  • Compute the best state sequences between the initial state and the final state, considering a single beginning frame (b) and all possible ending frames (e).
  • Store in an array the best state sequences and probabilities of all pairs of beginning and ending frames, PA(b,e).
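The first level can be sketched as one Viterbi pass per beginning frame, filling the PA(b,e) array for a single character HMM. This is a simplified illustration under assumed conventions (log-space scores, a left-to-right topology with entry state 0 and exit state N−1), not the paper's exact implementation:

```python
import math

def first_level(obs_loglik, log_A):
    """For one character HMM, compute the best-path log score PA[(b, e)]
    for every beginning frame b and ending frame e, independent of any
    lexical context.

    obs_loglik[t][s]: log-likelihood of frame t under state s.
    log_A[s][s2]: log transition probability from state s to state s2.
    """
    T = len(obs_loglik)
    N = len(log_A)
    PA = {}
    for b in range(T):
        # One Viterbi pass starting at frame b fills PA[(b, e)] for all e >= b.
        delta = [-math.inf] * N
        delta[0] = obs_loglik[b][0]          # enter at state 0
        PA[(b, b)] = delta[N - 1]            # one-frame span, exit state N-1
        for e in range(b + 1, T):
            new = [-math.inf] * N
            for s2 in range(N):
                best = max(delta[s] + log_A[s][s2] for s in range(N))
                new[s2] = best + obs_loglik[e][s2]
            delta = new
            PA[(b, e)] = delta[N - 1]        # score of exiting at frame e
    return PA
```

Each character model is decoded once per input, regardless of how many lexicon words contain that character.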

SLIDE 25

FTL HMM Decoding Algorithm: First Level

[Figure: trellis for one character HMM with beginning frame b = 1 and ending frames e = 3 … 7; for each character HMM (A, B, …, Z), the probabilities P(b,e) of all valid (b,e) pairs — P(1,3) … P(5,7) — are computed and stored]

  • We end up with arrays of best state sequences and probabilities for each character HMM.

  • They are independent of the context (position within the word).

  • Reuse "pre-decoded" characters to decode any word.

[Figure: per-character arrays indexed by (beginning frame, ending frame) pairs from (1,1) to (T,T)]

SLIDE 26

FTLDA: Second Level

  • The idea is to use the "pre-decoded" characters.
  • The problem now is to find the word in the lexicon that best matches the observation sequence O.
  • Words are formed by the concatenation of single character HMMs.
  • Words are decoded from left to right.
  • Words have well-defined beginnings (b = 1) and terminations (e = T).

SLIDE 27

FTL HMM Decoding Algorithm: Second Level

[Figure: decoding the lexicon word "ABY" by looking up the pre-decoded (b,e) arrays of the character HMMs A, B, …, Z]

δ̂_l(b) = max_{1 ≤ t ≤ b} [ δ̂_{l−1}(t − 1) · χ_l(t, b) ]

where δ̂_l(b) is the best score for the first l characters of the word ending at frame b, and χ_l(t, b) is the pre-decoded first-level probability of the l-th character spanning frames t to b.

We do not need to decode HMM states, which is the most time-consuming computation during the decoding (N²T).
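With the first-level arrays in hand, decoding a lexicon word reduces to a dynamic program over character boundaries only. A sketch under assumed conventions (log-space scores; `PA` maps each character to a dictionary of (b, e) frame pairs as produced at the first level), not the paper's exact code:

```python
import math

def second_level(word, PA, T):
    """Decode one lexicon word over T frames by combining pre-decoded
    character scores at character boundaries only.

    Implements delta_l(b) = max_t [ delta_{l-1}(t-1) + PA_l(t, b) ]
    in log space, with the word starting at frame 1 and ending at frame T
    (frames are 1-based here; PA uses 0-based frame indices).
    """
    # delta[b]: best log score of the first l characters ending at frame b.
    delta = [0.0] + [-math.inf] * T      # delta[0] anchors the word start
    for ch in word:
        new = [-math.inf] * (T + 1)
        for b in range(1, T + 1):
            for t in range(1, b + 1):
                score = PA[ch].get((t - 1, b - 1), -math.inf)
                if delta[t - 1] + score > new[b]:
                    new[b] = delta[t - 1] + score
        delta = new
    return delta[T]                      # the word must end at the last frame
```

The inner loops touch only frame boundaries (O(T²) per character), never individual HMM states, which is where the speedup comes from.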

SLIDE 28

Recognition System Based on HMMs

  • The output of the Word Recognition System Based on HMMs is a list with the N–best word hypotheses, the segmentation of such word hypotheses into characters, and their likelihoods.

SLIDE 29

Experiments

  • 70 HMMs (a-z, A-Z, 0-9, symbols)
  • Global Lexicon: 85,092 city names
  • Test dataset: 4,674 unconstrained words (city names)

  • Platform: AMD Athlon 1.1GHz running Linux
SLIDE 30

Performance on the Test Dataset

15 times faster . . .

SLIDE 31

Summary of Speed Improvements

T: length of the observation sequence (30); N: number of HMM states (10); M: number of character HMMs (70); L: average word length (11); V: lexicon size (80,000)

  • V is the dominant variable.
  • T appears quadratically, but its magnitude is low.

The FTLDA is advantageous while T < N².
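These variables can be turned into rough operation counts. The cost formulas below are an illustrative assumption consistent with the slide, not the paper's exact analysis: conventional decoding pays roughly N² transitions per frame, per character, per word, while the two-level scheme pays T² (entry, exit) frame pairs per character model at the first level and T² boundary pairs per character slot at the second level.

```python
def op_counts(T, N, M, L, V):
    """Rough (assumed) operation counts for the two decoding schemes.

    T: observation length, N: HMM states, M: character HMMs,
    L: average word length, V: lexicon size.
    """
    conventional = V * L * N * N * T               # full Viterbi per word
    two_level = M * N * N * T * T + V * L * T * T  # first + second level
    return conventional, two_level

# With the slide's values the second-level term dominates, and the
# per-word advantage is roughly N^2 / T: the two-level scheme wins
# whenever T < N^2, matching the condition stated above.
conv, ftl = op_counts(T=30, N=10, M=70, L=11, V=80_000)
```

Under these assumed formulas, raising T to N² = 100 erases the advantage, which is why the slide flags T < N² as the regime where the FTLDA pays off.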

SLIDE 32

Conclusion

  • We have built an omniwriter off–line handwritten word recognition system that deals efficiently with large and very–large vocabularies, unconstrained handwriting styles, and runs on personal computers with an acceptable performance.

BEFORE (80,000–word lexicon)
  • Accuracy: 68.65%
  • Speed: 3.6 min/word

NOW (80,000–word lexicon)
  • Accuracy: 68.65%
  • Speed: 14.46 sec/word
SLIDE 33

Summary

  • How to improve the recognition speed while preserving the recognition rate

Problem: avoid the repeated computation of the probabilities of the same character HMMs given an observation sequence.

Idea: break up the computation of characters and words.

Solution: the Fast Two–Level HMM Decoding Algorithm.

Results: speedup of the recognition process (up to 15x) while maintaining exactly the same recognition rate.

SLIDE 34

Conclusions and Implications

  • The Fast Two–Level HMM Decoding Algorithm has significantly sped up the recognition process without affecting the accuracy.

  • Space and Time Tradeoff
  • Modularity and Reusability
  • The results obtained are still far from meeting the throughput requirements of many applications.
  • The proposed algorithm can be naturally mapped to parallel processing (SMP and clusters).
  • Shortcoming: It is not a general approach.

SLIDE 35

Future Work

  • Use heuristics in the fast two–level HMM decoding algorithm to further speed up the recognition process.