Markov Models for Handwriting Recognition
DAS 2012 Tutorial, Gold Coast, Australia (PowerPoint PPT Presentation)


SLIDE 1

Markov Models for Handwriting Recognition

— DAS 2012 Tutorial, Gold Coast, Australia —

Gernot A. Fink, TU Dortmund University, Dortmund, Germany. March 26, 2012

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 2

Why Should Machines Be Able to Read?

Because it’s cool? ... but probably not cool enough!
For Automation in document processing, e.g.:
◮ Reading of addresses
◮ Analysis of forms
◮ Classification of business mail pieces
◮ Archiving & retrieval
For Communication with humans (≙ man-machine interaction), e.g. on small, portable devices (smartphones, tablet PCs)
As Support in, e.g., automatically reading business cards

SLIDE 3

Why Handwriting?

In Communication: Interactivity required! ⇒ Capturing of the pen trajectory online
In Automation: Capturing of the script image offline
◮ Postal addresses: 10–20% handwritten (more before Christmas; trend: increasing!) [Source: M.-P. Schambach, Siemens]
◮ Forms (money transfers, checks, ...)
◮ Historical documents (letters, reports from the administration)
⇒ Handwriting — still going strong!

SLIDE 4

Why is Handwriting Recognition Difficult?

◮ Considerable freedom in the script appearance
Typical handwriting ≙ cursive writing; also: “hand-printed” characters; mostly: a combination ≙ unconstrained writing ...
◮ Large variability of individual symbols
◮ Writing style
◮ Stroke width and quality
◮ Considerable variations even for the same writer!
◮ Segmentation problematic (especially for cursive writing): “merging” of neighboring symbols

SLIDE 5

Focus of this Tutorial

Processing type: Offline (documents captured by scanner or camera)
Script type & writing style:
◮ Alphabetic scripts, especially Roman script
◮ No restriction w.r.t. writing style, size etc.
⇒ Unconstrained handwriting!
Methods: Statistical recognition paradigm
◮ Markov models for segmentation-free recognition
◮ Statistical n-gram models for text-level restrictions
Goal: Understand ...
◮ ... the concepts and methods behind Markov-model-based recognizers and ...
◮ ... how these are applied in handwriting recognition.
With self-study materials:
◮ Build a working handwriting recognizer using ESMERALDA.

SLIDE 6

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Motivation . . . Why MM-based HWR?
◮ Data Preparation . . . Preprocessing and Feature Extraction
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 7

“Traditional” Recognition Paradigm

Segmentation + Classification:

[Figure: original word image and alternative segmentations into potential elementary segments (1., 2., ..., n.), strokes, ...]

Segment-wise classification is possible using various standard techniques
✗ Segmentation is
◮ costly,
◮ heuristic, and
◮ needs to be optimized manually
✗ Segmentation is especially problematic for unconstrained handwriting!

SLIDE 8

Statistical Recognition Paradigm: The Channel Model

(Model originally proposed for automatic speech recognition)

[Figure: channel model. Source: text production w → Channel: script realization, feature extraction X → Recognition: statistical decoding ŵ = argmax_w P(w|X)]

Wanted: The sequence of words/characters ŵ that is most probable for the given signal/features X:

ŵ = argmax_w P(w|X) = argmax_w P(w) · P(X|w) / P(X) = argmax_w P(w) · P(X|w)

SLIDE 9

The Channel Model II

ŵ = argmax_w P(w|X) = argmax_w P(w) · P(X|w) / P(X) = argmax_w P(w) · P(X|w)

Two aspects of modeling:

◮ Script (appearance) model P(X|w) ⇒ representation of words/characters: hidden Markov models
◮ Language model P(w) ⇒ restrictions for sequences of words/characters: Markov chain models / n-gram models

Specialty: Script, or the trajectories of the pen (or features, respectively), is interpreted as temporal data
Segmentation is performed implicitly! ⇒ “segmentation-free” approach

! Script or pen movements, respectively, must be serialized!

SLIDE 10

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Motivation . . . Why MM-based HWR?
◮ Data Preparation . . . Preprocessing and Feature Extraction
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 11

Preprocessing I

Assumption: Documents are already segmented into text lines (text detection and line extraction are highly application-specific!)
Baseline estimation, potential method:
◮ Initial estimate based on the horizontal projection histogram
◮ Iterative refinement and outlier removal (cf. [2, 10])
Skew and displacement correction: [figure omitted]

SLIDE 12

Preprocessing II

Slant estimation: E.g. via mean orientation of edges obtained by Canny operator (cf. e.g. [12])

[Figure: histogram of edge orientations over slant angles from −80° to +80°]

Slant normalization (by applying a shear transform)

[Figure: text line before and after slant correction]
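
A minimal numpy-only sketch of slant normalization by a horizontal shear; not from the tutorial. It assumes the slant angle has already been estimated (e.g. from the edge orientations above); deslant is a hypothetical helper name, and nearest-neighbor row shifting stands in for proper image resampling.

```python
import numpy as np

def deslant(img: np.ndarray, slant_deg: float) -> np.ndarray:
    """Shear a text-line image horizontally to remove the estimated slant.

    img: 2-D array with background = 0; slant_deg: estimated slant angle in
    degrees (0 = upright). Nearest-neighbor row shifts keep the sketch
    dependency-free; real systems would use a proper affine warp.
    """
    h, w = img.shape
    shear = np.tan(np.radians(slant_deg))
    pad = int(np.ceil(abs(shear) * h))            # extra width for sheared pixels
    out = np.zeros((h, w + pad), dtype=img.dtype)
    for y in range(h):
        # shift each row proportionally to its distance from the bottom row
        dx = int(round(shear * (h - 1 - y)))
        dx = dx if shear >= 0 else dx + pad       # keep shifts non-negative
        out[y, dx:dx + w] = img[y]
    return out
```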

SLIDE 13

Preprocessing III

Note: Depending on writer and context, script may vary largely in size!
Size normalization methods:
◮ “manually”, heuristically, to a predefined width/height???
◮ depending on the estimated core size (← estimation crucial!)
◮ depending on the estimated character width (avg. distance of contour minima) [7]
[Figure: original text lines (from the IAM-DB) and results of size normalization]

SLIDE 14

Serialization: The Sliding Window Method

Problem: The data is two-dimensional, images of writing!
✗ No chronological structure is inherently defined! Exception: the logical sequence of characters within texts
Solution: The sliding-window approach, first proposed by researchers at the Daimler-Benz Research Center, Ulm [3], and pioneered by researchers at BBN [11]
◮ The time axis runs in writing direction / along the baseline
◮ Extract small, overlapping analysis windows (see the sketch below)

[Frames shown in the figure are for illustration only and are actually too large!]
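
A sketch of the sliding-window serialization under the assumptions stated here: window width and shift are free parameters, and the values 8 and 3 are placeholders, not the tutorial's settings.

```python
import numpy as np

def sliding_windows(line_img: np.ndarray, width: int = 8, shift: int = 3):
    """Yield overlapping analysis windows along the writing direction.

    line_img: normalized text-line image of shape (height, line_width).
    shift < width gives overlapping frames, turning the 2-D image into a
    'temporal' sequence that an HMM can process frame by frame.
    """
    _, line_width = line_img.shape
    for x in range(0, max(line_width - width + 1, 1), shift):
        yield line_img[:, x:x + width]
```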

SLIDE 15

Feature Extraction

Basic idea: Describe the appearance of the writing within the analysis window
✗ No “standard” approaches or feature sets
! No holistic features are used in HMM-based systems
Potential methods:
◮ (For OCR) Local analysis of gray-value distributions (cf. e.g. [1])
◮ Salient elementary geometric shapes (e.g. vertices, cusps)
◮ Heuristic geometric properties (cf. e.g. [13]); see the sketch below

[Figure: nine heuristic geometric features per window, e.g. average upper and lower contour positions relative to the baseline, contour orientations, number of black-white transitions, number of ink pixels per column height, and ink pixels relative to the contour span (max − min)]

Additionally: Compute dynamic features (i.e. discrete approximations of temporal derivatives, cf. e.g. [5])
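
A sketch computing four of the nine heuristic features named in the figure caption (upper/lower contour, black-white transitions, ink density between the contours); the exact definitions used in [13] may differ in detail.

```python
import numpy as np

def window_features(win: np.ndarray, threshold: int = 0) -> np.ndarray:
    """Compute a few heuristic geometric features for one analysis window.

    win: sliding-window image (height x width); ink pixels are > threshold.
    Features are averaged over the window's non-empty columns.
    """
    h, _ = win.shape
    upper, lower, transitions, density = [], [], [], []
    for col in (win > threshold).astype(np.int8).T:   # iterate over columns
        ink = np.flatnonzero(col)
        if ink.size == 0:
            continue
        upper.append(ink[0] / h)                      # normalized upper contour
        lower.append(ink[-1] / h)                     # normalized lower contour
        transitions.append(int(np.count_nonzero(np.diff(col))))  # b/w changes
        density.append(ink.size / (ink[-1] - ink[0] + 1))  # ink / (max - min)
    if not upper:                                     # window without ink
        return np.zeros(4)
    return np.array([np.mean(upper), np.mean(lower),
                     np.mean(transitions), np.mean(density)])
```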

SLIDE 16

General Architecture

[Figure: general architecture. Offline handwriting recognition: imaging device → text detection → localized text area → line extraction → text line images → baseline correction, slant and size normalization. Online handwriting recognition: digitizer tablet → captured pen trajectory. Both paths feed serialization and feature extraction; model decoding with the writing model (HMM) and language model (n-gram) yields the recognition hypotheses.]

SLIDE 17

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 18

Hidden Markov Models: Two-Stage Stochastic Processes

[Figure: general state sequence with P(st|s1, s2, ..., st−1); first-order chain with P(st|st−1); outputs Ot generated with P(Ot|st)]

1st stage: A discrete stochastic process ≈ “probabilistic” finite-state automaton
stationary: the process is independent of the absolute time t
causal: the distribution of st depends only on previous states
simple: in particular, it depends only on the immediate predecessor state (≙ first order)
⇒ P(st|s1, s2, ..., st−1) = P(st|st−1)

2nd stage: An output Ot is generated at every time t, depending on the current state st
⇒ P(Ot|O1 ... Ot−1, s1 ... st) = P(Ot|st)

Note: Only the outputs can be observed ⇒ hidden Markov model

SLIDE 19

Hidden-Markov-Models: Formal Definition

A hidden Markov model λ of first order is defined by:

◮ a finite set of states: {s | 1 ≤ s ≤ N}
◮ a matrix of state-transition probabilities: A = {aij | aij = P(st = j | st−1 = i)}
◮ a vector of start probabilities: π = {πi | πi = P(s1 = i)}
◮ state-specific output probability distributions:
B = {bjk | bjk = P(Ot = ok | st = j)} (discrete case)
or {bj(Ot) | bj(Ot) = p(Ot | st = j)} (continuous case)
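
The definition translates directly into a parameter container; a minimal sketch for the discrete case (the names pi, A, B are conventional, not prescribed by the slide).

```python
import numpy as np
from dataclasses import dataclass

@dataclass
class HMM:
    """Parameters of a first-order HMM with discrete outputs, as defined above.

    N states and K output symbols; every row of A and B is a distribution.
    """
    pi: np.ndarray   # (N,)   start probabilities, pi[i] = P(s1 = i)
    A: np.ndarray    # (N, N) transitions, A[i, j] = P(st = j | st-1 = i)
    B: np.ndarray    # (N, K) outputs, B[j, k] = P(Ot = ok | st = j)

    def validate(self) -> None:
        assert np.allclose(self.pi.sum(), 1.0)
        assert np.allclose(self.A.sum(axis=1), 1.0)
        assert np.allclose(self.B.sum(axis=1), 1.0)
```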

SLIDE 20

Modeling of Outputs

Discrete inventory of symbols: Very limited application fields; suited for discrete data only (e.g. DNA)
✗ Inappropriate for non-discrete data: use of a vector quantizer required!
Continuous modeling: The standard for most pattern recognition applications processing sensor data
◮ Treatment of real-valued vector data (i.e. the vast majority of “real-world” data)
◮ Defines distributions over R^n
Problem: No general parametric description exists
Procedure: Approximation using mixture densities

p(x) ≙ Σ_k ck · N(x | µk, Ck) ≈ Σ_{k=1}^{M} ck · N(x | µk, Ck)

SLIDE 21

Modeling of Outputs – II

[Figure: three-state section of an HMM with transition probabilities aii, aij, ajj, ajk, aik, akk and output density bj(xt)]

Mixture density modeling:
◮ Base distribution? ⇒ Gaussian (normal) densities
◮ Shape of the distributions (full / diagonal covariances)? ⇒ Depends on the pre-processing of the data (e.g. redundancy reduction)
◮ Number of mixture components? ⇒ Clustering (. . . and heuristics)
◮ Estimation of the mixtures? ⇒ e.g. Expectation-Maximization; note: in HMMs this is integrated with the general parameter estimation
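
A sketch of evaluating such a mixture density for one feature vector; plain numpy with full covariances and no numerical safeguards, so it is illustrative only.

```python
import numpy as np

def mixture_density(x, weights, means, covs):
    """Evaluate p(x) = sum_k c_k N(x | mu_k, C_k) for one feature vector x.

    weights: (M,), means: (M, d), covs: (M, d, d). Diagonal covariances are
    the common practical choice after decorrelating preprocessing.
    """
    d = x.shape[0]
    p = 0.0
    for c, mu, C in zip(weights, means, covs):
        diff = x - mu
        norm = 1.0 / np.sqrt((2 * np.pi) ** d * np.linalg.det(C))
        p += c * norm * np.exp(-0.5 * diff @ np.linalg.solve(C, diff))
    return p
```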

SLIDE 22

Usage Concepts for Hidden-Markov-Models

[Figure: one HMM used three ways: scoring P(O|λ), decoding s∗ = argmax_s P(O, s|λ), and training with P(O|λ̂) ≥ P(O|λ)]

Assumption: The patterns observed are generated by stochastic models that are comparable in principle
Scoring: How well does the model describe some pattern? → Computation of the production probability P(O|λ)
Decoding: What is the “internal structure” of the model? (≙ “recognition”) → Computation of the optimal state sequence s∗ = argmax_s P(O, s|λ)
Training: How to determine the “optimal” model? → Improvement of a given model λ, yielding λ̂ with P(O|λ̂) ≥ P(O|λ)

SLIDE 23

The Production Probability

Wanted: An assessment of an HMM's quality for describing the statistical properties of data
Widely used measure: The production probability P(O|λ) that the observation sequence O was generated by the model λ, along an arbitrary state sequence

[Figure: trellis of states over time t = 1, ..., T ⇒ P(O|λ)]

! Naive computation infeasible: exponential complexity O(T · N^T)

SLIDE 24

The Production Probability: The Forward-Algorithm

More efficient: Exploitation of the Markov property, i.e. the “finite memory” ⇒ “decisions” depend only on the immediate predecessor state
Let αt(i) = P(O1, O2, ..., Ot, st = i | λ) (the forward variable). Then:

1. α1(i) := πi · bi(O1)
2. αt+1(j) := [ Σ_{i=1}^{N} αt(i) · aij ] · bj(Ot+1)
3. P(O|λ) = Σ_{i=1}^{N} αT(i)

Complexity: O(T · N²)! (vs. O(T · N^T) for the naive computation; a code sketch follows below)

[Figure: trellis detail showing αt+1(j) computed from the αt(i) via the sum over transitions, weighted by bj(Ot+1)]

Note: There exists an analogous Backward-Algorithm required for parameter estimation.
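
A compact sketch of the three steps above for discrete outputs; real systems add scaling or log-domain arithmetic against underflow, which is omitted here for clarity.

```python
import numpy as np

def forward(pi, A, B, obs):
    """Forward algorithm: P(O | lambda) in O(T * N^2).

    pi: (N,), A: (N, N), B: (N, K) discrete output probabilities,
    obs: sequence of observed symbol indices.
    """
    alpha = pi * B[:, obs[0]]              # step 1: alpha_1(i)
    for o in obs[1:]:                      # step 2: induction over t
        alpha = (alpha @ A) * B[:, o]      # [sum_i alpha_t(i) a_ij] * b_j(O_t+1)
    return alpha.sum()                     # step 3: P(O | lambda)
```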

SLIDE 25

Decoding

Problem: The global production probability P(O|λ) is not sufficient for analysis if individual states are associated with meaningful segments of the data
⇒ (Probabilistic) inference of the optimal state sequence s∗ necessary
Maximization of the posterior probability: s∗ = argmax_s P(s|O, λ)
Bayes' rule: P(s|O, λ) = P(O, s|λ) / P(O|λ)
P(O|λ) is irrelevant (constant for fixed O and given λ), thus:

s∗ = argmax_s P(s|O, λ) = argmax_s P(O, s|λ)

Computation of s∗: the Viterbi algorithm

SLIDE 26

The Viterbi Algorithm

. . . an inductive procedure for the efficient computation of s∗, again exploiting the Markov property
Let δt(i) = max_{s1, s2, ..., st−1} P(O1, O2, ..., Ot, st = i | λ). Then:

1. δ1(i) := πi · bi(O1);  ψ1(i) := 0
2. δt+1(j) := max_i (δt(i) · aij) · bj(Ot+1);  ψt+1(j) := argmax_i (δt(i) · aij)
3. P∗(O|λ) = P(O, s∗|λ) = max_i δT(i);  s∗T := argmax_j δT(j)
4. Back-tracking of the optimal path: s∗t = ψt+1(s∗t+1)

Implicit segmentation
Linear complexity in time
! Quadratic complexity w.r.t. the number of states

[Figure: trellis with back-pointers ψt+1(j); following them from s∗T yields the optimal path with P∗(O|λ) = P(O, s∗|λ)]
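
A log-domain sketch of the four steps, with the back-pointers ψ stored per time step; hard-zero transitions become −inf under the logarithm, which is the intended behavior.

```python
import numpy as np

def viterbi(pi, A, B, obs):
    """Viterbi decoding: optimal state sequence s* and log P(O, s* | lambda).

    pi: (N,), A: (N, N), B: (N, K), obs: sequence of symbol indices.
    Log domain avoids underflow on long observation sequences.
    """
    logA = np.log(A)                                   # zeros -> -inf (hard zeros)
    delta = np.log(pi) + np.log(B[:, obs[0]])          # delta_1(i)
    psi = []                                           # back-pointers per step
    for o in obs[1:]:
        scores = delta[:, None] + logA                 # delta_t(i) + log a_ij
        psi.append(scores.argmax(axis=0))              # psi_t+1(j)
        delta = scores.max(axis=0) + np.log(B[:, o])
    path = [int(delta.argmax())]                       # s*_T
    for back in reversed(psi):                         # back-tracking
        path.append(int(back[path[-1]]))
    return path[::-1], float(delta.max())
```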

SLIDE 27

Parameter Estimation – Fundamentals

Goal: Derive an optimal (for some purpose) statistical model from sample data
Problem: No suitable analytical method / algorithm is known
“Work-around”: Iteratively improve an existing model λ ⇒ the optimized model λ̂ is better suited for the given sample data
General procedure: The parameters of λ are subject to a growth transformation such that P(O|λ̂) ≥ P(O|λ)

1. “Observe” the model's actions during the generation of an observation sequence
2. Replace the original parameters by the relative frequencies of the respective events:

âij = (expected number of transitions from i to j) / (expected number of transitions out of state i)
b̂i(ok) = (expected number of outputs of ok in state i) / (total number of outputs in state i)

Limitation: An initial model is required!

SLIDE 28

Parameter Estimation: How to Get Started?

Problem: Parameter training is only defined on the basis of initial parameters!
Possible solutions:
◮ Random / uniform initialization (✗ only possible for discrete models)
◮ (Fully) supervised (✗ detailed annotation of the training data required)
◮ (Partly) supervised: compute the annotation automatically with an existing model
Pragmatic solution: Use semi-continuous models ⇒ initialization as a combination of:
1. unsupervised estimation of an initial codebook
2. uniform initialization of the remaining parameters (i.e. transition probabilities and mixture weights)

SLIDE 29

Configuration of HMMs: Topologies

Generally: Transitions between arbitrary states are possible within HMMs ... potentially with arbitrarily low probability
Topology of an HMM: The explicit representation of the allowed transitions (drawn as edges between nodes/states)
Any transition possible ⇒ ergodic HMM
Observation: A fully connected HMM usually does not make sense for describing chronologically organized data
✗ “Backward” transitions would allow arbitrary repetitions within the data

SLIDE 30

Configuration of HMMs: Topologies II

Idea: Restrict the potential transitions to the relevant ones! ... by omitting irrelevant edges, i.e. setting the respective transition probabilities to “hard” zeros (never modified!)
Structures/requirements for modeling chronologically organized data:
◮ “Forward” transitions (i.e. progress in time)
◮ “Loops” for modeling variable durations of segments
◮ “Skips” allow for optional/missing parts of the data
◮ Skipping of one or multiple states forward

SLIDE 31

Configuration of HMMs: Topologies III

Overview: The two most common topologies for handwriting (and speech) recognition are the linear HMM and the Bakis-type HMM (see the sketch below).
Note: General left-to-right models (allowing skips over any number of states) are not used in practice!
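
A sketch constructing a Bakis-type transition matrix (self-loop, step, and skip of exactly one state); the probability values are placeholders, not tutorial settings, and rows near the end are renormalized where skips would leave the model.

```python
import numpy as np

def bakis_transitions(n_states: int, p_loop=0.5, p_next=0.4, p_skip=0.1):
    """Transition matrix of a Bakis-type HMM.

    Allowed transitions: loop (i -> i), forward (i -> i+1), skip (i -> i+2).
    All other entries are 'hard' zeros that training never touches.
    """
    A = np.zeros((n_states, n_states))
    for i in range(n_states):
        A[i, i] = p_loop
        if i + 1 < n_states:
            A[i, i + 1] = p_next
        if i + 2 < n_states:
            A[i, i + 2] = p_skip
        A[i] /= A[i].sum()          # renormalize rows near the final state
    return A
```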

SLIDE 32

Configuration of HMMs: Compound Models

Goal: Segmentation
◮ Basic units: characters [also: (sub-)stroke models]
◮ Words formed by concatenation
◮ Lexicon ≙ parallel connection [non-emitting states merge edges]
◮ Model for arbitrary text obtained by adding a loop
⇒ Decoding the compound model produces a segmentation (i.e. determines the optimal state/model sequence)

SLIDE 33

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 34

n-Gram Models: Introduction

Goal of statistical language modeling: Define a probability distribution over a set of symbol (= word) sequences
Origin of the name language model: The methods are closely related to
◮ the statistical modeling of texts and
◮ imposing restrictions on word hypothesis sequences (especially in automatic speech recognition)
Powerful concept: Use of Markov chain models
Alternative method: Stochastic grammars
✗ Rules cannot be learned
✗ Complicated, costly parameter training
⇒ Not widely used!

SLIDE 35

n-Gram Models: Definition

Goal: Calculate P(w) for a given word sequence w = w1, w2, ..., wT
Basis: An n-gram model ≙ a Markov chain model of order n − 1
Method: Factorization of P(w) applying Bayes' rule:

P(w) = P(w1) · P(w2|w1) · ... · P(wT|w1, ..., wT−1) = Π_{t=1}^{T} P(wt | w1, ..., wt−1)

Problem: The context dependency grows arbitrarily with the length of the symbol sequence ⇒ limit the length of the “history”:

P(w) ≈ Π_{t=1}^{T} P(wt | wt−n+1, ..., wt−1)

Result: The predicted word wt and its history form an n-tuple of symbols ⇒ n-gram (≙ event) ⇒ n-gram models (typically n = 2 ⇒ bi-gram, n = 3 ⇒ tri-gram); a counting sketch follows below
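
A sketch of the naive maximum-likelihood estimate by counting; as discussed on the following slides, it breaks down for unseen events.

```python
from collections import Counter

def ngram_probs(words, n=2):
    """Naive ML n-gram model (no smoothing):
    P(w_n | w_1..w_n-1) = c(w_1..w_n) / c(w_1..w_n-1).
    """
    grams = Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))
    hist = Counter(tuple(words[i:i + n - 1]) for i in range(len(words) - n + 2))
    return {g: c / hist[g[:-1]] for g, c in grams.items()}

# e.g. bigram estimates from a toy corpus:
# ngram_probs("a rose is a rose".split(), n=2)[("a", "rose")] == 1.0
```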

SLIDE 36

n-Gram Models: Use Cases

Basic assumption, similar to the HMM case:
1. Reproduce the statistical properties of the observed data
2. Derive inferences from the model
Problems to be solved:
Evaluation: How well does the model represent certain data? Basis: the probability of a symbol sequence assigned by the model
Model creation: How to create a good model?
◮ No hidden state variables ⇒ no iteratively optimizing techniques required
◮ Parameters can in principle be computed directly (by simple counting)
! More sophisticated methods are necessary in practice! [→ parameter estimation]
Combination with an appearance model (i.e. HMM) [→ integrated search]

SLIDE 37

n-Gram Models: Evaluation

Basic principle: Determine the descriptive power on unknown data
Quality measure: Perplexity

P(w) = P(w1, w2, ..., wT)^(−1/T)

◮ The reciprocal of the geometric mean of the symbol probabilities
◮ Derived from the (cross-)entropy definition of a (formal) language:

H(p|q) = − Σ_i pi · log2 qi   (pi: data, qi: model)
→ − (1/T) Σ_t log2 P(wt|...)   (empirical estimate on the data, model P)

P(w) = 2^{H(w | P(·|...))} = 2^{−(1/T) Σ_t log2 P(wt|...)} = P(w1, w2, ..., wT)^{−1/T}

Question: How can perplexity be interpreted?
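
A sketch computing perplexity from the per-symbol model probabilities P(wt|...); the uniform-prediction example reproduces the |V| result of the next slide.

```python
import math

def perplexity(probs):
    """Perplexity of a sequence given its per-symbol model probabilities:
    2 ** ( -(1/T) * sum_t log2 P(w_t | ...) ).
    """
    T = len(probs)
    return 2 ** (-sum(math.log2(p) for p in probs) / T)

# Uniform predictions over |V| = 50 symbols yield perplexity 50.0:
print(perplexity([1 / 50] * 100))
```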

SLIDE 38

n-Gram Models: Interpretation of Perplexity

◮ Worst-case situation: All symbols are equally likely
⇒ prediction according to the uniform distribution P(wt|...) = 1/|V|
◮ Perplexity of the texts generated:

P(w) = [(1/|V|)^T]^{−1/T} = |V|

Note: Perplexity equals the vocabulary size in the absence of restrictions
◮ In any other case: perplexity ρ < |V|
Reason: Entropy (and hence perplexity) is maximal for the uniform distribution!
◮ Relating this to an “uninformed” source with uniform distribution: prediction is as hard as for a source with |V′| = ρ
Interpretation: Perplexity gives the size of the “virtual” lexicon of the statistical source!

SLIDE 39

n-Gram Models: Parameter Estimation

Naive method:
◮ Determine the numbers of occurrences:
c(w1, w2, ..., wn) for all n-grams and
c(w1, w2, ..., wn−1) for all (n−1)-grams
◮ Calculate the conditional probabilities:

P(wn | w1, w2, ..., wn−1) = c(w1, w2, ..., wn) / c(w1, ..., wn−1)

Problem: Many n-grams are not observed ⇒ “unseen events”
✗ c(w1 ... wn) = 0 ⇒ P(wn | ...) = 0 ⇒ P(... w1 · · · wn ...) = 0!

SLIDE 40

n-Gram Models: Parameter Estimation II

Parameter estimation in practice
Problem:
◮ Not some but most n-gram counts will be zero!
◮ It must be assumed that this is only due to insufficient training data!
⇒ Estimate a useful P(z|y) for n-grams yz with c(yz) = 0
Question: What estimates are “useful”?
◮ Small probabilities! Smaller than for seen events? → mostly not guaranteed!
◮ Specific probabilities, not uniform for all unseen events
Solution:
1. Modify the n-gram counts and gather “probability mass” for unseen events (note: keep the modification reasonably small for seen events!)
2. Redistribute the zero-probability to unseen events according to a more general distribution (≙ smoothing of the empirical distribution)
Question: What distribution is suitable for events we know nothing about?

SLIDE 41

n-Gram Models: Parameter Estimation III

Robust parameter estimation, overview:
Frequency distribution (counts) → discounting (gathering probability mass)
Zero probability → incorporation of a more general distribution

[Figure: count histograms over the vocabulary before and after discounting]

SLIDE 42

n-Gram Models: Discounting

Gathering of probability mass: Calculate a modified frequency distribution f∗(z|y) for seen n-grams yz:

f∗(z|y) = c∗(yz) / c(y·) = (c(yz) − β(yz)) / c(y·)

Zero-probability λ(y) for history y: the sum of the “collected” counts

λ(y) = Σ_{z: c(yz)>0} β(yz) / c(y·)

Choices for the discounting factor β(·):
◮ proportional to the n-gram count: β(yz) = α · c(yz) ⇒ linear discounting
◮ some constant 0 < β ≤ 1 ⇒ absolute discounting
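
A sketch of absolute discounting for a single history y; beta is assumed to satisfy 0 < beta ≤ 1 so the discounted counts stay non-negative.

```python
from collections import Counter

def absolute_discounting(counts: Counter, beta: float = 0.5):
    """Absolute discounting for one history y.

    counts: maps successor z -> c(yz).  Subtracts the constant beta from
    every seen count and collects the freed mass as the zero-probability
    lambda(y) = sum of beta over seen events, divided by c(y.).
    """
    total = sum(counts.values())                       # c(y.)
    f_star = {z: (c - beta) / total for z, c in counts.items()}
    lam = beta * len(counts) / total                   # collected mass
    return f_star, lam
```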

SLIDE 43

n-Gram Models: Smoothing

Redistribution of probability mass: Basic methods for incorporating more general distributions:
Interpolation: Linear combination of the (modified) n-gram distribution and (one or more) general distributions
Backing off: Use the more general distribution for unseen events only
Remaining problem: What is a more general distribution?
Widely used solution: The corresponding (n−1)-gram model P(z|ŷ) associated with the n-gram model P(z|y)
◮ Generalization ≙ shortening the context/history: y = y1, y2, ..., yn−1 → ŷ = y2, ..., yn−1
◮ More general distribution obtained: q(z|y) = q(z|y1, y2, ..., yn−1) ← P(z|y2, ..., yn−1) = P(z|ŷ)
(i.e. the bi-gram for a tri-gram model, the uni-gram for a bi-gram model, ...)

SLIDE 44

n-Gram Language Models: Interpolation

Principal idea (not considering the modified distribution f∗(·|·)):

P(z|y) = (1 − α) · f(z|y) + α · q(z|y),  0 ≤ α ≤ 1

Problem: The interpolation weight α needs to be optimized (e.g. on held-out data)
Simplified view with linear discounting: f∗(z|y) = (1 − α) · f(z|y)
Estimates obtained:

P(z|y) = f∗(z|y) + λ(y) · q(z|y)   if c∗(yz) > 0
P(z|y) = λ(y) · q(z|y)             if c∗(yz) = 0

Properties:
◮ Assumes that estimates always benefit from smoothing ⇒ all estimates are modified
◮ Helpful if the original estimates are unreliable
✗ Estimates from large sample counts should be “trusted”

SLIDE 45

n-Gram Language Models: Backing Off

Basic principle: Back off to the general distribution for unseen events (a sketch follows below):

P(z|y) = f∗(z|y)              if c∗(yz) > 0
P(z|y) = λ(y) · Ky · q(z|y)   if c∗(yz) = 0

The normalization factor Ky ensures that Σ_z P(z|y) = 1:

Ky = 1 / Σ_{z: c∗(yz)=0} q(z|y)

Note:
◮ The general distribution is used for unseen events only
◮ Estimates with substantial support remain unmodified and are assumed reliable
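
A sketch of backing off for one history, built on the discounted estimates f∗ and zero-probability λ(y) from the previous slide; it assumes the general distribution q leaves some mass on unseen successors.

```python
def backoff_prob(z, f_star, lam, q):
    """Backed-off probability P(z|y) for one history y.

    f_star: discounted estimates for seen successors; lam: zero-probability
    lambda(y); q: more general distribution (e.g. the (n-1)-gram model) as a
    dict z -> probability, assumed to cover some unseen successors.
    """
    if z in f_star:                                    # seen event: c*(yz) > 0
        return f_star[z]
    unseen_mass = sum(p for s, p in q.items() if s not in f_star)
    K = 1.0 / unseen_mass                              # normalization K_y
    return lam * K * q.get(z, 0.0)
```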

SLIDE 46

n-Gram Language Models: Generalized Smoothing

Observation: With the standard solution for q(z|y), the more general distribution is again an n-gram model ⇒ the principle can be applied recursively
Example for backing off with a tri-gram model:

P(z|xy) = f∗(z|xy)                                      if c∗(xyz) > 0
        = λ(xy) · Kxy · f∗(z|y)                          if c∗(xyz) = 0 ∧ c∗(yz) > 0
        = λ(xy) · Kxy · λ(y) · Ky · f∗(z)                if c∗(yz) = 0 ∧ c∗(z) > 0
        = λ(xy) · Kxy · λ(y) · Ky · λ(·) · K· · 1/|V|    if c∗(z) = 0

Note: The combination of absolute discounting and backing off creates powerful n-gram models for a wide range of applications (cf. [4]).

SLIDE 47

n-Gram Language Models: Representation and Storage

Requirement: n-gram models need to define specific probabilities for all potential events (i.e. |V|^n scores!)
Observation: Only the probabilities of seen events are predefined (in case of discounting: including the context-dependent zero-probability) ⇒ the remaining probabilities can be computed
Consequence: Store only the probabilities of seen events in memory ⇒ huge savings, as most events are never observed!
Further observation: n-grams always come in hierarchies (for representing the respective general distributions) ⇒ store the parameters in a prefix tree for easy access

SLIDE 48

n-Gram Language Models: Representation and Storage II

[Figure: prefix-tree storage of n-gram parameters: nodes along the symbols x, y, z hold the scores f∗(·), e.g. f∗(x), f∗(x|y), f∗(z|xy), together with the back-off weights λ(·) and normalization factors K(·); a score such as P(z|xy) is retrieved by descending the tree, falling back via λ(xy) · Kxy to shorter histories when the full n-gram was not seen]

SLIDE 49

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 50

Integrated Search: Introduction

Remember the channel model:

[Figure: channel model as before. Source: text production w → Channel: script realization, feature extraction X → Recognition: statistical decoding ŵ = argmax_w P(w|X)]

⇒ HMMs + n-gram models are frequently used in combination!
Problems in practice:
◮ How to compute a combined score? The channel model defines the basis only!
◮ When to compute the score? The model is valid for complete HMM results only!
◮ How does the language model improve the results?
! Why not use HMMs only, to avoid those problems?

SLIDE 51

Integrated Search: Basics

Problem 1: Multiplication of P(X|w) and P(w) does not work in practice!
⇒ Weighted combination using a “linguistic matching factor” ρ:  P(w)^ρ · P(X|w)
Reason: HMM and n-gram scores are obtained at largely different time scales and orders of magnitude
◮ HMM: a multi-dimensional density per frame
◮ n-gram: a conditional probability per word
Problem 2: The channel model defines the score combination for complete results!
◮ It can be used in practice only if ...
◮ the HMM-based search generates multiple alternative solutions and ...
◮ the n-gram model evaluates these afterwards.
⇒ No benefit for the HMM search!
⇒ Better: apply the combination to intermediate results, i.e. path scores δt(·), achieved by using P(z|y) as “transition probabilities” at word boundaries.

SLIDE 52

Integrated Search: Basics II

Question: How does the language model influence the quality of the results?
Rule of thumb: The error rate decreases proportionally to the square root of the perplexity.
Example for lexicon-free recognition (IAM-DB) with character n-grams [13]:

n-gram order          none         2            3           4           5
% CER / perplexity    29.2 / (75)  22.1 / 12.7  18.3 / 9.3  16.1 / 7.7  15.6 / 7.3
CER / √P              n.a.         6.2          6.0         6.0         5.8

Note: This is an important plausibility check: if it is violated, something strange is happening!

SLIDE 53

Integrated Search: HMM Networks

◮ Straightforward extension of HMM-only models
◮ n-gram scores used as transition probabilities between words
✗ HMMs store single-state context only ⇒ only bi-grams usable!
Question: How can higher-order models (e.g. tri-grams) be used?

[Figure: word network over the words a, b, c with entry scores P(a), P(b), P(c) and bi-gram transitions such as P(b|a), P(b|c)]

SLIDE 54

Integrated Search: HMM Networks II

Higher-order n-gram models ⇒ context-dependent copies of the word models (i.e. state groups) necessary!
✗ The total model grows exponentially with the n-gram order!

[Figure: context-dependent copies [a]a, [a]b, [a]c, [c]a, [c]b, [c]c of the word models a, b, c, with tri-gram transition probabilities such as P(a|a a), P(a|a c), P(a|c c)]

SLIDE 55

Integrated Search: Search Tree Copies

[Figure: HMM prefix tree over character models /a/, /b/, /c/, ..., with branches for words with the prefix /au/ and leaves for words such as art, arab, au, auto, auction, be, beam, boom, zoo]

Note: In large-vocabulary HMM systems the models are usually compressed by using a prefix-tree representation.
Problem: Word identities are only known at the leaves of the tree (i.e. after passing through the prefix tree)
Question: How to integrate a language model?
Solution:
◮ “Remember” the identity of the last word seen and ...
◮ incorporate the n-gram score with one word delay.
✗ Search tree copies required!

SLIDE 56

Integrated Search: Search Tree Copies II

HMM prefix tree + tri-gram model:

[Figure: prefix-tree copies for the histories a..., b..., c..., ac..., cb..., entered with scores such as P(a), P(b), P(c), P(c|a), P(b|c), P(b|ac)]

✗ Context-based tree copies are required, depending on the two predecessor words
Nevertheless this achieves an efficiency improvement, as the HMM decoding effort is reduced

SLIDE 57

Integrated Search: Rescoring

Problem: The integrated use of higher-order n-gram models is expensive!
Solution: Use separate search “phases” with increasing model complexity:
1. Decode the HMM with an inexpensive language model (e.g. a bi-gram)
2. Create alternative solutions (e.g. an n-best list)
3. Rescore with an n-gram of arbitrary order ⇒ the existing solutions are sorted differently! (see the sketch below)

[Figure: n-best list of hypotheses such as “handwriting recognition is difficult”, each with an HMM score and an n-gram score; the hypotheses are re-ranked according to the combined score]
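
A sketch of the rescoring phase: combining first-pass HMM log-scores with a stronger language model under the linguistic matching factor ρ. The value 8.0 is a placeholder, and lm_logprob is a hypothetical scoring function, e.g. a tri-gram model.

```python
def rescore_nbest(hypotheses, lm_logprob, rho=8.0):
    """Rescore an n-best list with a higher-order language model.

    hypotheses: list of (word_sequence, hmm_log_score) from the first pass;
    lm_logprob: function returning log P(w) under the stronger n-gram model;
    rho: linguistic matching factor (assumed tuned elsewhere).
    Combined score: log P(X|w) + rho * log P(w), i.e. P(w)^rho * P(X|w).
    """
    scored = [(words, hmm + rho * lm_logprob(words))
              for words, hmm in hypotheses]
    return sorted(scored, key=lambda h: h[1], reverse=True)
```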

SLIDE 58

Overview

◮ Introduction
◮ Markov Model-Based Handwriting Recognition . . . Fundamentals
◮ Hidden Markov Models . . . Definition, Use Cases, Algorithms
◮ Language Models . . . Definition & Robust Estimation
◮ Integrated Search . . . Combining HMMs and n-Gram Models
◮ Summary . . . and Further Reading

SLIDE 59

Markov Models for HWR: Summary

Stochastic model for sequential patterns with high variability
Powerful combination of an appearance model (writing ≙ HMM) and a language model (≙ n-gram model) possible
Efficient algorithms for training and decoding exist
Segmentation and classification are performed in an integrated manner: segmentation-free recognition
✗ The model structure (esp. for HMMs) needs to be pre-defined
✗ Only limited context lengths are manageable (with n-gram models)
✗ An initial model is required for training (of HMMs)
✗ Considerable amounts of training data are necessary (as for all stochastic models)

“There is no data like more data!”

[Robert L. Mercer, IBM]

SLIDE 60

Further Reading

Self-Study Materials provided with this tutorial:

◮ How to build handwriting recognizers using ESMERALDA
◮ Pre-configured, ready-to-run HWR experiments on the IAM-DB!

Textbook: Gernot A. Fink: Markov Models for Pattern Recognition. Springer, Berlin Heidelberg, 2008. Inspection copy available! Conference discount: 20%!
Survey article: Thomas Plötz & Gernot A. Fink: Markov Models for Offline Handwriting Recognition: A Survey. IJDAR, 12(4):269–298, 2009. Open-access publication!
Brand new: Thomas Plötz & Gernot A. Fink: Markov Models for Handwriting Recognition. SpringerBriefs in Computer Science, 2011.

SLIDE 61

References I

[1] Issam Bazzi, Richard Schwartz, and John Makhoul. An omnifont open-vocabulary OCR system for English and Arabic. IEEE Trans. on Pattern Analysis and Machine Intelligence, 21(6):495–504, 1999.
[2] Radmilo M. Bozinovic and Sargur N. Srihari. Off-line cursive script word recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence, 11(1):69–83, 1989.
[3] T. Caesar, J. M. Gloger, and E. Mandler. Preprocessing and feature extraction for a handwriting recognition system. In Proc. Int. Conf. on Document Analysis and Recognition, pages 408–411, Tsukuba Science City, Japan, 1993.
[4] Stanley F. Chen and Joshua Goodman. An empirical study of smoothing techniques for language modeling. Computer Speech & Language, 13:359–394, 1999.

SLIDE 62

References II

[5] J. G. A. Dolfing and R. Haeb-Umbach. Signal representations for Hidden Markov Model based on-line handwriting recognition. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing, volume IV, pages 3385–3388, München, 1997.
[6] Gernot A. Fink. Markov Models for Pattern Recognition. Springer, Berlin Heidelberg, 2008.
[7] S. Madhvanath, G. Kim, and V. Govindaraju. Chaincode contour processing for handwritten word recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence, 21(9):928–932, 1999.
[8] Thomas Plötz and Gernot A. Fink. Markov models for offline handwriting recognition: A survey. Int. Journal on Document Analysis and Recognition, 12(4):269–298, 2009.

SLIDE 63

References III

[9] Thomas Plötz and Gernot A. Fink. Markov Models for Handwriting Recognition. SpringerBriefs in Computer Science. Springer, 2011.
[10] M. Schenkel, I. Guyon, and D. Henderson. On-line cursive script recognition using time delay neural networks and hidden Markov models. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing, volume 2, pages 637–640, Adelaide, Australia, April 1994.
[11] Richard Schwartz, Christopher LaPre, John Makhoul, Christopher Raphael, and Ying Zhao. Language-independent OCR using a continuous speech recognition system. In Proc. Int. Conf. on Pattern Recognition, volume 3, pages 99–103, Vienna, Austria, 1996.
[12] M. Wienecke, G. A. Fink, and G. Sagerer. Experiments in unconstrained offline handwritten text recognition. In Proc. 8th Int. Workshop on Frontiers in Handwriting Recognition, Niagara-on-the-Lake, Canada, August 2002. IEEE.

SLIDE 64

References IV

[13] M. Wienecke, G. A. Fink, and G. Sagerer. Toward automatic video-based whiteboard reading. Int. Journal on Document Analysis and Recognition, 7(2–3):188–200, 2005.
