Recognition of Japanese Historical Hand- Written Characters Based - PowerPoint PPT Presentation

Recognition of Japanese Historical Hand- Written Characters Based on Object Detection Methods Yiping Tang, Kohei Hatano, Eiji Takimoto Kyushu university

What is Kuzushiji? • Definition(*)： Kuzushiji is written with handwritten characters in Japanese historical literature. • Difficulty in recognition: (i) characters are often connected without explicit spaces (ii) Characters are often simplified or abbreviated. • Segmentation is not easy Kuzushiji character of 「あ,a」 • http://wwwap.hi.u- tokyo.ac.jp/ships/shipscontroller https://www.nijl.ac.jp/pages/event/seminar/2015/old_books_text.html 1

Recognition of Single Kuzushiji character • Single Kuzushiji characters can be recognized with high accuracy by deep learning. • [hayasaka+ 16]48 kinds of kuzushiji hiragana…70-80% • [kitamoto16] 10 most frequent characters in CODH dataset …96-97% • PRMU2017 contest, 46 kinds of single kuzushiji…97.2% • [Clanuwat+ 18] Kuzushiji-49 dataset… 97.33% 2

The background of Kuzushiji recognition --how to segmentation • [Nguyen+ 17] 1. Find bounding boxes by multiple fixed size sliding windows. 2. Extract and process features using CNN, RNN. 3. Use CTC(Connectionist Temporal Classification) to derive the result. Problem: 1. The predicted boxes in result will be some fixed size, and cannot fit the shape of character. 2. There will be lots of bounding boxes that only circle the part of character but seem as a full character. 3

The background of Kuzushiji recognition --how to segmentation • [Kitamoto+ 19] 1. Create an annotation dataset for pixel units for learning 2. Train by U-net network 3. Predict the label of each pixel in full book page Problem: 1. Need annotation data of each pixel 1. Hard to train 2. Take up lots of memory 4

Our approach(1): Segmentation and recognition at the same time based on object detection method Input: • digital image Output: • pair of label and bounding box for each object 5

Object detection ーーLearn segmentation/recognition data simultaneously {bounding box1,label confidence 1} {bounding box2,label confidence 2} {bounding box3,label confidence 3} prediction aggregation {bounding box4,label confidence 4} {bounding box5,label confidence 5} {bounding box6,label confidence 6} Weight file {bounding box7,label confidence 7} {bounding box8,label confidence 8} learn Images of consecutive characters with Weight file label and segmentation information • Problem: How can we obtain learning data with segmentation information?

Kuzushiji segmentation dataset[Tang+18] Use for learn , Character segmentation segmentation dataset but have no segmentation information Character segmentation information of each character information, (all of hiragana) Use for learn Base on CODH dataset and PRMU contest dataset, have segmentation • information and label information of image of each character. 77953 three-letter images and 12582 multi-letter images • Removal of difficult data or erroneous data by double check by manual • operation

Proposed method① －－ get bounding box and label confidence information simultaneously Apply object detection[Redmon+ 18] to recognition of Kuzushiji • {bounding box1,label confidence 1} {bounding box2,label confidence 2} {bounding box3,label confidence 3} {bounding box4,label confidence 4} {bounding box5,label confidence 5} {bounding box6,label confidence 6} {bounding box7,label confidence 7} {bounding box1,label confidence 1} {bounding box2,label confidence 2} {bounding box3,label confidence 3} aggregation {bounding box4,label confidence 4} Yolov3-darknet54 {bounding box5,label confidence 5} {bounding box6,label confidence 6} {bounding box7,label confidence 7} The darknet53 model is used as backbone network. •

Aggregation method of Yolov3 --non maximum suppression (NMS) 0.7 0.9 1. Set the label confidence threshold and the overlap threshold. 0.4 2. Find the highest score box without repeating 3. Two proposals are considered to be in the same cluster when their IoU(Intersection over Union) is larger than the overlap threshold, 0.3 0.6 only keep the one with the highest score in the cluster. 0.4 4. Loop 2, 3 until there are no new box can be find Problem: 1. Unable to guarantee the number of output characters 2. Bad handling overlay problem of characters

Proposed method① ーー aggregation method {coordinate/label info…} {coordinate/label info…} {coordinate/label info…} {coordinate/label info…} {coordinate/label info…} {coordinate/label info…} {coordinate/label info…} Weight file {coordinate/label info…} {coordinate/label info…} 1. Record the center of each box. 2. Assume the number of clusters of Kuzushiji characters as K. 3. A box with a maximum label confidence of character in each cluster regarded as the representative． Advantage: Since a plausible box is selected for each character cluster, recognition is rarely discarded or passed. 10

Evaluation criteria for bounding boxes Given the sequence of predicted bounding boxes ( ) and ground truth bounding boxes ( ), the consistency rate (CR) of the predicted sequence of boxes is defined as for formula. Parameter: CR only focuses on differences of bounding boxes in the vertical direction, which is sufficient for our purpose. 12

• Training 70,000 images(three characters) from dataset[Tang+18] for training. • evaluation Other 7,000 images(three characters) from dataset[Tang+18] for testing. • Results ① ② [ Nguyen+ 17 ] ③ ④ ⑤ ⑥ ④FGDM-a is denoted as the result of FGDM with the same learning rate of YOLOv3 and ⑤FGDM-b is the one with decreasing learning rate by multiplying 0.1 in every 40000 rounds.

Future work Recognition for Kuzushiji images of more than three characters(Lv3). (Use original YOLOv3)

• Thanks 16

Recognition of Japanese Historical Hand- Written Characters Based - PowerPoint PPT Presentation

Recognition of Japanese Historical Hand- Written Characters Based on Object Detection Methods Yiping Tang, Kohei Hatano, Eiji Takimoto Kyushu university What is Kuzushiji? Definition(*) Kuzushiji is written with hand- written

List of hand outs for this session Hand out 1: Incident decision tree Hand out 2: Yorkshire

Japanese Layout Requirements Richard Ishida 1 Japanese Layout Requirements This presentation

Hand Hygiene Stefan Morton Hand Hygiene Coordinator Evidence Improved adherence to hand

Hand Hand- -Written Written aspects One- One -Pass Compilers Pass Compilers 14

Creators of the finest hand painted wallpapers and fabrics, hand carved furniture and hand painted

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Raise your hand in Zoom Click on Participants Your hand is raised Click hand to lower it

Action recognition in videos Action recognition in videos Cordelia Schmid Cordelia Schmid

Action recognition in videos II Action recognition in videos II Cordelia Schmid INRIA Grenoble

Japanese waste paper trend Japanese waste paper trend High collection & reuse High

Rev 5:1, And I saw in the right hand of Him And I saw in the right hand of Him Rev 5:1,

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Toronto Bloorview Macmillan (TBM) Hand Multi-Fingered, Adaptive Grasp Prosthetic Hand: Better

Right Hand Coordinate System Right Hand Rule A rectangular or Cartesian coordinate system

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

Topological Features for Recognizing Printed and Handwritten Bangla Characters Soumen Bag,

Mode-Adaptive Neural Networks for Quadruped Motion Control He Zhang he.zhang@ed.ac.uk CGVU Group

Characters } The Java char primitive type } Represents a single character, and is given values

PHYLIP Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington PHYLIP

ICFHR 2010 Introductory words Lambert Schomaker International Workshop Conference on

Arabic Language Challenges Boring (promise) Walid Magdy This lecture is about This sentence is

Arabic Language Challenges Walid Magdy This lecture is not About Arabic language technologies

Character Polynomials Problem From Stanleys Positivity Problems in Algebraic

Recognition of Japanese Historical Hand- Written Characters Based - PowerPoint PPT Presentation

Recognition of Japanese Historical Hand- Written Characters Based on Object Detection Methods Yiping Tang, Kohei Hatano, Eiji Takimoto Kyushu university What is Kuzushiji? Definition(*) Kuzushiji is written with hand- written

List of hand outs for this session Hand out 1: Incident decision tree Hand out 2: Yorkshire

Japanese Layout Requirements Richard Ishida 1 Japanese Layout Requirements This presentation

Hand Hygiene Stefan Morton Hand Hygiene Coordinator Evidence Improved adherence to hand

Hand Hand- -Written Written aspects One- One -Pass Compilers Pass Compilers 14

Creators of the finest hand painted wallpapers and fabrics, hand carved furniture and hand painted

ROUNDERS (1998) CASINO ROYALE (2006) HAND RANKINGS HIGH CARD HAND RANKINGS PAIR HIGH CARD

Raise your hand in Zoom Click on Participants Your hand is raised Click hand to lower it

Action recognition in videos Action recognition in videos Cordelia Schmid Cordelia Schmid

Action recognition in videos II Action recognition in videos II Cordelia Schmid INRIA Grenoble

Japanese waste paper trend Japanese waste paper trend High collection &amp; reuse High

Rev 5:1, And I saw in the right hand of Him And I saw in the right hand of Him Rev 5:1,

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Toronto Bloorview Macmillan (TBM) Hand Multi-Fingered, Adaptive Grasp Prosthetic Hand: Better

Right Hand Coordinate System Right Hand Rule A rectangular or Cartesian coordinate system

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

Topological Features for Recognizing Printed and Handwritten Bangla Characters Soumen Bag,

Mode-Adaptive Neural Networks for Quadruped Motion Control He Zhang he.zhang@ed.ac.uk CGVU Group

Characters } The Java char primitive type } Represents a single character, and is given values

PHYLIP Joe Felsenstein Depts. of Genome Sciences and of Biology, University of Washington PHYLIP

ICFHR 2010 Introductory words Lambert Schomaker International Workshop Conference on

Arabic Language Challenges Boring (promise) Walid Magdy This lecture is about This sentence is

Arabic Language Challenges Walid Magdy This lecture is not About Arabic language technologies

Character Polynomials Problem From Stanleys Positivity Problems in Algebraic

Japanese waste paper trend Japanese waste paper trend High collection & reuse High