Table Detection in Invoice Documents by Graph Neural Networks Pau - - PowerPoint PPT Presentation

table detection in invoice documents by graph neural
SMART_READER_LITE
LIVE PREVIEW

Table Detection in Invoice Documents by Graph Neural Networks Pau - - PowerPoint PPT Presentation

Table Detection in Invoice Documents by Graph Neural Networks Pau Riba , Anjan Dutta, Lutz Goldmann, Alicia Forn es, Oriol Ramos, Josep Llad os Computer Vision Center, omni:us ICDAR, Sydney, Australia, 23rd September, 2019 Introduction


slide-1
SLIDE 1

Table Detection in Invoice Documents by Graph Neural Networks

Pau Riba, Anjan Dutta, Lutz Goldmann, Alicia Forn´ es, Oriol Ramos, Josep Llad´

  • s

Computer Vision Center, omni:us ICDAR, Sydney, Australia, 23rd September, 2019

slide-2
SLIDE 2

Introduction Table Detection Framework Experimental Validation Conclusion

Outline

Introduction Table Detection Framework Graph Representation Network Architecture Experimental Validation Datasets and Statistics Node/Edge Classification Table Detection Conclusions and Future Work

2 Table Detection by GNN Riba et al.

slide-3
SLIDE 3

Introduction

3 Table Detection by GNN Riba et al.

slide-4
SLIDE 4

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Business Documents

◮ Information extraction: Finance, insurance, manufacturing... ◮ Manual extraction: Tedious and time consuming. ◮ Automatic extraction: Reduced time and improved quality.

4 Table Detection by GNN Riba et al.

slide-5
SLIDE 5

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Semi-Structured Documents

◮ Structured Documents: Existing methods, high accuracy. ◮ Unstructured Documents: Human assistance and validation. ◮ Semi-structured Documents:

◮ Without a fixed spatial layout. ◮ Sharing a common set of components. 5 Table Detection by GNN Riba et al.

slide-6
SLIDE 6

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Invoice Documents

◮ Semi-structured documents with flexible layouts ◮ Spatial arrangement roughly perceived as a tabular layout ◮ Tables are commonly used to condense information

6 Table Detection by GNN Riba et al.

slide-7
SLIDE 7

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Constraint

Industrial collaboration - Anonymized data

7 Table Detection by GNN Riba et al.

slide-8
SLIDE 8

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Objective

◮ Graph based representation: Exploit repetitive patterns

8 Table Detection by GNN Riba et al.

slide-9
SLIDE 9

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Objective

◮ Graph based representation: Exploit repetitive patterns ◮ Classification: GNN classification for nodes and edges

9 Table Detection by GNN Riba et al.

slide-10
SLIDE 10

Introduction Table Detection Framework Experimental Validation Conclusion

Introduction

Objective

◮ Graph based representation: Exploit repetitive patterns ◮ Classification: GNN classification for nodes and edges ◮ Table detection: Group nodes into table regions

10 Table Detection by GNN Riba et al.

slide-11
SLIDE 11

Table Detection Framework

11 Table Detection by GNN Riba et al.

slide-12
SLIDE 12

Introduction Table Detection Framework Experimental Validation Conclusion

Graph Representation

◮ Commercial OCR (by the industrial partner) ◮ Textual attributes (numeric, alphabet or symbol) ◮ Visibility graph:

◮ Nodes: Document regions ◮ Edges: Visibility relations (vertical and horizontal) 12 Table Detection by GNN Riba et al.

slide-13
SLIDE 13

Introduction Table Detection Framework Experimental Validation Conclusion

Graph Representation

◮ Commercial OCR (by the industrial partner) ◮ Textual attributes (numeric, alphabet or symbol) ◮ Visibility graph:

◮ Nodes: Document regions ◮ Edges: Visibility relations (vertical and horizontal) 13 Table Detection by GNN Riba et al.

slide-14
SLIDE 14

Introduction Table Detection Framework Experimental Validation Conclusion

Graph Neural Network

GNN layer

◮ Notation introduced in [2] ◮ A - Graph intrinsic linear

  • perators

◮ ρ - Activation function

(ReLU)

◮ θ - Learnable parameters

x(k+1) = GC(x(k)) = ρ  

B∈A(k)

Bx(k)θ(k)

B

 

[2] V. Garcia et. al., Few-shot learning with graph neural networks, in ICLR, 2018.

14 Table Detection by GNN Riba et al.

slide-15
SLIDE 15

Introduction Table Detection Framework Experimental Validation Conclusion

Graph Neural Network

Graph Adjacency Layer

◮ Importance of the neighbourhood connection ◮ MLP - MultiLayer Perceptron ◮ σ - Activation function (Sigmoid) ◮ Absolute difference provides the symmetry property

φk(B)i,j=

  • if Bi,j = 0

σ

  • MLP˜

θ

  • x(k)

i

− x(k)

j

  • therwise

15 Table Detection by GNN Riba et al.

slide-16
SLIDE 16

Introduction Table Detection Framework Experimental Validation Conclusion

GNN Architecture

Pipeline

16 Table Detection by GNN Riba et al.

slide-17
SLIDE 17

Introduction Table Detection Framework Experimental Validation Conclusion

GNN Architecture

Graph Residual Block

◮ Idea of ResNet [1] ◮ GNN layers with a skip

connection

◮ Edge weights are learned at

the beginning of the block

[1] K. He et. al., Deep residual learning for image recognition, in CVPR, 2016.

17 Table Detection by GNN Riba et al.

slide-18
SLIDE 18

Introduction Table Detection Framework Experimental Validation Conclusion

GNN Architecture

Objective functions

◮ Node classifier: Linear classifier with Softmax operation ◮ Edge classifier: Binary Cross entropy

◮ 0 - Edge connects two different regions ◮ 1 - Edge connects elements in the same region 18 Table Detection by GNN Riba et al.

slide-19
SLIDE 19

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Discard 0’ed edges ◮ Subgraphs with nodes classified as Table are considered ◮ The confidence score of these subgraphs are thresholded for

the final decision

19 Table Detection by GNN Riba et al.

slide-20
SLIDE 20

Experimental Validation

20 Table Detection by GNN Riba et al.

slide-21
SLIDE 21

Introduction Table Detection Framework Experimental Validation Conclusion

Datasets

CON-ANONYM

◮ 960 documents ◮ 8 region annotation ◮ Common car invoices ◮ Not publicly available

RVL-CDIP

◮ Overall 25,000 images ◮ 5 region annotation ◮ Selected 518 invoice class ◮ Publicly available 1

1 https://zenodo.org/record/3257319

[3] A. W. Harley et. al., Evaluation of deep convolutional nets for document image classification and retrieval, in ICDAR, 2015.

21 Table Detection by GNN Riba et al.

slide-22
SLIDE 22

Introduction Table Detection Framework Experimental Validation Conclusion

Datasets

CON-ANONYM RVL-CDIP Total # documents (tr, va, te) 950 (665, 95, 195) 518 (362, 52, 104) Total # pages 1252 518 Total # tables 1202 485 Total # classes 8 6

  • Avg. # nodes/page

245.50 124.03

  • Avg. # edges/page

1354.81 619.55

22 Table Detection by GNN Riba et al.

slide-23
SLIDE 23

Introduction Table Detection Framework Experimental Validation Conclusion

Node/Edge Classification

Task CON-ANONYM RVL-CDIP All Table Edge All Table Edge Pow 2 82.8 96.4 − 57.8 80.9 − + Edge 84.2 97.0 93.4 58.2 79.1 84.1 Pow 5 82.7 96.2 − 56.5 82.3 − + Edge 84.5 97.2 93.4 62.3 83.9 84.0

23 Table Detection by GNN Riba et al.

slide-24
SLIDE 24

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Intersection over Union

Task CON-ANONYM RVL-CDIP F1-Score Precision Recall F1-Score Precision Recall Pow 2 69.4 65.8 73.4 28.6 23.9 35.4 + Edge 70.8 65.2 77.6 30.8 26.7 36.5 Pow 5 68.4 65.3 71.8 22.6 20.0 26.0 + Edge 73.7 78.4 69.5 30.8 25.2 39.6

24 Table Detection by GNN Riba et al.

slide-25
SLIDE 25

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Proper detection

25 Table Detection by GNN Riba et al.

slide-26
SLIDE 26

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Proper detection

26 Table Detection by GNN Riba et al.

slide-27
SLIDE 27

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Proper detection ◮ Preprocessing problems

27 Table Detection by GNN Riba et al.

slide-28
SLIDE 28

Introduction Table Detection Framework Experimental Validation Conclusion

Table Detection

◮ Proper detection ◮ Preprocessing problems ◮ Tabular layout

28 Table Detection by GNN Riba et al.

slide-29
SLIDE 29

Conclusions and Future Work

29 Table Detection by GNN Riba et al.

slide-30
SLIDE 30

Introduction Table Detection Framework Experimental Validation Conclusion

Conclusions and Future Work

◮ First Table Detection based on structural information. ◮ A Graph models the underlying structure of the document. ◮ Publicly available RVL-CDIP invoice dataset. ◮ Deal with anonymized data. ◮ Generalize to unconstrained tabular layout.

30 Table Detection by GNN Riba et al.

slide-31
SLIDE 31

Thank you for your attention!

Pau Riba Computer Vision Center priba@cvc.uab.cat http://www.cvc.uab.cat/people/priba/