Bag-of-Visual-Ngrams for Histopathology Image Classification - - PowerPoint PPT Presentation

bag of visual ngrams for histopathology image
SMART_READER_LITE
LIVE PREVIEW

Bag-of-Visual-Ngrams for Histopathology Image Classification - - PowerPoint PPT Presentation

Bag-of-Visual-Ngrams for Histopathology Image Classification opez-Monroy, M.Sc. 1 A. Pastor L omez, Ph.D. 1 H. J. Escalante, Ph.D. 1 M. Montes-y-G A. Cruz-Roa, M.Sc. 2 alez, Ph.D. 2 F. A. Gonz November-2013 M exico LabTL, Computer


slide-1
SLIDE 1

Bag-of-Visual-Ngrams for Histopathology Image Classification

  • A. Pastor L´
  • pez-Monroy, M.Sc.1
  • M. Montes-y-G´
  • mez, Ph.D.1
  • H. J. Escalante, Ph.D.1
  • A. Cruz-Roa, M.Sc.2
  • F. A. Gonz´

alez, Ph.D.2

November-2013 M´ exico LabTL, Computer Science Department, Instituto Nacional de Astrof´ ısica, ´ Optica y Electr´

  • nica 1

MindLab, Computing Systems and Industrial Engineering Department, National University of Colombia 2 1 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-2
SLIDE 2

Instituto Nacional de Astrof´ ısica, ´ Optica y Electr´

  • nica.

Contents

Introduction Representing images through visual n-grams Evaluating visual words, visual n-grams and language models Conclusions

2 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-3
SLIDE 3

1.- Introduction

Introduction

The amount of digital images available is constantly growing. Image Classification (IC) is important for the organization and analysis of visual information (e.g., for automated medical diagnosis from visual imagery). In this work, we focus in the challenging problem of automatic classification of the histophatology images.

3 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-4
SLIDE 4

1.- Introduction

Introduction

Histopathological images have several interesting particularities: heterogeneous rich visual content, high intra-class variability and complex mixtures of non-localized patterns. Automatic classification of histopathology images is according to tissue structures (healthy or pathological) that can be recognized by visual inspection of an expert pathologist. The classification is related to pathological lesions, morphological and architectural features, which encompass a complex mixture of visual patterns that allow to decide about the illness presence. [Cruz-Roa et al., 2011].

4 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-5
SLIDE 5

1.- Introduction

Introduction

Example of histopathology images from skin biopsies with healthy (ephitelium) and pathological tissues (morpheaform basal-cell carcinoma), left and right respectively, used for basal-cell carcinoma diagnosis.

5 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-6
SLIDE 6

1.- Introduction

Image representation

One of the most used approaches is the Bag-of-Visual-Words (BoVW) formulation [Sivic and Zisserman., 2003]. BoVW representation is inspired in the bag-of-words (BoW) representation used in text classification and information retrieval. The idea of BoW is to represent a document by a numerical vector that indicates the presence/absence of words in a document. In computer vision tasks, a vocabulary of visual words is first generated and then images are represented by histograms that account for the occurrence of visual words in images.

6 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-7
SLIDE 7

1.- Introduction

General view of BoVW

7 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-8
SLIDE 8

1.- Introduction

Introduction

BoVW has an important shortcoming: it ignores spatial relationships among visual words. Spatial context has proven to be helpful for boosting the performance in diverse computer vision tasks.[Galleguillos and Belongie, 2010]. We propose a natural extension to the BoVW formulation: the Bag-of-Visual-ngrams (BoVN) In Text mining, n−grams are sequences of n words that allows to capture compound word-patterns, e.g., united-states, very-good, visual-words, etc.). We propose building codebooks of visual n − grams to capture visual patterns.

8 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-9
SLIDE 9

1.- Introduction

Main contributions

1

We introduce a different way to use of n-grams under the BoVW formulation for IC; where n-grams are used as attributes for a classification model.

2

We show that the BoVN can outperform the performance of the BoVW approach for the classification of histopathology images.

9 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-10
SLIDE 10

2.- Representing images through visual n-grams

General view of the proposed visual n-grams framework

10 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-11
SLIDE 11

2.- Representing images through visual n-grams

Step 1: Detailed process to build the Visual Word codebook

11 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-12
SLIDE 12

2.- Representing images through visual n-grams

Step 2: Representing images using the Visual Word codebook

(a) Original image. (b) Visual Words representation.

12 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-13
SLIDE 13

2.- Representing images through visual n-grams

Step 3: Building the Visual n-grams using a sliding window

For the dark path (65) the extracted n-grams are: 65-12, 65-213, 65-546, 65-645, 65-654, 65-565, 65-444, 65-33.

13 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-14
SLIDE 14

3.- Evaluating visual words, visual n-grams and language models

Description of the image dataset

Histopathology image distribution for each category. Histopathology image positives negatives

  • 1. basal-cell carcinoma

518 899

  • 2. collagen

1238 179

  • 3. epidermis

147 1270

  • 4. hair follicle

118 1299

  • 5. eccrine glands

126 1291

  • 6. sebaceous glands

136 1281

  • 7. inflammatory infiltrate

99 1318

14 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-15
SLIDE 15

3.- Evaluating visual words, visual n-grams and language models

Evaluation of the standard Visual Words (Unigrams)

Experiments using Visual Words (Unigrams) through two kinds of term weighting (TF and BIN) and two different size patches (8 and 16). Visual Words Config FM Bin-8 48.27 Bin-16 47.63 TF-8 58.59 TF-16 52.33

15 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-16
SLIDE 16

3.- Evaluating visual words, visual n-grams and language models

Finding a suitable number of Visual Bi- grams

Experiments using Bigrams to analyze the impact of dimensionality (F-measure). Frequency threshold Config 1.5K 2.5K 5K 7.5K 10K 1+2grams 63.73 64.31 64.03 62.24 61.63

16 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-17
SLIDE 17

3.- Evaluating visual words, visual n-grams and language models66

Visual Bigrams

Interesting bigrams for basal-cell carcinoma

17 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-18
SLIDE 18

3.- Evaluating visual words, visual n-grams and language models

Visual Words vs Visual Bigrams

Experiments using sequences of Visual Words (Uni-Bi-grams) through two kinds of term weighting (TF and BIN) and two different size patches (8 and 16). Unigrams vs Uni+Bigrams Config F-Measure 1grams 1+2grams Bin-8 48.27 59.50 Bin-16 47.63 56.67 TF-8 58.9 64.31 TF-16 52.33 56.09

18 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-19
SLIDE 19

3.- Evaluating visual words, visual n-grams and language models

The impact of sequence length for n-grams

Experiments using sequences of Visual Words (from Unigrams to Tetragrams). Experiments with n-grams Config FM 1grams 58.59 1+2grams 64.31 1+2+3grams 62.69 1+2+3+4grams 61.34

19 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-20
SLIDE 20

3.- Evaluating visual words, visual n-grams and language models

Detailed F-Measure by class

Detailed experiments per class using Visual Words (Unigrams) versus sequences of Visual Words (Uni-Bi-grams). Detailed F-Measure by class Class (a) 1grams (b) 1+2grams (b-a) gain/loss 1 86.10 90.70 4.6 2 94.80 95.50 0.7 3 74.40 83.40 9.0 4 36.80 50.80 14.0 5 35.80 52.50 16.7 6 48.00 43.60

  • 4.4

7 34.20 33.70

  • 0.5

20 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-21
SLIDE 21

3.- Evaluating visual words, visual n-grams and language models

Comparison with other typical approaches

Experiments using sequences of Visual Words (Uni-Bi-grams) compared with a LMC. LMC vs Uni+Bigrams Config F-Measure LMC 1+2grams TF-8 53.0 64.31 TF-16 48.31 56.09

21 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-22
SLIDE 22

4.- Conclusions

Conclusions

We proposed an extension to the standard BoVW to extract sequential visual patterns and use them as attributes for a classification model. An histopathology image collection was used to extract n-grams using inspired ideas from NLP, such visual n-grams have proven good performance for this task. Visual n-grams as attributes have proven to be useful versus the traditional BoVW approach and LMC. Future work includes applying the visual n-grams based-representation to other computer vision tasks that use BoVW as representation.

22 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-23
SLIDE 23

Thank you, ... Questions?

23 / 24 9th International Seminar on Medical Information Processing and Analysis

slide-24
SLIDE 24

References

References

Cruz-Roa, A., D´ ıaz, G., Romero, E., and Gonz´ alez, F. A. (2011).

Automatic annotation of histopathological images using a latent topic model based on non-negative matrix factorization. Journal of Pathology Informatics, 4.

Galleguillos, C. and Belongie, S. (2010).

Context based object categorization: A critical survey. Computer Vision and Image Understanding, 114:712–722.

Sivic, J. and Zisserman., A. (2003).

Video google: A text retrieval approach to object matching in videos. In Proceedings of the International Conference on Computer Vision, ICCV. 24 / 24 9th International Seminar on Medical Information Processing and Analysis