Few-Shot Learnin ing For Text xt Cla lassif ificatio ion
Master’s Thesis by Shaour Haider First Referee : Prof. Dr. Benno Stein Second Referee : Prof. Dr. Volker Rodehorst
1
For Text xt Cla lassif ificatio ion Masters Thesis by Shaour - - PowerPoint PPT Presentation
Few-Shot Learnin ing For Text xt Cla lassif ificatio ion Masters Thesis by Shaour Haider First Referee : Prof. Dr. Benno Stein Second Referee : Prof. Dr. Volker Rodehorst 1 Overview Introduction Approaches And Results
Master’s Thesis by Shaour Haider First Referee : Prof. Dr. Benno Stein Second Referee : Prof. Dr. Volker Rodehorst
1
2
3
Introduction
4
Image: Sentiment Analysis Image: Spam Detection Image: Topic Classification
Introduction
Few-shot learning aims to learning a classifier with limited amount of labeled examples (<10)
5
Video: Video provides a powerful way to help you prove your point. Document: You can also type a keyword to search online for the video that
best fits your document.
Themes: Themes and styles also help keep your document coordinated. Design: When you click Design and choose a new Theme, the pictures, charts,
and SmartArt graphics change to match your new theme. When you apply styles, your headings change to match the new theme. Save time in Word with new buttons that show up where you need them. To change the way a picture fits in your document, click it and a button for layout
When you work on a table, click where you want to add a row or a column, and then click the plus sign.
Train Set 4-way 1-shot task Test Set Introduction
6
t (f (few-shot tr training set) t)
est t Set t ( ( tes estin ting set) t)
itional dataset t th that t is is dis isjoint to
train in and tes est t set t of
Introduction
7
Target Training Data (Few) Feature Extraction Classifier Target Loss & Update Target Testing Data Feature Extraction Classifier Target Accuracy Assessment
Fixed Weights
Approaches And Results
K=1 K=3 K=9 BOW 0.48 0.58 0.74
8 0.48 0.58 0.74 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 K=1 K=3 K=9 BOW
Approaches And Results
9
Football is a family of team sports that involve, to varying degrees, kicking a ball to score a goal.
Sports
[0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 1 0 0 0 0 1 0 1 1 0 0 1 0 0 1 0 0 0 0 0 0 1 1 1 1 0 0 2 0 1 0 0 0]
While football continued to be played in various forms throughout Britain, its public schools (equivalent to private schools in other countries) are widely credited with four key achievements in the creation of modern football codes.
[0 1 0 0 1 0 0 0 1 0 1 0 0 1 1 1 1 1 0 0 1 0 0 2 1 1 0 0 0 3 0 0 1 1 0 0 1 1 0 1 1 1 1 2 0 0 0 0 1 1 2 1 0 1 1 1]
Baseball evolved from older bat-and-ball games already being played in England by the mid-18th century.
[1 0 1 1 0 1 1 1 0 1 0 1 1 0 0 0 0 0 0 1 0 1 0 0 0 0 1 1 0 1 0 0 0 0 0 1 0 0 1 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0] [0 1 0 0 1 0 0 0 1 0 1 0 0 1 1 1 1 1 0 0 1 0 0 2 1 1 0 0 0 3 0 0 1 1 0 0 1 1 0 1 1 1 1 2 0 0 0 0 1 1 2 1 0 1 1 1] Approaches And Results
10
Target Training Data (Few) Feature Extraction Model Target Loss & Update Target Testing Data Feature Extraction Classifier Target Accuracy Assessment
Fixed Weights
Approaches And Results
K=1 K=3 K=9 BOW 0.48 0.58 0.74 FastText 0.66 (+ 0.18) 0.78 (+ 0.20) 0.84 (+ 0.10) Bert 0.73 (+ 0.25) 0.84 (+ 0.26) 0.89 (+ 0.15)
11 0.48 0.58 0.74 0.66 0.78 0.84 0.73 0.84 0.89 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 K=1 K=3 K=9 BOW FastText Bert
Approaches And Results
12
Image: Transfer Learning
Approaches And Results
13
Target Training Data (Few) Feature Extraction Classifier Target Loss & Update Target Testing Data Feature Extraction Classifier Target Accuracy Assessment
Fixed Weights
Target Dataset
Pre-Training Data (Many) Feature Extraction Model Pre-Training Loss & Update Model Pre-Training
Base Dataset
Model Pre-Training
Fixed Weights Fixed Weights
Bag of words
Approaches And Results
14
Approaches And Results
15
BOW Transfer Learning (Standard) K=1 K=3 K=9 BOW 0.49 (+ 0.01) 0.52 (- 0.06) 0.71 (- 0.03)
0.49 0.52 0.71 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 K=1 K=3 K=9 BOW
Approaches And Results
16
Target Training Data (Few) Feature Extraction Classifier Target Loss & Update Target Testing Data Feature Extraction Classifier Target Accuracy Assessment
Fixed Weights
Target Dataset
Pre-Training Data (Many) Feature Extraction Model Pre-Training Loss & Update Model Pre-Training
Base Dataset
Model Pre-Training
Fixed Weights Fixed Weights
Pretrained FastText & Bert Model
Approaches And Results
17
BOW
0.48 0.58 0.74 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 K=1 K=3 K=9 BOW
FastText And Bert
0.62 0.75 0.81 0.73 0.84 0.88 0.2 0.4 0.6 0.8 1 K=1 K=3 K=9 FastText Bert
Transfer Learning (Standard) K=1 K=3 K=9 BOW 0.49 (+ 0.01) 0.52 (- 0.06) 0.71 (- 0.03) FastText 0.62 (- 0.04) 0.75 (- 0.03) 0.81 (- 0.03) Bert 0.73 ( 0.00) 0.84 ( 0.00) 0.88 ( -0.01) Approaches And Results
18
Approaches And Results
19
Transfer Learning (Modified) K=1 K=3 K=9 BOW 0.68 (+ 0.20) 0.75 (+ 0.17) 0.84 (+ 0.10) FastText 0.69 (+ 0.03) 0.78 ( 0.00) 0.83 (- 0.01) Bert 0.73 ( 0.00) 0.81 (- 0.03) 0.87 (- 0.02)
0.68 0.75 0.84 0.69 0.78 0.83 0.73 0.81 0.87 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 K=1 K=3 K=9 BOW FastText Bert
Approaches And Results
0.48 0.58 0.74 0.66 0.78 0.84 0.73 0.84 0.89 0.49 0.52 0.71 0.62 0.75 0.81 0.73 0.84 0.88 0.68 0.75 0.84 0.69 0.78 0.83 0.73 0.81 0.87 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 K=1 K=3 K=9 BOW- Baseline FastText- Baseline Bert- Baseline BOW- Standard Transfer Leaning FastText- Standard Transfer Leaning Bert- Standard Transfer Leaning BOW- Modified Transfer Learning FastText- Modified Transfer Learning Bert- Modified Transfer Learning 20
Approaches And Results
21
Approaches And Results
22
Related Work
Relation Network Advances in few-shot learning Siamese
Related Work
MAML
Related Work
25
Future Work
26
42
43
44
Neural Network Neural Network Input 1 Input 2 Distance Metric
Support Set Instances Query Set Instance
45
46
47