graph product
KDD 2018
OpenTag: Open Attribute Value Extraction From Product Profiles
Guineng Zheng*, Subhabrata MukherjeeΔ, Xin Luna DongΔ, FeiFei Li*
ΔAmazon.com, *University of Utah
1
OpenTag : Open Attribute Value Extraction From Product Profiles - - PowerPoint PPT Presentation
OpenTag : Open Attribute Value Extraction From Product Profiles Guineng Zheng*, Subhabrata Mukherjee , Xin Luna Dong , FeiFei Li* Amazon.com, *University of Utah product KDD 2018 graph 1 Motivation Alexa , what are the flavors of
graph product
KDD 2018
ΔAmazon.com, *University of Utah
1
graph product
KDD 2018
2
Alexa, what are the flavors of nescafe? Nescafe Coffee flavors include caramel, mocha, vanilla, coconut, cappuccino, original/regular, decaf, espresso, and cafe au lait decaf.
graph product
KDD 2018
3
graph product
KDD 2018
Open World Assumption
Limited semantics, irregular syntax
structure in titles and bullets
Food, Lamb Meal & Brown Rice Recipe
4
graph product
KDD 2018
5
Open World Assumption No Lexicon, No Hand-crafted Features Active Learning
Ghani et al. 2003, Putthividhya et al. 2011, Ling et al. 2012, Petrovski et
Huang et al. 2015, Kozareva et al. 2016 Kozareva et al. 2016, Lample et al. 2016, Ma et al. 2016 OpenTag (this work)
graph product
KDD 2018
6
graph product
KDD 2018
7
Input Product Profile Output Extractions
Title Description Bullets Flavor Brand
…
CESAR Canine Cuisine Variety Pack Filet Mignon & Porterhouse Steak Dog Food (Two 12-Count Cases) A Delectable Meaty Meal for a Small Canine Looking for the right food … This delicious dog treat contains tender slices of meat in gravy and is formulated to meet the nutritional needs of small dogs.
Flavor;
Flavor;
Cuisine provides complete and balanced nutrition … 1.filet mignon 2.porterhouse steak cesar canine cuisine
graph product
KDD 2018
w1 w2 w3 w4 w5
w6 beef meal & ranch raised lamb w7 recipe B I O E B I O E B I O E B I O E B I O E B I O E B I O E
t1 t2 t3 t4 t5 t6 t7 B I O E
Beginning of attribute value Inside of attribute value Outside of attribute value End of attribute value
x={w1,w2,…,wn} input sequence y={t1,t2,…,tn} tagging decision {beef meal} {ranch raise lamb} Flavor Extractions
8
O U T P U T I N P U T
graph product
KDD 2018
9
graph product
KDD 2018
10
graph product
KDD 2018
11
Map ‘beef’, ‘chicken’, ‘pork’ to nearby points in Flavor– embedding space
graph product
KDD 2018
12
Capture long and short range dependencies in input sequence via forward and backward hidden states
graph product
KDD 2018
13
between output tags
graph product
KDD 2018
14
graph product
KDD 2018
15
graph product
KDD 2018
16
graph product
KDD 2018
17
graph product
KDD 2018
18
graph product
KDD 2018
19
graph product
KDD 2018
20
graph product
KDD 2018
Distribution of word vectors before attention Distribution of word vectors after attention
21
graph product
KDD 2018
22
graph product
KDD 2018
23
graph product
KDD 2018
24
graph product
KDD 2018
25
sample with highest uncertainty for annotation
graph product
KDD 2018
26
duck , fillet mignon and ranch raised lamb flavor B O B E O B I E O B O B O O O O B O
Tag flips = 4
graph product
KDD 2018
TF v.v. LC on detergent data TF v.v. LC on multi extraction
27
graph product
KDD 2018
Learning from scratch on detergent data Learning from scratch on multi extraction
28
graph product
KDD 2018
Previous Coverage of Existing Production System (%) OpenTag Coverage (%) Increase in Coverage (%) Attribute_1 23 78 53 Attribute_2 21 72 45 Attribute_3 < 1 56 50 Attribute_4 < 1 49 48
29
graph product
KDD 2018
30
graph product
KDD 2018
31
graph product
KDD 2018
32
graph product
KDD 2018
33
graph product
KDD 2018
34
graph product
KDD 2018
ranch raised beef flavor
h1 h2 h3 h4 B I O E B I O E B I O E B I O E
Word Index Word Embedding
glove embedding 50
Forward LSTM
100 units
Backward LSTM
100 units
Hidden Vector
100+100=200 units
Cross Entropy Loss
35
graph product
KDD 2018
graph product
KDD 2018
ranch
raised
beef
flavor
h1 h2 h3 h4 B I O E B I O E B I O E B I O E
Word Index Embedding
glove embedding 50
Forward LSTM
100 units
Backward LSTM
100 units
Hidden Vector
100+100=200 units
Cross Entropy Loss
CRF Conditional Random Fields CRF feature space formed by Bi- LSTM hidden states
37
graph product
KDD 2018
38
l1 l2 l3 l4
graph product
KDD 2018
39
Maximize log-likelihood of joint distribution Best possible tag sequence with highest conditional probability
CRF feature space formed by attention-focused representation of hidden states
graph product
KDD 2018
40
graph product
KDD 2018
duck , fillet mignon and ranch raised lamb flavor B O B E O B I E O B O B O O O O B O Tag flips = 4
41
graph product
KDD 2018
42