【社外秘】
YJTI at the NTCIR-13 STC Japanese Subtask
1
- Dec. 7, 2017
YJTI at the NTCIR-13 STC Japanese Subtask Dec. 7, 2017 Toru - - PowerPoint PPT Presentation
YJTI at the NTCIR-13 STC Japanese Subtask Dec. 7, 2017 Toru Shimizu 1 Overview 2 Retrieval or Generation Retrieval-based system Effective if you have a good matching model and enough
【社外秘】
1
【公開】
2
【公開】
– Hence more practical
– This can be mitigated with large amount of candidates and the variety in them. – 1.2M unique sentences in the training data
3
【公開】
4
query encoder document encoder
【公開】
5
Model Training Reply Text Preparation and Indexing Runtime ・Train two models:
・Preprocess the training data to
・Generate vector representations
・Build the reply index ・Produce actual reply lists using the runtie system
【公開】
6
【公開】
7
【公開】
8
Model training stage Reply text preparation and indexing stage Runtime stage
comment encoder model reply encoder model candidate replies ・data ・component query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies
【公開】
9
Model training stage Reply text preparation and indexing stage Runtime stage
comment encoder model reply encoder model candidate replies query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies ・data ・component
【公開】
10
Model training stage Reply text preparation and indexing stage Runtime stage
comment encoder model reply encoder model candidate replies query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies ・data ・component
【公開】
11
Model training stage Reply text preparation and indexing stage Runtime stage
comment encoder model reply encoder model candidate replies query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies
【公開】
12
Reply text preparation and indexing stage Runtime stage
reply encoder model candidate replies query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies
Model training stage
comment encoder model
【公開】
13
Model training stage Reply text preparation and indexing stage Runtime stage
comment encoder model reply encoder model candidate replies query (comment) comment vector retriever reply vectors reply encoder comment encoder top-200 replies ranker top-10 ranked replies
【公開】
14
【公開】
15
The final top-10 replies
The Theme is matched btw. the comment and a reply. (At most 3) The Genre is matched btw. the comment and a reply (At most 3) No metadata match. (No limitation of number)
【公開】
16
【公開】
17
【公開】
18
zQ zD
【公開】
19
【公開】
20
【公開】
21
【公開】
22
【公開】
23
【公開】
24