simple and effective retrieve edit rerank text generation
play

Simple and Effective Retrieve-Edit-Rerank Text Generation Nabil - PowerPoint PPT Presentation

Simple and Effective Retrieve-Edit-Rerank Text Generation Nabil Hossain Marjan Ghazvininejad Luke Zettlemoyer Facebook AI Research Facebook AI Research University of Rochester nhossain@cs.rochester.edu Overview Retrieve-and-edit


  1. Simple and Effective Retrieve-Edit-Rerank Text Generation Nabil Hossain Marjan Ghazvininejad Luke Zettlemoyer Facebook AI Research Facebook AI Research University of Rochester nhossain@cs.rochester.edu

  2. Overview • Retrieve-and-edit • Generate text using retrieved examples from training set • Uses: Summarization, Machine Translation, Conversation Generation • We apply post-generation ranking • Retrieve N examples, generate a candidate output with each • Rank these candidates • Post-ranking improves results on: • 2 Machine Translation tasks • Gigaword Summarization task

  3. Retrieve (Gigaword) ( x ′ � , y ′ � ) Training Retrieve • 1st sentence of news article (x) -> title (y) Set • { y ′ � 1 , y ′ � 2 , y ′ � 3 } Retrieval: given x, find closest x', then obtain its title y' x Augmented Input • LUCENE (TF-IDF based) x Test [SEP] y ′ � x 1 Data [SEP] y ′ � x 2 x [SEP] y ′ � 3 • Examples: Article (x) Best retrieved (y') Title (y) factory orders for manufactured goods rose #.# u.s. factory orders us september percent in september , the commerce rises #.# percent in factory orders up department said here thursday . october #.# percent france , still high after their convincing ##-## win france poised to french keep same over new zealand have named the same team make history in #nd team for #nd test for the second test next saturday in paris . test

  4. ̂ ̂ ̂ ̂ Edit (Generate) ( x ′ � , y ′ � ) Training Module 1 Module 2 Retrieve Generate Set { y ′ � 1 , y ′ � 2 , y ′ � 3 } x Candidate Outputs Augmented Input x [SEP] y ′ � x [SEP] y ′ � y 1 x 1 Test 1 [SEP] y ′ � x x [SEP] y ′ � Data y 2 2 2 x [SEP] y ′ � x [SEP] y ′ � y 3 3 3 • For each augmented input � [SEP] � , generate � y ′ � x y i i

  5. ̂ ̂ ̂ ̂ Edit (Generate) ( x ′ � , y ′ � ) Training Module 1 Module 2 Retrieve Generate Set { y ′ � 1 , y ′ � 2 , y ′ � 3 } x Candidate Outputs Augmented Input x [SEP] y ′ � x [SEP] y ′ � y 1 x 1 Test 1 [SEP] y ′ � x x [SEP] y ′ � Data y 2 2 2 x [SEP] y ′ � x [SEP] y ′ � y 3 3 3 Article (x) Best retrieved (y') Title (y) factory orders for manufactured goods rose #.# percent in u.s. factory orders rises us september factory september , the commerce department said here thursday . #.# percent in october orders up #.# percent y 1 [SEP] y ′ � x factory orders rises #.# 1 percent in september

  6. ̂ ̂ ̂ ̂ ̂ ̂ Post-gen Rerank ( x ′ � , y ′ � ) Training Module 1 Module 2 Module 3 Retrieve Generate Set Post-Gen Rerank { y ′ � 1 , y ′ � 2 , y ′ � 3 } x Candidate Outputs Ranked Outputs Augmented Input x [SEP] y ′ � x [SEP] y ′ � y 2 y 1 x 1 Test 1 [SEP] y ′ � x x [SEP] y ′ � y 3 Data y 2 2 2 x [SEP] y ′ � x y 1 [SEP] y ′ � y 3 3 3 • Given: • Estimate:

  7. ̂ ̂ ̂ ̂ ̂ ̂ ̂ ̂ ̂ Post-gen Rerank ( x ′ � , y ′ � ) Training Module 1 Module 2 Module 3 Retrieve Generate Set Post-Gen Rerank { y ′ � 1 , y ′ � 2 , y ′ � 3 } x Candidate Outputs Ranked Outputs Augmented Input x [SEP] y ′ � x [SEP] y ′ � y 2 y 1 x 1 Test 1 [SEP] y ′ � x x [SEP] y ′ � y 3 Data y 2 2 2 x [SEP] y ′ � x y 1 [SEP] y ′ � y 3 3 3 Article (x) Best retrieved (y') Title (y) factory orders for manufactured goods rose #.# percent in u.s. factory orders rises us september factory september , the commerce department said here thursday . #.# percent in october orders up #.# percent y 2 y 1 y 3 factory orders rises #.# us september factory factory orders for good rose percent in september orders rose #.# percent #.# percent in september

  8. Model • BPE • Transformer base • Segment Embeddings • A [RANK] token similar to [CLS] token in BERT • to estimate salience of the retrieved � y ′ � • Generate with beam = 5 [RANK]

  9. Machine Translation • Data : EN-NL (Dutch) and EN-HU (Hungarian), from EU meetings • Current SOTA is NFR: Retrieval-based LSTM model • Uses SetSimilaritySearch for retrieval (retrieves top 3) • Our ranker: Select highest scored output from the trained MT model BLEU • Post-generation ranking amounting to extended beam search Bulté, Bram, and Arda Tezcan. "Neural Fuzzy Repair: Integrating Fuzzy Matches into Neural Machine Translation." In ACL 2019.

  10. Machine Translation • Data : EN-NL (Dutch) and EN-HU (Hungarian), from EU meetings • Current SOTA is NFR: Retrieval-based LSTM model • Uses SetSimilaritySearch for retrieval (retrieves top 3) • Our ranker: Select highest scored output from the trained MT model BLEU • Post-generation ranking amounting to extended beam search Bulté, Bram, and Arda Tezcan. "Neural Fuzzy Repair: Integrating Fuzzy Matches into Neural Machine Translation." In ACL 2019.

  11. Gigaword Summarization • Metric: Rouge F-scores • Re 3 Sum model: LSTM, retrieve-and-edit, pre-ranking • uses 30 retrieved examples • Our ranker: select the most frequent of the 30 candidate outputs Method Rouge-1 Rouge-2 Rouge-LCS LSTM 35.01 16.55 32.42 Re 3 Sum 37.04 19.03 34.46 Transformer (Tr) 37.68 18.79 34.87 x Tr + Lucene + [SEP] y ′ � 37.51 19.15 34.86 1 Tr + Lucene + pre-rank 36.46 18.01 33.85 38.23 19.58 35.60 Tr + Luc + post-rank BiSET 39.11 19.78 36.87 Cao, Ziqiang, et al. "Retrieve, rerank and rewrite: Soft template based neural summarization." In ACL. 2018.

  12. ̂ Gigaword oracle experiments • Room for improvement with better post-ranking • use x, x ’ , y ’ , for re-ranking y

  13. ̂ Gigaword oracle experiments • Room for improvement with better post-ranking • use x, x ’ , y ’ , in post-ranking y

  14. ̂ ̂ Summary • We extended the retrieve-and-edit framework with post-generation ranking: 1. Retrieve N training set outputs y’ for input x 2. Edit each input x[SEP]y’ to produce N candidate outputs � . y 3. Re-rank � to select best ranked output y • Simple post-ranking improved results on MT and summarization • Interesting to explore better post-ranking using x, x’, y’, yhat Questions: nhossain@cs.rochester.edu

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend