diamonds in the rough generating fluent sentences from
play

Diamonds in the Rough: Generating Fluent Sentences from Early-stage - PowerPoint PPT Presentation

Diamonds in the Rough: Generating Fluent Sentences from Early-stage Drafts for Academic Writing Assistance Takumi Ito 1,2 , Tatsuki Kuribayashi 1,2 , Hayato Kobayashi 3,4 , Ana Brassard 4,1 , Masato Hagiwara 5 , Jun Suzuki 1,4 and Kentaro Inui 1,4


  1. Diamonds in the Rough: Generating Fluent Sentences from Early-stage Drafts for Academic Writing Assistance Takumi Ito 1,2 , Tatsuki Kuribayashi 1,2 , Hayato Kobayashi 3,4 , Ana Brassard 4,1 , Masato Hagiwara 5 , Jun Suzuki 1,4 and Kentaro Inui 1,4 1: Tohoku University, 2: Langsmith Inc., 3: Yahoo Japan Corporation, 4: RIKEN, 5: Octanove Labs LLC

  2. 2 The writing process FIRST DRAFT: “Model have good results.” Revising “Our model show “Our model shows a excellent perfomance good result in this task.” in this task.” Editing “Our model shows “Our model shows a excellent perfomance good results in this in this task.” task.” Proofreading “Our model shows excellent performance in this task.” “Our model shows excellent FINAL VERSION: performance in this task.” 2019/10/29 INLG2019

  3. 3 Automatic writing assistance FIRST DRAFT: “Model have good results.” insufficient fluidity • awkward style • Revising “Our model show “Our model shows collocation errors • a excellent perfomance good result missing words • in this task.” in this task.” Editing “Our model shows “Our model shows a excellent perfomance good results in this in this task.” grammatical errors task.” • spelling errors • Proofreading “Our model shows excellent performance in this task.” “Our model shows excellent FINAL VERSION: performance in this task.” 2019/10/29 INLG2019

  4. 4 Automatic writing assistance FIRST DRAFT: “Model have good results.” ✗ insufficient fluidity ✗ awkward style Revising “Our model show “Our model shows ✗ collocation errors a excellent perfomance good result ✗ missing words in this task.” in this task.” EXISTING STUDIES Editing “Our model shows “Our model shows ✓ grammatical errors a excellent perfomance good results in this in this task.” task.” ✓ spelling errors Proofreading “Our model shows excellent Grammatical error performance in this task.” correction (GEC) “Our model shows excellent FINAL VERSION: performance in this task.” 2019/10/29 INLG2019

  5. 5 Automatic writing assistance FIRST DRAFT: “Model have good results.” ✓ insufficient fluidity ✓ awkward style OUR FOCUS ✓ collocation errors Revising “Our model show “Our model shows ✓ missing words a excellent perfomance good result in this task.” in this task.” Editing “Our model shows “Our model shows ✓ grammatical errors a excellent perfomance good results in this in this task.” task.” ✓ spelling errors Proofreading “Our model shows excellent Grammatical error performance in this task.” correction (GEC) Sentence-level revision “Our model shows excellent FINAL VERSION: (SentRev) performance in this task.” 2019/10/29 INLG2019

  6. 6 Proposed Task: Sentence-level Revision The idea of our approach derives Our aproach idea is <*> at read from the normal human reading patern of normal human. pattern. revising, editing, draft final version proofreading l input: early-stage draft sentence - has errors (e.g., collocation errors) - has Information gaps (denoted by <*> ) l output: final version sentence - error-free - correctly filled-in sentence 2019/10/29 INLG2019

  7. 7 Proposed Task: Sentence-level Revision The idea of our approach derives Our aproach idea is <*> at read from the normal human reading patern of normal human. pattern. revising, editing, draft final version proofreading l input: early-stage draft sentence - has errors (e.g., collocation errors) - has Information gaps (denoted by <*> ) l output: final version sentence - error-free - correctly filled-in sentence l issue: lack of evaluation resource 2019/10/29 INLG2019

  8. 8 Our contributions The idea of our approach derives Our aproach idea is <*> at read from the normal human reading patern of normal human. pattern. revising, editing, draft final version proofreading l Created an evaluation dataset for SentRev - Set of Modified Incomplete TecHnical paper sentences (SMITH) l Analyzed the characteristics of the dataset l Established baseline scores for SentRev 2019/10/29 INLG2019

  9. 9 Evaluation Dataset Creation Goal : collect pairs of draft sentence and final version Our model shows Our model <*> results competitive results draft final 2019/10/29 INLG2019

  10. 10 Evaluation Dataset Creation Goal : collect pairs of draft sentence and final version Our model shows Our model <*> results competitive results Straight-forward approach ︓ Experts modify collected drafts to final version drafts final version Note: limitation: We can access plenty of early-stage draft sentences are final version sentences not usually publicly available 2019/10/29 INLG2019

  11. 11 Evaluation Dataset Creation Goal : collect pairs of draft sentence and final version Our model shows Our model <*> results competitive results Straight-forward approach ︓ Experts modify collected drafts to final version drafts final version Our approach : create draft sentences from final version sentences 2019/10/29 INLG2019

  12. 12 Crowdsourcing Protocol for Creating an Evaluation Dataset Our approach : create draft sentences from final version sentences ACL Anthology drafts final version Our model shows Our model <*> competitive results 私達のモデルは results 匹敵する結果を ⽰しました。 1.automatically translate 2. Japanese native workers the final sentence into translate into English Japanese 2019/10/29 INLG2019

  13. 13 Crowdsourcing Protocol for Creating an Evaluation Dataset Our approach : create draft sentences from final version sentences insert <*> where workers ACL could not think of a good Anthology expression drafts final version Our model shows Our model <*> competitive results 私達のモデルは results 匹敵する結果を ⽰しました。 1.automatically translate 2. Japanese native workers the final sentence into translate into English Japanese 2019/10/29 INLG2019

  14. 14 Statistics Levenshtein Dataset size w/<*> w/change distance Lang-8 2.1M - 42% 3.5 AESW 1.2M - 39% 4.8 JFLEG 1.5K - 86% 12.4 SMITH 10K 33% 99% 47.0 w/<*>: percentage of source sentences with <*> w/change: percentage where the source and target sentences differ l collected 10,804 pairs l SMITH simulates significant editing l Larger Levenshtein distance ⇨ more drastic editing 2019/10/29 INLG2019

  15. 15 Examples of SMITH draft: I research the rate of workable SQL <*> at the generated result. final: We study the percentage of executable SQL queries in the generated results. draft: For <*>, we used Adam using weight decay and gradient clipping . final: We used Adam with a weight decay and gradient clipping for optimization. draft: In the model aechitecture, as shown in Figure 1 , it is based an AE and GAN. final: The model architecture, as illustrated in figure 1 , is based on the AE and GAN. 2019/10/29 INLG2019

  16. 16 Examples of SMITH (1) Wording problems draft: I research the rate of workable SQL <*> at the generated result. final: We study the percentage of executable SQL queries in the generated results. draft: For <*>, we used Adam using weight decay and gradient clipping . final: We used Adam with a weight decay and gradient clipping for optimization. draft: In the model aechitecture, as shown in Figure 1 , it is based an AE and GAN. final: The model architecture, as illustrated in figure 1 , is based on the AE and GAN. 2019/10/29 INLG2019

  17. 17 Examples of SMITH (1) Wording problems draft: I research the rate of workable SQL <*> at the generated result. final: We study the percentage of executable SQL queries in the generated results. draft: For <*>, we used Adam using weight decay and gradient clipping. final: We used Adam with a weight decay and gradient clipping for optimization. draft: In the model aechitecture, as shown in Figure 1 , it is based an AE and GAN. final: The model architecture, as illustrated in figure 1 , is based on the AE and GAN. 2019/10/29 INLG2019

  18. 18 Examples of SMITH (2) Information gaps draft: I research the rate of workable SQL <*> at the generated result. final: We study the percentage of executable SQL queries in the generated results. draft: For <*>, we used Adam using weight decay and gradient clipping . final: We used Adam with a weight decay and gradient clipping for optimization. draft: In the model aechitecture, as shown in Figure 1 , it is based an AE and GAN. final: The model architecture, as illustrated in figure 1 , is based on the AE and GAN. 2019/10/29 INLG2019

  19. 19 Examples of SMITH (2) Information gaps draft: I research the rate of workable SQL <*> at the generated result. final: We study the percentage of executable SQL queries in the generated results. draft: For <*>, we used Adam using weight decay and gradient clipping. final: We used Adam with a weight decay and gradient clipping for optimization. draft: In the model aechitecture, as shown in Figure 1 , it is based an AE and GAN. final: The model architecture, as illustrated in figure 1 , is based on the AE and GAN. 2019/10/29 INLG2019

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend