A Class of Submodular Functions for Document Summarization Hui Lin, - PowerPoint PPT Presentation

A Class of Submodular Functions for Document Summarization Hui Lin, Jeff Bilmes University of Washington, Seattle Dept. of Electrical Engineering June 20, 2011 Lin and Bilmes Submodular Summarization June 20, 2011 1 / 29

Extractive Document Summarization The figure below represents the sentences of a document Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Extractive Document Summarization We extract sentences (green) as a summary of the full document Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Extractive Document Summarization We extract sentences (green) as a summary of the full document ⊂ The summary on the left is a subset of the summary on the right. Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Extractive Document Summarization We extract sentences (green) as a summary of the full document ⊂ The summary on the left is a subset of the summary on the right. Consider adding a new (blue) sentence to each of the two summaries. Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Extractive Document Summarization We extract sentences (green) as a summary of the full document ⊂ The summary on the left is a subset of the summary on the right. Consider adding a new (blue) sentence to each of the two summaries. The marginal (incremental) benefit of adding the new (blue) sentence to the smaller (left) summary is no more than the marginal benefit of adding the new sentence to the larger (right) summary. Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Extractive Document Summarization We extract sentences (green) as a summary of the full document ⊂ The summary on the left is a subset of the summary on the right. Consider adding a new (blue) sentence to each of the two summaries. The marginal (incremental) benefit of adding the new (blue) sentence to the smaller (left) summary is no more than the marginal benefit of adding the new sentence to the larger (right) summary. diminishing returns ↔ submodularity Lin and Bilmes Submodular Summarization June 20, 2011 2 / 29

Background on Submodularity Outline Background on Submodularity 1 Problem Setup and Algorithm 2 Submodularity in Summarization 3 New Class of Submodular Functions for Document Summarization 4 Experimental Results 5 Summary 6 Lin and Bilmes Submodular Summarization June 20, 2011 3 / 29

Background on Submodularity Submodular Set Functions There is a finite sized “ground set” of elements V We use set functions of the form f : 2 V → R A set function f is monotone nondecreasing if ∀ R ⊆ S , f ( R ) ≤ f ( S ). Definition of Submodular Functions For any R ⊆ S ⊆ V and k ∈ V , k / ∈ S , f ( · ) is submodular if S R f ( S + k ) − f ( S ) ≤ f ( R + k ) − f ( R ) This is known as the principle of diminishing returns Lin and Bilmes Submodular Summarization June 20, 2011 4 / 29

Background on Submodularity Example: Number of Colors of Balls in Urns f ( R ) = f ( ) = 3 f ( S ) = f ( ) = 4 Given a set A of colored balls f ( A ): the number of distinct colors contained in the urn The incremental value of an object only diminishes in a larger context (diminishing returns). Lin and Bilmes Submodular Summarization June 20, 2011 5 / 29

Background on Submodularity Example: Number of Colors of Balls in Urns f ( R ) = f ( ) = 3 f ( S ) = f ( ) = 4 f ( R + k ) = f ( + ) = 4 f ( S + k ) = f ( + ) = 4 Given a set A of colored balls f ( A ): the number of distinct colors contained in the urn The incremental value of an object only diminishes in a larger context (diminishing returns). Lin and Bilmes Submodular Summarization June 20, 2011 5 / 29

Background on Submodularity Why is submodularity attractive? Lin and Bilmes Submodular Summarization June 20, 2011 6 / 29

Background on Submodularity Why is submodularity attractive? Why is convexity attractive? How about submodularity: Lin and Bilmes Submodular Summarization June 20, 2011 6 / 29

Background on Submodularity Why is submodularity attractive? Why is convexity attractive? convexity appears in many mathematical models in economy, engineering and other sciences. minimum can be found efficiently. convexity has many nice properties, e.g. convexity is preserved under many natural operations and transformations. How about submodularity: Lin and Bilmes Submodular Summarization June 20, 2011 6 / 29

Background on Submodularity Why is submodularity attractive? Why is convexity attractive? convexity appears in many mathematical models in economy, engineering and other sciences. minimum can be found efficiently. convexity has many nice properties, e.g. convexity is preserved under many natural operations and transformations. How about submodularity: submodularity arises in many areas: combinatorics, economics, game theory, operation research, machine learning, and (now) natural language processing. minimum can be found in polynomial time submodularity has many nice properties, e.g. submodularity is preserved under many natural operations and transformations (e.g. scaling, addition, convolution, etc.) Lin and Bilmes Submodular Summarization June 20, 2011 6 / 29

Problem Setup and Algorithm Outline Background on Submodularity 1 Problem Setup and Algorithm 2 Submodularity in Summarization 3 New Class of Submodular Functions for Document Summarization 4 Experimental Results 5 Summary 6 Lin and Bilmes Submodular Summarization June 20, 2011 7 / 29

Problem Setup and Algorithm Problem setup The ground set V corresponds to all the sentences in a document. Extractive document summarization: select a small subset S ⊆ V that accurately represents the entirety (ground set V ). Lin and Bilmes Submodular Summarization June 20, 2011 8 / 29

Problem Setup and Algorithm Problem setup The ground set V corresponds to all the sentences in a document. Extractive document summarization: select a small subset S ⊆ V that accurately represents the entirety (ground set V ). The summary is usually required to be length-limited. c i : cost (e.g., the number of words in sentence i ), b : the budget (e.g., the largest length allowed), knapsack constraint: � i ∈ S c i ≤ b . Lin and Bilmes Submodular Summarization June 20, 2011 8 / 29

Problem Setup and Algorithm Problem setup The ground set V corresponds to all the sentences in a document. Extractive document summarization: select a small subset S ⊆ V that accurately represents the entirety (ground set V ). The summary is usually required to be length-limited. c i : cost (e.g., the number of words in sentence i ), b : the budget (e.g., the largest length allowed), knapsack constraint: � i ∈ S c i ≤ b . A set function f : 2 V → R measures the quality of the summary S , Thus, the summarization problem is formalized as: Problem (Document Summarization Optimization Problem) S ∗ ∈ argmax � f ( S ) subject to: c i ≤ b . (1) S ⊆ V i ∈ S Lin and Bilmes Submodular Summarization June 20, 2011 8 / 29

Problem Setup and Algorithm A Practical Algorithm for Large-Scale Summarization When f is both monotone and submodular : A greedy algorithm with partial enumeration (Sviridenko, 2004), theoretical guarantee of near-optimal solution, but not practical for large data sets. Lin and Bilmes Submodular Summarization June 20, 2011 9 / 29

Problem Setup and Algorithm A Practical Algorithm for Large-Scale Summarization When f is both monotone and submodular : A greedy algorithm with partial enumeration (Sviridenko, 2004), theoretical guarantee of near-optimal solution, but not practical for large data sets. A greedy algorithm (Lin and Bilmes, 2010): near-optimal with theoretical guarantee, and practical/scalable! We choose next element with largest ratio of gain over scaled cost: f ( G ∪ { i } ) − f ( G ) k ← argmax . (2) ( c i ) r i ∈ U Lin and Bilmes Submodular Summarization June 20, 2011 9 / 29

Problem Setup and Algorithm A Practical Algorithm for Large-Scale Summarization When f is both monotone and submodular : A greedy algorithm with partial enumeration (Sviridenko, 2004), theoretical guarantee of near-optimal solution, but not practical for large data sets. A greedy algorithm (Lin and Bilmes, 2010): near-optimal with theoretical guarantee, and practical/scalable! We choose next element with largest ratio of gain over scaled cost: f ( G ∪ { i } ) − f ( G ) k ← argmax . (2) ( c i ) r i ∈ U Scalability: the argmax above can be solved by O (log n ) calls of f , thanks to submodularity Integer linear programming (ILP) takes 17 hours vs. greedy which takes < 1 second!! Lin and Bilmes Submodular Summarization June 20, 2011 9 / 29

Problem Setup and Algorithm Objective Function Optimization: Performance in Practice 140 exact solution 120 e u l a 100 v n o i t 80 c n op mal u f 60 e r=0 v i t c r=0.5 e 40 j b r=1 O 20 r=1.5 0 0 2 4 6 8 10 12 number of sentences in the summary Figure: The plots show the achieved objective function value as the number of selected sentences grows. The plots stop when in each case adding more sentences violates the budget. Lin and Bilmes Submodular Summarization June 20, 2011 10 / 29

A Class of Submodular Functions for Document Summarization Hui Lin, - PowerPoint PPT Presentation

A Class of Submodular Functions for Document Summarization Hui Lin, Jeff Bilmes University of Washington, Seattle Dept. of Electrical Engineering June 20, 2011 Lin and Bilmes Submodular Summarization June 20, 2011 1 / 29 Extractive

( ) Outline Submodular

Minimizing Submodular Functions Satoru Iwata (RIMS, Kyoto University) Outline Submodular

Approximating Submodular Functions Everywhere Nick Harvey February 16, 2008 Joint work with M.

ACL19 Summarization Xiachong Feng Papers Multi-Document Summarization Scientific Paper

Document Summarization Statistical NLP Spring 2011 Lecture 25: Summarization Dan Klein UC

Fast Semi-differential based Submodular Function Optimization Rishabh Iyer 1 Stefanie Jegelka 2

Submodular Maximization Seffi Naor Lecture 2 4th Cargese Workshop on Combinatorial Optimization

Streaming -submodular Maximization under Noise subject to Size Constraint Lan N. Nguyen, My

Optimization of Submodular Functions Tutorial - lecture II Jan Vondrk 1 1 IBM Almaden Research

Maximization of Submodular Functions Seffi Naor Lecture 1 4th Cargese Workshop on Combinatorial

Fast and Private Submodular and k- Submodular Functions Maximization with Matroid Constraints

MELODI M achin E L earning, O ptimization, & D ata I nterpretation @ UW Iyer & Bilmes,

CS675: Convex and Combinatorial Optimization Fall 2019 Submodular Function Optimization

Overview of TAC 2011 Summarization Track Karolina Owczarzak, Hoa Trang Dang National Institute of

A Neural Attention Model for Sentence Summarization Alexander M. Rush, Sumit Chopra, Jason

Statistical NLP Spring 2011 Lecture 25: Summarization Dan Klein UC Berkeley Document

..or Ganesh Gopalakrishnan, Wei-Fan Chiang, and Alexey Solovyev ! School of Computing University

Silicon Compilers - Version 2.0 Andreas Olofsson Program Manager, DARPA/MTO I nternational

Challenges in Applying Ranking and Selection after Search David Eckman Shane Henderson Cornell

Algorithms in Nature Optimization What Is Optimization? Selecting an element from a defined

Human-Robot Interaction Elective in Artificial Intelligence Lecture 9 Motion control for HRI

Anglo-Saxon Literature 05.23.13 || English 2322: British Literature: Anglo-Saxon Mid 18th

State Engineer John DAntonio Jr., P.E. Two Nations One Water April 25, 2019 The agencys

Updating & Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear

A Class of Submodular Functions for Document Summarization Hui Lin, - PowerPoint PPT Presentation

A Class of Submodular Functions for Document Summarization Hui Lin, Jeff Bilmes University of Washington, Seattle Dept. of Electrical Engineering June 20, 2011 Lin and Bilmes Submodular Summarization June 20, 2011 1 / 29 Extractive

( ) Outline Submodular

Minimizing Submodular Functions Satoru Iwata (RIMS, Kyoto University) Outline Submodular

Approximating Submodular Functions Everywhere Nick Harvey February 16, 2008 Joint work with M.

ACL19 Summarization Xiachong Feng Papers Multi-Document Summarization Scientific Paper

Document Summarization Statistical NLP Spring 2011 Lecture 25: Summarization Dan Klein UC

Fast Semi-differential based Submodular Function Optimization Rishabh Iyer 1 Stefanie Jegelka 2

Submodular Maximization Seffi Naor Lecture 2 4th Cargese Workshop on Combinatorial Optimization

Streaming -submodular Maximization under Noise subject to Size Constraint Lan N. Nguyen, My

Optimization of Submodular Functions Tutorial - lecture II Jan Vondrk 1 1 IBM Almaden Research

Maximization of Submodular Functions Seffi Naor Lecture 1 4th Cargese Workshop on Combinatorial

Fast and Private Submodular and k- Submodular Functions Maximization with Matroid Constraints

MELODI M achin E L earning, O ptimization, &amp; D ata I nterpretation @ UW Iyer &amp; Bilmes,

CS675: Convex and Combinatorial Optimization Fall 2019 Submodular Function Optimization

Overview of TAC 2011 Summarization Track Karolina Owczarzak, Hoa Trang Dang National Institute of

A Neural Attention Model for Sentence Summarization Alexander M. Rush, Sumit Chopra, Jason

Statistical NLP Spring 2011 Lecture 25: Summarization Dan Klein UC Berkeley Document

..or Ganesh Gopalakrishnan, Wei-Fan Chiang, and Alexey Solovyev ! School of Computing University

Silicon Compilers - Version 2.0 Andreas Olofsson Program Manager, DARPA/MTO I nternational

Challenges in Applying Ranking and Selection after Search David Eckman Shane Henderson Cornell

Algorithms in Nature Optimization What Is Optimization? Selecting an element from a defined

Human-Robot Interaction Elective in Artificial Intelligence Lecture 9 Motion control for HRI

Anglo-Saxon Literature 05.23.13 || English 2322: British Literature: Anglo-Saxon Mid 18th

State Engineer John DAntonio Jr., P.E. Two Nations One Water April 25, 2019 The agencys

Updating &amp; Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear

MELODI M achin E L earning, O ptimization, & D ata I nterpretation @ UW Iyer & Bilmes,

Updating & Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear