Learning Greedy Policies for the Easy-First Framework (PowerPoint PPT Presentation)



SLIDE 1

Learning Greedy Policies for the Easy-First Framework

Jun Xie, Chao Ma, Janardhan Rao Doppa, Prashanth Mannem, Xiaoli Fern, Tom Dietterich, Prasad Tadepalli
Oregon State University


SLIDE 2

The Easy-First Framework: Example

Doc 1: A 4.2 magnitude earthquake struck near eastern Sonoma County.
Doc 2: A tremor struck in Sonoma County.

SLIDE 3

The Easy-First Framework: Example

Doc 1: A 4.2 magnitude earthquake struck near eastern Sonoma County.
Doc 2: A tremor struck in Sonoma County.

Mentions: "A 4.2 magnitude earthquake", "A tremor", "eastern Sonoma County", "Sonoma County"

  • 1. Begin with every mention in its own cluster
SLIDE 4

The Easy-First Framework: Example

Doc 1: A 4.2 magnitude earthquake struck near eastern Sonoma County.
Doc 2: A tremor struck in Sonoma County.

Mentions: "A 4.2 magnitude earthquake", "A tremor", "eastern Sonoma County", "Sonoma County"

  • 1. Begin with every mention in its own cluster
  • 2. Evaluate all possible merges with a scoring function and select the highest scoring merge (easiest)

SLIDE 5

The Easy-First Framework: Example

Doc 1: A 4.2 magnitude earthquake struck near eastern Sonoma County.
Doc 2: A tremor struck in Sonoma County.

Mentions: "A 4.2 magnitude earthquake", "A tremor", "eastern Sonoma County", "Sonoma County"

  • 1. Begin with every mention in its own cluster
  • 2. Evaluate all possible merges with a scoring function and select the highest scoring merge (easiest)
  • 3. Repeat until stopping condition is met
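The three steps can be sketched as a greedy loop. This is an illustrative sketch, not the paper's code: `score` stands in for the learned scoring function, and the stopping condition is a simple threshold on the best merge score.

```python
def easy_first_merge(mentions, score, threshold=0.0):
    """Greedy easy-first clustering: repeatedly apply the single
    highest-scoring (easiest) merge until no merge clears the threshold."""
    # 1. Begin with every mention in its own cluster.
    clusters = [frozenset([m]) for m in mentions]
    while len(clusters) > 1:
        # 2. Evaluate all possible pairwise merges with the scoring function.
        candidates = [(score(a, b), a, b)
                      for i, a in enumerate(clusters)
                      for b in clusters[i + 1:]]
        best, a, b = max(candidates, key=lambda t: t[0])
        # 3. Repeat until the stopping condition is met.
        if best <= threshold:
            break
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters
```

With a word-overlap `score`, "A 4.2 magnitude earthquake" and "A tremor ... Sonoma County" style mentions that share words would be merged first, mirroring the example above.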
SLIDE 6

Easy First Training

[Figure: an easy-first training run. From the initial state S, the trajectory passes through states S1, S2, S3, ..., each offering candidate actions (a, b, c, d, ...) scored by f (e.g. f(a) = 0.04, f(b) = 0.36, ...). Actions are marked good or bad, and weight updates ("Weight Update") are performed along the trajectory until the terminal state S_T is reached.]
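One way to read the training trajectory as code: follow greedy choices, and whenever the greedy action at a state is bad, update the weights before continuing. This is a perceptron-style sketch under assumed data structures, not the paper's actual update rule.

```python
import numpy as np

def easy_first_train(states, w, lr=0.1):
    """Follow a greedy trajectory; whenever the greedy (highest-scoring)
    action at a state is bad, nudge the weights toward the best good
    action and away from the chosen bad one (perceptron-style sketch).

    Each state is a pair (actions, good): a list of feature vectors and
    the set of indices of the good actions."""
    for actions, good in states:
        scores = [w @ x for x in actions]
        chosen = int(np.argmax(scores))
        if chosen not in good:                       # greedy choice was bad
            best_good = max(good, key=lambda g: scores[g])
            w = w + lr * (actions[best_good] - actions[chosen])
    return w
```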

SLIDE 7

Learning Scoring Function

Possible goal: learn a scoring function such that, in every state, ALL good actions are ranked higher than all bad actions.

This is an over-constrained goal.

A better goal: learn a scoring function such that, in every state, ONE good action is ranked higher than all bad actions.

SLIDE 8

Proposed Objective for Update

  • Goal: find a linear function that ranks one good action higher than all bad actions

– This can be achieved by a set of constraints:

    max_{g ∈ G} w·x_g > w·x_b + 1,   for all b ∈ B

  • Our Objective:
  • Use hinge loss to capture the constraints
  • Regularization to avoid an overly aggressive update

    argmin_w (1/|B|) Σ_{b ∈ B} [1 - max_{g ∈ G} w·x_g + w·x_b]_+ + (c/2) ||w - w'||^2

  (G: good actions, B: bad actions, x_a: feature vector of action a, w': weights before the update, [·]_+: hinge / positive part)
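The objective on this slide, a hinge loss asking the best good action to outscore each bad action by a margin of 1, plus a proximal regularizer toward the previous weights, can be evaluated directly. A NumPy sketch with illustrative names:

```python
import numpy as np

def bgvb_objective(w, w_prev, X_good, X_bad, c=1.0):
    """Objective value for one update: average hinge loss requiring the
    best good action to outscore every bad action by a margin of 1,
    plus a proximal term keeping w close to the previous weights."""
    best_good = (X_good @ w).max()                       # max_g w . x_g
    hinge = np.maximum(0.0, 1.0 - best_good + X_bad @ w)
    return hinge.mean() + 0.5 * c * np.sum((w - w_prev) ** 2)
```

A weight vector that already separates the best good action from every bad action by the margin pays only the proximal cost.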

SLIDE 9

Optimization

  • Majorization-Minimization (MM) algorithm to find a locally optimal solution.

  • In each MM iteration:

– Let g* be the current highest-scoring good action
– Solve the following convex objective (via subgradient descent):

    argmin_w (1/|B|) Σ_{b ∈ B} [1 - w·x_{g*} + w·x_b]_+ + (c/2) ||w - w'||^2
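The MM loop on this slide (fix g*, the currently highest-scoring good action, run subgradient descent on the resulting convex objective, and repeat) can be sketched in NumPy. Step sizes, iteration counts, and names are illustrative, not the paper's implementation:

```python
import numpy as np

def mm_update(w_prev, X_good, X_bad, c=1.0, lr=0.01, mm_iters=5, sub_iters=100):
    """Majorization-Minimization: alternate (a) fixing g*, the currently
    highest-scoring good action, with (b) subgradient descent on the
    convex objective obtained by holding g* fixed."""
    w = w_prev.copy()
    for _ in range(mm_iters):
        g_star = X_good[np.argmax(X_good @ w)]       # (a) current best good action
        for _ in range(sub_iters):
            margins = 1.0 - w @ g_star + X_bad @ w   # hinge margin per bad action
            violated = margins > 0                   # bad actions inside the margin
            grad = c * (w - w_prev)                  # gradient of proximal term
            if violated.any():
                # subgradient of the averaged hinge term
                grad += (X_bad[violated].sum(axis=0)
                         - violated.sum() * g_star) / len(X_bad)
            w = w - lr * grad                        # (b) subgradient step
    return w
```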

SLIDE 10

Contrast with Existing Methods

  • Average-good vs. average-bad (AGAB)
  • Best-good vs. best-bad (BGBB)
  • Proposed method: Best-good vs. violated-bad (BGVB)
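The three strategies differ only in which good/bad scores enter the comparison. A sketch of those choices (function and variable names are illustrative, and the margin-violation test follows the hinge objective from the earlier slide):

```python
import numpy as np

def compared_scores(w, X_good, X_bad, method):
    """Return the good/bad score(s) each update strategy contrasts."""
    sg, sb = X_good @ w, X_bad @ w
    if method == "AGAB":     # average good vs. average bad
        return sg.mean(), sb.mean()
    if method == "BGBB":     # best good vs. best (highest-scoring) bad
        return sg.max(), sb.max()
    if method == "BGVB":     # best good vs. every margin-violating bad
        return sg.max(), sb[sb > sg.max() - 1.0]
    raise ValueError(method)
```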

SLIDE 11

Experiment I: cross-document entity and event coref

[Bar chart: MUC, B-CUBED, CEAF_e, and CoNLL scores (range 10 to 80) on the EECB corpus (Lee et al., 2012) for BGBB, R-BGBB, BGVB, R-BGVB, and Lee et al.]

SLIDE 12

12

Experiment II: within-doc Coref

10 20 30 40 50 60 70 80 MUC B-CUBE CEAF_e CoNLL

Results on OntoNotes

BGBB R-BGBB BGVB R-BGVB

SLIDE 13

Diagnostics

  • Some training statistics on ACE 2004 corpus:

Approach  Total Steps  Mistakes  Recoveries  Percentage  Accuracy
RBGVB     50195        16228     4255        0.262       0.87
SLIDE 14

Diagnostics

  • Some training statistics on ACE 2004 corpus:

Approach  Total Steps  Mistakes  Recoveries  Percentage  Accuracy
RBGVB     50195        16228     4255        0.262       0.87
BGBB      50195        11625     4075        0.351       0.82

BGBB corrects errors more aggressively than RBGVB. This is strong evidence that overfitting does happen with BGBB.
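Assuming the Percentage column is Recoveries / Mistakes (an inference from the numbers, not stated on the slide), the table is internally consistent. A quick check:

```python
# Assumption: Percentage = Recoveries / Mistakes (inferred, not stated).
stats = {"RBGVB": (16228, 4255, 0.262),
         "BGBB": (11625, 4075, 0.351)}
for name, (mistakes, recoveries, pct) in stats.items():
    assert round(recoveries / mistakes, 3) == pct  # both rows check out
```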
SLIDE 15

Contributions

  • We precisely represent the learning goal for Easy-First as an optimization problem
  • We develop an efficient Majorization-Minimization algorithm to optimize the proposed objective
  • We achieve highly competitive results against the state of the art for both within- and cross-document coref
