TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah - PowerPoint PPT Presentation

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah Provenza Ryan Gabbard*, Marjorie Freedman* Distribution Statement `A' (Approved for Public Release, Distribution Unlimited) *now at USC ISI The views, opinions, and/or findings expressed are those of the author(s) and should not be interpreted as representing the 1 official views or policies of the Department of Defense or the U.S. Government.

Document Level Event Extraction • Argument Assertions e.g. (Contact.Meet, Place, Pittsburgh, Actual) 1. Logistic regression to identify (1) event –focused terms and (2) roles/arguments for events • Two argument classifiers: one that depends on event-focused terms, the second relies of just identifying a role in the argument context 2. Identify a canonical string for the argument using • SERIF within document coreference • SERIF time normalization 3. ERE-based trained classifier for distinguishing ACTUAL/GENERIC • Syntactic rules for identifying past/negated as OTHER 4. Joint optimization using system confidence of 1-3 5. World-knowledge based inference using event structure • Within document event frame creation – Sieve-based system that relies on argument overlap, argument conflict, and syntactic links between arguments and event-focused terms 2

2017 Updates • Incorporated additional training data – More Rich ERE – Event Nugget Training – BBN-developed targeted training • Incorporated additional event types – Contact.Broadcast – Contact.Contact – Transaction.Transaction 3

Challenges with Contact.Broadcast • Rich ERE only marks the first mention of a Contact.Broadcast, subsequent mentions are ignored – Unmarked RichERE text is ambiguous between • Negative example for Contact.Broadcast • 2 nd , 3 rd , 4 th ,…. positive example of a Contact.Broadcast event • System trained exclusively with targeted training on EAL dry-run data – Many false alarms that seem like annotation errors – Contact.Broadcast annotation agreement may be low enough to interfere with measuring system performance 4

Targeted Training (1) • Core challenge of EAL task is sparsity of training data – Many annotated documents – Few positive examples of events • Develop targeted event annotation using human intuitions about event contexts – Ask annotator to find useful examples – Let annotator skip hard examples • Annotation process – Annotator asked to come up with a list of likely event-related phrases • Nuggets OR other words likely to be associated with an event – Annotator searches & then marks ~10 examples per-term • Only marks sentences with one event mention (and may skip confusing sentences) • Marks all words that could be considered an event trigger • Marks arguments – Annotator asked to mark negative examples in the surrounding context (e.g. sentence N-1 does not contain a Contact.Meet event) – Annotator revises list to include additional event words • Resulting annotation is – Dense in events – Likely to contain multiple syntactic contexts for arguments <-> triggers – For polysemous triggers, likely to contain positives and negatives 5

Targeted Training (2) • 2015: Annotated ~5.8K positive & 6.4K negative sentences – Each sentence for a single event type • 4-8 hours per event type for all event types • Additional annotation for a few event types where we observed poor system performance – 2015 TAC system used only trigger annotation • ~12% relative improvement on argument score for system (BBN1 vs BBN2) – Arg F1: BBN2 35.5 – Arg F1: BBN1 38.0 (rank 1) • 2016: Additional annotation for new event types P R F1 No spannotator 26.3 26 26.2 Target:Trigger 26.1 26 26.1 Target:Trigger+Arg 28.1 26.2 27.1 2016 Dry Run Data: All Event Types 6

Context Embeddings (2015) • Event arguments can often be distant from event triggers • But often the argument context is informative – The knife-wielding man was tackled by a bystander, but only after three people were severely injured in the attack. – Acme Inc. ’s creditors were disappointed by Friday’s bankruptcy filing. • We would like to learn informative argument contexts which never appear in our supervised training data based on those which do 7

Context Embeddings: AA (2015) • We trained dense vector representations of the normalized dependency trees contexts of words on Gigaword(s) using a variant of the skip-gram model due to (Levy & Goldberg, ‘14) • We include this representation in our AA model tackled <0.25, 1.234, …> obj Arg attach Pooling man classifier mod <-0.34, 0.17, …> Knife-wielding 8

Context Embeddings: AA (2015) • Internal development tests on KBP-2014 EA newswire eval corpus (English) – Embeddings improve on 2014’s best system (BBN1), scored using 2014 EA scorer • 2015’s BBN1 used context embeddings, 2015’s BBN3 did not – ~10% relative improvement from context embeddings • Context embeddings used in all languages in 2017 9

CROSS DOC EVENT FRAME COREFERENCE 10

Cross Document Event Coreference • Task: Identify coreferent event frames across corpus Event-1 Event-2 Role Fillers MEET Role Fillers ENTITY EU heads of government MEET • ENTITY • Mehment Simsek • Ahmet Davutoglu GID: M1 • EU LOCATION Brussels GID: DATE 11-29-2015 LOCATION Brussels M2 DATE 12-14-2015 Role Fillers MEET ENTITY • Turkey MEET Role Fillers • 28 EU member states GID: M1 GID: M2 DATE 12-14-2015 • the presidents of European Council… LOCATION Brussels DATE 11-29-2015 • System can (and probably needs to) use – Information that is available in the event frames – Information directly derived from the document – Information provided by other automatic processes • Cross-document entity coreference (EDL) • Event nuggets and their context • Discovered topics • … 11

Challenges Role Fillers MEET ENTITY • Turkey • Imperfect automatic • 28 EU member states event-frame detection • the presidents of European Council… – Top performing 2015 system: protesters LOCATION Brussels • Precision: 36.8 • Recall: 39.2 Istanbul • Linking F1: 23.3 DATE 11-29-2015 • Event-frames represent a snapshot of what goes into a knowledge-base, not all of the information necessary for coreference decision – Marjorie Freedman and Jason Duncan both attended 3 distinct meetings 09-29-2016 • Event nuggets do not provide same discrimination as entity names – Nuggets for the 09-29-2016 meetings would be: attend or telecon • Currently, no frame-level exhaustive training data – Small number of assessments from pilot – Even when training data exists, it is likely to be small in quantity 12

BBN Approach: Overview • Pipeline of decisions – Find arguments (previous section) – Link arguments into per-document event frames (previous section) – Cluster event-frames across the corpus using event- type (and role) specific intuitions 13

BBN Approach: Argument Specific Intuitions • Define per-role equivalence – TIME: Year, month, and day (if available) relying on SERIF’s Timex normalization – PLACE: Containment of GeoNames’ Admin districts – AGENT/ENTITY/etc.: • For named entities, AWAKE cross-document coreference • Ignore non-named entities (e.g . 7 soldiers , the crowd ) • Event Frame Coreference heuristics include – Specific roles that must be matched (e.g. TIME or PLACE) – Minimum number of arguments that must be matched (e.g. at least three arguments) – Maximum number of • Documents in which an event can be mentioned • Distinct arguments in an event

Thanks!

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah - PowerPoint PPT Presentation

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah Provenza Ryan Gabbard, Marjorie Freedman Distribution Statement `A' (Approved for Public Release, Distribution Unlimited) *now at USC ISI The views, opinions, and/or findings

ABA Meeting TAC Card Update May 21, 2019 Office of Disbursements ABA Meeting TAC Card Update

Texas Administrative Code Ch. 202 W EDNESDAY , J ULY 23, 2014 | A USTIN , T EXAS TAC 202

Existing Class B Graphics Los Angeles TAC/Flyway San Diego TAC/Flyway Phoenix

Text Analysis Conference TAC 2016 Sponsored by: Hoa Trang Dang National Institute of Standards

TAC Meeting Presentation 8/31/15 Additional Analysis Requested by TAC Baseline 2040 Values

TAC Services Overview 1969: TAC was founded by these Associations County Judges &

Red Wing Bridge Project PAC #11/TAC #14 Meeting June 25, 2015 PAC #11/TAC #14 June 25, 2015

Resource Adequacy Assessment December Update Dan Woodfin Director, System Planning TAC TAC

TAC Services Overview New Treasurers Seminar December 14, 2018 San Marcos, TX 1969: TAC was

AICP Island RTPO TAC Meeting Kendra Breiland, Turning Data into Planning Solutions October 10,

JVN-TDT Entity Linking Systems at TAC-KBP2012 at TAC-KBP2012

TAC report Subaru UM 200 K. Ohta Kyoto Univ.) 200 .01.30 outline Trend TAC

Text Analysis Conference TAC 2018 Sponsored by: Hoa Trang Dang U.S. National Institute of

Technology Advisory Committee (TAC) Report to the Board of Education Fremont Unified School

Tollgate Action Committee November TAC Meeting TAC No November Meetin ing Agenda Opening

DIFFERENTIAL REFLECTIVITY CALIBRATION for NEXRAD NEXRAD TAC MEETING NEXRAD TAC MEETING 1

Paper Reading and Presentation CMPS 7010 Research Seminar A Three-Phase Approach How to read

PRESENTATION SKILLS: ONE HOUR WORKSHOP FREE DOWNLOAD Author: Carol Silvis Number of Pages: 112

Andover BF 0132(14) Alternatives Presentation Meeting FAS Route 132 Bridge #9 over Trout

ECONOMIC DEVELOPMENT COMMITTEE EASTLAND MALL REDEVELOPMENT OCTOBER 8, 2018 AGENDA Vision

INSTITUTIONAL ANIMAL CARE AND USE COMMITTEE http://citiprogram.org Step 3: Create your

Commuting Programs and Products TRB 2015 Most Recent American Community Survey (ACS) Releases

SKS@J-PARC J-PARC Project Office Day-1 @J-PARC New

2017/18 9 November 2017 Disclaimer This presentation contains certain forward-looking statements

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah - PowerPoint PPT Presentation

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah Provenza Ryan Gabbard*, Marjorie Freedman* Distribution Statement `A' (Approved for Public Release, Distribution Unlimited) *now at USC ISI The views, opinions, and/or findings

ABA Meeting TAC Card Update May 21, 2019 Office of Disbursements ABA Meeting TAC Card Update

Texas Administrative Code Ch. 202 W EDNESDAY , J ULY 23, 2014 | A USTIN , T EXAS TAC 202

Existing Class B Graphics Los Angeles TAC/Flyway San Diego TAC/Flyway Phoenix

Text Analysis Conference TAC 2016 Sponsored by: Hoa Trang Dang National Institute of Standards

TAC Meeting Presentation 8/31/15 Additional Analysis Requested by TAC Baseline 2040 Values

TAC Services Overview 1969: TAC was founded by these Associations County Judges &amp;

Red Wing Bridge Project PAC #11/TAC #14 Meeting June 25, 2015 PAC #11/TAC #14 June 25, 2015

Resource Adequacy Assessment December Update Dan Woodfin Director, System Planning TAC TAC

TAC Services Overview New Treasurers Seminar December 14, 2018 San Marcos, TX 1969: TAC was

AICP Island RTPO TAC Meeting Kendra Breiland, Turning Data into Planning Solutions October 10,

JVN-TDT Entity Linking Systems at TAC-KBP2012 at TAC-KBP2012

TAC report Subaru UM 200 K. Ohta Kyoto Univ.) 200 .01.30 outline Trend TAC

Text Analysis Conference TAC 2018 Sponsored by: Hoa Trang Dang U.S. National Institute of

Technology Advisory Committee (TAC) Report to the Board of Education Fremont Unified School

Tollgate Action Committee November TAC Meeting TAC No November Meetin ing Agenda Opening

DIFFERENTIAL REFLECTIVITY CALIBRATION for NEXRAD NEXRAD TAC MEETING NEXRAD TAC MEETING 1

Paper Reading and Presentation CMPS 7010 Research Seminar A Three-Phase Approach How to read

PRESENTATION SKILLS: ONE HOUR WORKSHOP FREE DOWNLOAD Author: Carol Silvis Number of Pages: 112

Andover BF 0132(14) Alternatives Presentation Meeting FAS Route 132 Bridge #9 over Trout

ECONOMIC DEVELOPMENT COMMITTEE EASTLAND MALL REDEVELOPMENT OCTOBER 8, 2018 AGENDA Vision

INSTITUTIONAL ANIMAL CARE AND USE COMMITTEE http://citiprogram.org Step 3: Create your

Commuting Programs and Products TRB 2015 Most Recent American Community Survey (ACS) Releases

SKS@J-PARC J-PARC Project Office Day-1 @J-PARC New

2017/18 9 November 2017 Disclaimer This presentation contains certain forward-looking statements

TAC 2017 Jay DeYoung Yee Seng Chan, Chinnu Pittapally, Hannah Provenza Ryan Gabbard, Marjorie Freedman Distribution Statement `A' (Approved for Public Release, Distribution Unlimited) *now at USC ISI The views, opinions, and/or findings

TAC Services Overview 1969: TAC was founded by these Associations County Judges &