UTD HLTRI at TAC 2019: DDI Track
Ramon Maldonado, Maxwell Weinzierl, & Sanda M. Harabagiu
The University of Texas at Dallas
Human Language Technology Research Institute
http://www.hlt.utdallas.edu/~{ramon, max, sanda}

Outline
1. Introduction
2. The Approach
   1. Pipeline Overview
   2. Preprocessing
   3. Multi-Task Transformer
   4. Postprocessing
3. Results
4. Conclusion

Introduction
Multi-task neural model for:
• Task 1: entity identification
• Task 2: relation identification
• Task 3*: concept normalization
• Task 4: normalized relation identification

Introduction
Problem
• Sentence-level
• Binary relation identification
Our Approach
• Multi-task learning
  – Sentence classification
  – Mention boundary detection
  – Relation extraction
  – PK effect classification
• Pre-trained Transformer for shared representation

The Approach
[Figure: FDA Label Drug-Drug Interaction Pipeline]
Input
• Structured Product Labels (SPLs)
Preprocessing
• Annotation Propagation
  – Mentions
  – Relations
  – Pseudo-triggers
• Tokenization
  – spaCy
  – Word-piece
Multi-task Transformer Net for Identifying Drug-Drug Interactions
• BERT Shared Representation
• Sentence Classifier
• Mention Boundary
• Relation Extractor
• PKE Classifier
• Outputs: Mentions, Relations, PK Effects
Postprocessing
• Mention Filtering
• Continuation Linking
• Unused mention/relation filtering
• Normalization (UMLS, SNOMED-CT, MED-RT)
Outputs
• Task 1: Mentions
• Task 2: Relations
• Task 3: Normalized Mentions
• Task 4: Label Interactions

Preprocessing
• Binary Relations
  – (Trigger, Precipitant, Effect) ->
    • (Trigger, Precipitant)
    • (Trigger, Effect)
  – Pseudo-triggers for SIs in some PDIs
  – PK effects as attributes
• Mention annotation propagation
  – Ease the learning problem
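The binary decomposition above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the `Interaction` type, the `to_binary` helper, the role names, and the example strings are all hypothetical, and pseudo-triggers and PK effect attributes are left out.

```python
from typing import List, NamedTuple, Optional, Tuple


class Interaction(NamedTuple):
    """Hypothetical container for one annotated ternary interaction."""
    trigger: str
    precipitant: str
    effect: Optional[str] = None  # absent when no effect is annotated


def to_binary(interaction: Interaction) -> List[Tuple[str, str, str]]:
    """Split (Trigger, Precipitant, Effect) into trigger-anchored pairs.

    Each pair is (trigger, role, argument), so both pairs share the trigger.
    """
    pairs = [(interaction.trigger, "Precipitant", interaction.precipitant)]
    if interaction.effect is not None:
        pairs.append((interaction.trigger, "Effect", interaction.effect))
    return pairs


# Made-up example strings, for illustration only.
example = Interaction("increase", "drug A", "bleeding")
for pair in to_binary(example):
    print(pair)
```

Keeping the trigger in both pairs is what later allows the ternary interaction to be rebuilt from the binary relations.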

Preprocessing
• Tokenization
  – spaCy
  – WordPiece using BERT vocab
• C-IOBES tagging
  – Continuation necessary for disjoint spans
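The slides do not spell out the C-IOBES scheme, so the sketch below makes assumptions: the first contiguous fragment of a mention gets the usual B/I/E/S tags, and every token of any later fragment of the same mention gets a C (continuation) tag. Mention types are omitted from the tags, and `c_iobes_tags` is a hypothetical helper name.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # half-open token span: (start, end)


def c_iobes_tags(n_tokens: int, mentions: List[List[Span]]) -> List[str]:
    """Tag tokens with C-IOBES labels.

    Each mention is a list of fragments; a disjoint mention has more than one.
    Assumed scheme: first fragment -> B/I/E or S, later fragments -> C.
    """
    tags = ["O"] * n_tokens
    for fragments in mentions:
        (start, end), *rest = sorted(fragments)
        if end - start == 1:
            tags[start] = "S"  # single-token fragment
        else:
            tags[start] = "B"
            for i in range(start + 1, end - 1):
                tags[i] = "I"
            tags[end - 1] = "E"
        for c_start, c_end in rest:
            for i in range(c_start, c_end):
                tags[i] = "C"  # continuation of a disjoint span
    return tags


# One disjoint mention covering tokens 0-1 and token 4 of a 5-token sentence.
print(c_iobes_tags(5, [[(0, 2), (4, 5)]]))  # ['B', 'E', 'O', 'O', 'C']
```

Without the C tag, a plain IOBES tagger would have to emit two separate mentions for the two fragments.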

Multi-Task Transformer
[Figure: Multi-Task Transformer network for Identifying Drug-Drug Interactions (MTTDDI)]
• BERT Sentence Encoder: input [CLS] t1 t2 t3 ... tn [SEP]; outputs s and c1 ... cn
• Sentence Classifier (softmax layer): sentences containing interactions
• Mention Boundary Labeler (CRF): mention type & boundary labels b1 ... bn for all words
• Relation Extractor (trigger, argument, and context embeddings; softmax layer): relation labels r for all mention pairs
• Pharmacokinetic Effect (PKE) Classifier (softmax layer): PKI effect codes, if r is a PKI
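The way the four heads depend on each other at prediction time can be sketched as plain control flow. This is an assumption-laden illustration: the heads are passed in as ordinary callables standing in for the neural components, `run_heads` is a hypothetical name, the relation head is assumed to return `None` for unrelated pairs, and the sentence classifier is assumed to gate the other heads.

```python
from itertools import combinations
from typing import Callable, List, Optional, Tuple

Relation = Tuple[str, str, str, Optional[str]]  # (mention, mention, label, PK effect)


def run_heads(
    sentence: str,
    has_interaction: Callable[[str], bool],
    label_mentions: Callable[[str], List[str]],
    classify_relation: Callable[[str, str, str], Optional[str]],
    classify_pk_effect: Callable[[str, str, str], str],
) -> Tuple[List[str], List[Relation]]:
    """Chain stand-in heads: sentence filter, mentions, relations, PK effects."""
    if not has_interaction(sentence):
        return [], []  # sentence filtered out, nothing else runs
    mentions = label_mentions(sentence)
    relations: List[Relation] = []
    for first, second in combinations(mentions, 2):  # all mention pairs
        label = classify_relation(sentence, first, second)
        if label is None:
            continue
        # The PK effect head is only consulted when the relation is a PKI.
        effect = classify_pk_effect(sentence, first, second) if label == "PKI" else None
        relations.append((first, second, label, effect))
    return mentions, relations
```

In the actual model the heads share one BERT encoding of the sentence; the callables here hide that detail and only show the order of decisions.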

Postprocessing
• Filtering
  – Invalid boundary tag sequences
  – Repeated mentions
  – Mentions not involved in an interaction
• C-spans linked to closest mention
• Reconstruct ternary interactions from binary through shared trigger
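Two of these steps lend themselves to a short sketch. Assumptions are made throughout: "closest" is taken to mean the smallest token gap between spans, precipitants and effects that share a trigger are combined as a cross product, and the helper names and role labels are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Optional, Tuple

Span = Tuple[int, int]  # half-open token span: (start, end)


def gap(a: Span, b: Span) -> int:
    """Number of tokens between two non-overlapping spans."""
    return max(a[0], b[0]) - min(a[1], b[1])


def link_continuations(mentions: List[Span], c_spans: List[Span]) -> Dict[Span, List[Span]]:
    """Attach each C-span to the mention with the smallest token gap."""
    linked = {mention: [mention] for mention in mentions}
    if not mentions:
        return linked
    for c_span in c_spans:
        nearest = min(mentions, key=lambda mention: gap(mention, c_span))
        linked[nearest].append(c_span)
    return linked


def rebuild_ternary(
    binary: Iterable[Tuple[str, str, str]]
) -> List[Tuple[str, str, Optional[str]]]:
    """Join (trigger, role, argument) pairs that share a trigger."""
    by_trigger = defaultdict(lambda: {"Precipitant": [], "Effect": []})
    for trigger, role, argument in binary:
        by_trigger[trigger][role].append(argument)
    triples = []
    for trigger, args in by_trigger.items():
        effects = args["Effect"] or [None]  # keep precipitants with no effect
        for precipitant in args["Precipitant"]:
            for effect in effects:
                triples.append((trigger, precipitant, effect))
    return triples
```

A trigger with one precipitant and one effect yields exactly one triple; a trigger with a precipitant but no effect yields a triple whose effect is `None`.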

Postprocessing
• Normalization
  – String matching
  – SNOMED-CT
    • Specific interactions
  – MED-RT
    • Drug classes
  – UNII
    • Precipitants
  – Augmented with atoms from UMLS
• Map precipitants first to MED-RT, then to UNII if no match was found
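The MED-RT-then-UNII fallback for precipitants can be sketched as a two-stage dictionary lookup. This assumes that "string matching" means exact matching after lowercasing and trimming, which the slides do not confirm. The lexicon entries and identifiers below are placeholders, not real MED-RT or UNII codes.

```python
from typing import Dict, Optional, Tuple


def normalize_precipitant(
    mention: str,
    med_rt: Dict[str, str],
    unii: Dict[str, str],
) -> Optional[Tuple[str, str]]:
    """Return (vocabulary, identifier), trying MED-RT before UNII."""
    key = mention.strip().lower()
    if key in med_rt:
        return ("MED-RT", med_rt[key])
    if key in unii:
        return ("UNII", unii[key])
    return None  # no match in either vocabulary


# Toy lexicons with placeholder identifiers (hypothetical, for illustration).
med_rt_lexicon = {"drug class x": "MEDRT-PLACEHOLDER-1"}
unii_lexicon = {"drug a": "UNII-PLACEHOLDER-1"}

print(normalize_precipitant("Drug Class X", med_rt_lexicon, unii_lexicon))
print(normalize_precipitant("drug A", med_rt_lexicon, unii_lexicon))
print(normalize_precipitant("unknown", med_rt_lexicon, unii_lexicon))
```

Trying MED-RT first means a mention that names a drug class is resolved as a class even if a substance entry would also match.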

Postprocessing
Task 4
• Inferred from unique interactions between normalized mentions
• PK effect codes from MTTDDI
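Collapsing sentence-level interactions into unique label-level ones amounts to de-duplication over normalized identifiers. The sketch below assumes each interaction is a tuple of (precipitant identifier, interaction type, effect identifier); the tuple layout, the helper name, and the identifiers are hypothetical.

```python
from typing import Hashable, Iterable, List, Tuple

LabelInteraction = Tuple[Hashable, Hashable, Hashable]


def label_interactions(sentence_interactions: Iterable[LabelInteraction]) -> List[LabelInteraction]:
    """Keep one copy of each normalized interaction, in first-seen order."""
    # dict keys are unique and preserve insertion order.
    return list(dict.fromkeys(sentence_interactions))


# The same normalized interaction found in two sentences is reported once.
found = [("P1", "PKI", "E1"), ("P1", "PKI", "E1"), ("P2", "PDI", "S1")]
print(label_interactions(found))  # [('P1', 'PKI', 'E1'), ('P2', 'PDI', 'S1')]
```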

Results
Evaluated MTTDDI against two alternate configurations:
• UTDHLTRI Run3: No sentence filtering/targeted training
• Run3 + Filtering: Dedicated Learners

System            Task1  Task2  Task3  Task4
Best Submission   65.38  49.03  62.39  17.56
Median            48.97  37.13  45.53  17.56
UTDHLTRI Run3     35.04  27.48  28.66  17.56
Run3 + Filtering  56.03  42.29  45.73  24.07
MTTDDI            54.39  41.34  44.08  25.20

* Bold indicates best score. Italics indicates best score among LDIIP systems.

Questions
