a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of - PowerPoint PPT Presentation

Detecting Adverse Drug Reaction in Drug Labels using a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of Biomedical Informatics The University of Texas Health Science Center at Houston

Introduction • TAC 2017 ADR Challenge • Adverse Drug Reaction Extraction from Drug Labels • We participated in all four tasks • Task 1 – Extract mentions of AdverseReactions and modifier concepts (i.e., Severity , Factor , DrugClass , Negation , and Animal) • Task 2 – Identify the relations between AdverseReactions and their modifier concepts (i.e., Negated , Hypothetical , and Effect ) • Task 3 – Identify positive AdverseReaction mentions in the labels • Task 4 – Map recognized positive AdverseReaction to MedDRA PT (s) and LLT (s). TAC 2017 2

Data Sets #drug labels Usage Training 101 Developing models and optimizing parameters Development 2,208 Training word embeddings and rule development Test 99 Testing TAC 2017 3

Pre-processing and baseline approaches Two cases of anaphylaxis were reported in the dose-finding trials. There were no Grade 3 or 4 CLAMP infusion-related reactions reported in Studies 1 and 2; however, Grade 1 or 2 infusion-related reactions were reported for 19 patients (12%). In Studies 1 and 2, the most common adverse reactions Clinical Language Annotation, (>=2%) associated with infusion-related reactions were chills (4%), nausea (3%), dyspnea (3%), Modeling, and Processing Toolkit pruritus (3%), pyrexia (2%), and cough (2%). Sentence Boundary Detection Tokenization POS Tagging Entity Recognition Entity Normalization Visualization TAC 2017 4

Task 1&2: Extract AdverseReactions , related mentions, and their relations • Task 1: Named Entity Recognition • Task 2: Relation Extraction TAC 2017 5

Identified Issues – related mention recognition • A related mention is not annotated in the gold standard if it is not associated with any AdverseReaction Animal • Issue 1 : Cannot train a machine-learning based NER system directly • Issue 2 : Missing some negative relation samples, thus making it difficult for the traditional relation classification approach, which requires for both positive and negative candidates for training TAC 2017 6

Identified Issue – Disjoint/overlapping entities • Example of disjoint entities • Issue : Cannot handle disjoint entities using the traditional NER approaches • Basic assumptions for a machine learning-based NER system • entities do not overlap with one another • each entity consists of contiguous words TAC 2017 7

Our approach - Cascaded Sequence Labeling Models • Model 1 – Sequence labeling model for AdverseReaction only • Model 2 – Recognize both related mentions and their relations to the target AdverseReaction mentions at the same time, using one sequence labeling model TAC 2017 8

Model 1 – AdverseReaction NER • Train 1 st sequence labeling model, recognize AdverseReaction only Gold Label O B-AdverseReaction O O O B-AdverseReaction O O severe neutropenia and Grade 4 thrombocytopenia can occur Word 1 st Sequence Labeling Model TAC 2017 9

Model 2 – Related mentions and relations • Train 2 nd sequence labeling model, focus on modifier concepts and their relations with AdverseReactions together Gold B-Severity O O O O O B-Factor O Label Word severe neutropenia and Grade 4 thrombocytopenia can occur Sample 1 Target ADR O B-T-ADR O O O B-O-ADR O O TAC 2017 10

Model 2 – Related mentions and relations • Train 2 nd sequence labeling model, focus on modifier concepts and their relations with AdverseReactions Gold B-Severity O O O O O B-Factor O Label Word severe neutropenia and Grade 4 thrombocytopenia can occur Sample 1 Target ADR O B-T-ADR O O O B-O-ADR O O Label O O O B-Severity I-Severity O B-Factor O Sample 2 Word severe neutropenia and Grade 4 thrombocytopenia can occur Target ADR O B-O-ADR O O O B-T-ADR O O 2 nd Sequence Labeling Model TAC 2017 11

Predict with Cascaded Sequence Labeling Models Input severe neutropenia and Grade 4 thrombocytopenia can occur 1 st Sequence Labeling Model AdverseReaction Recognition AdeverseReaction AdeverseReaction severe neutropenia and Grade 4 thrombocytopenia can occur Transformation Target- Other- Other- Target- AdeverseReaction AdeverseReaction AdeverseReaction AdeverseReaction severe neutropenia and Grade 4 thrombocytopenia can occur severe neutropenia and Grade 4 thrombocytopenia can occur 2 nd Sequence Labeling Model Modifier Concept Recognition Factor Severity Severity Factor TAC 2017 severe neutropenia and Grade 4 thrombocytopenia can occur severe neutropenia and Grade 4 thrombocytopenia can occur 12

Predict with Cascaded Sequence Labeling Models Other- Target- Target- Other- AdeverseReaction AdeverseReaction AdeverseReaction AdeverseReaction severe neutropenia and Grade 4 thrombocytopenia can occur severe neutropenia and Grade 4 thrombocytopenia can occur + + Severity Factor Severity Factor severe neutropenia and Grade 4 thrombocytopenia can occur severe neutropenia and Grade 4 thrombocytopenia can occur

Sequence Labeling Models • Conditional Random Fields (CRF) • Linear-Chain CRF (Lafferty et al., 2001) • Recurrent Neural Network (RNN) • LSTM-CRF: a bidirectional LSTM with a conditional random field layer above it (Lafferty et al., 2016) • Input layer: word embeddings + character embeddings • LSTM-CRF(Dict) • Use B-/I-/O to represent dictionary lookup results, initiate with random values • Input layer: word embeddings + character embeddings +dictionary features TAC 2017 14

LSTM-CRF(Dict) 1 st model for AdverseReaction 2 nd model for modifier concepts and recognition relation extraction … B-Severity O O … … O B-ADR O … Word/Char Embedding Word/Char Embedding Dictionary Feature Dictionary Feature Target ADR Representation … severe neutropenia and … … severe neutropenia and … … O B-ADR O … TAC 2017 15

Our approach for disjoint entities • Step 1 - Merge qualified disjoint entities into pseudo continuous entities • Step 2 - Training NER models using pseudo continuous entities • Step 3 - Split detected continuous entities using rules TAC 2017 16

Merge and Train disjoint entities • Merge qualified entities in gold standard • Discard, if • cross sentences, or • more than 3 segments, or • more than 5 tokens between two segments • Merge others merge • Train NER models using ‘continuous’ entities ATAC 2017 17

Split continuous entities • Detect candidates • has more than 4 tokens, or • contain any of ‘and’, ‘or’, ‘/’, ‘,’, or ‘(’ • Split using rules • Regular expression rules • ((grade|stage)\s+\d)\s*(?:and|or|\-|\/)\s*(\d) → group(1)|group(2)+group(3) • E.g. ‘Grade 3 and 4’ → ‘Grade 3 ‘ and ‘Grade … 4’ • Dictionary–based rules • Dictionary(~3000 pairs):<infections, viral>, <infections, protozoal>, <increase in, AST> etc. • Started from Training data, and • enriched with MedDRA terms E.g. viral, or protozoal infections ’ → ‘ viral … infections’ and ‘protozoal infections’ • 18 ATAC 2017 18

Task 3 - Identify Positive AdverseReactions • An AdverseReaction is positive if: the AdverseReaction is not negated AND the AdverseReaction is not related by a Hypothetical relation to a DrugClass or Animal ATAC 2017 19

Task 4 Link AdverseReactions to MedDRA codes • Work flow for MedDRA encoding Retrieve Top 10 Lucene BM25 Similarity Scores Learning to rank BM25 Matching Score Input Terms Linear RankSVM Jaccard Similarity Score Translation-based Matching Score Index of MedDRA Terms Top 10 Concepts Top 10 Concepts BM25 Jaccard TransLM Top 10 Concepts score Lipids Lipids 11.12 0.5 -1.95 Lipids 0.73 “elevations, lipids” Lipid proteinosis Lipid proteinosis 8.93 0.5 -5.74 Lipid proteinosis 0.63 … … … Lipid increased Lipid increased 8.93 0.5 -0.76 Lipid increased 0.98 20 TAC 2017

Translation-based similarity • Motivation --- Word mismatch problem Mention Elevations, lipids Simple Match lipids Ground-truth lipids increased • Machine translation model • Word-to-word translation probability • t = increased, w = elevations, p(w|t) = 0.6142 TAC 2017 21

Train the word-to-word translation probabilities • Prepare parallel corpus • From MedDRA, construct 53,368 mapping pairs <Low Level Term, Preferred Term>, e.g. • <Diseases of nail, Nail disorder> • <Bilirubin elevated, Blood bilirubin increased> • From Training Data, construct 7,045 mapping pairs <Mention, Mapped MedDRA Term>, e.g. • <alt elevations, ALT increased> • <cardiovascular disease, cardiovascular disorder> • Train word-to-word translation probability with IBM Model 1(Brown et al., 1993) 𝑄𝒖 ⁠ 𝒕 𝒕 = 𝜗/ ( 𝑚 +1) ↑𝑛 ∏𝑘 =1 ↑𝑛▒∑𝑗 =0 ↑𝑚▒𝑞 ( 𝑢↓𝑘 | 𝑡↓𝑗 ) We use GIZA++ toolkit to train the translation probabilities TAC 2017 22

Submissions • Run 1: discarded all disjoint AdverseReaction s, for higher precision • Run 2: use “merge → predict → split” strategy, for higher recall • Run 3: combine Run 1 and Run 2, for higher F1 TAC 2017 23

a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of - PowerPoint PPT Presentation

Detecting Adverse Drug Reaction in Drug Labels using a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of Biomedical Informatics The University of Texas Health Science Center at Houston Introduction TAC 2017 ADR Challenge

Structured Perceptron CMSC 470 Marine Carpuat POS tagging Sequence labeling with the perceptron

Protein Sequence Analysis Protein Sequence Analysis Protein sequence motifs Protein sequence

POS tagging CMSC 723 / LING 723 / INST 725 Marine Carpuat POS tagging Sequence labeling with

Background Sequence labeling MEMMs - ? HMMs you know, right? Structured

EMNLP | 2020 SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup Rongzhi Zhang, Yue

Sequence to Sequence models: Attention Models 1 Sequence-to-sequence modelling Problem:

Sequence to Sequence models: Attention Models 1 Sequence-to-sequence modelling Problem:

Sequence to Sequence models: Connectionist Temporal Classification 1 Sequence-to-sequence

Sequence Labeling Markov Models Many information extraction tasks can be formulated as

Conditional Random Fields Dietrich Klakow Overview Sequence Labeling Bayesian Networks

Learning Cascaded Influence under Partial Monitoring Jiaqi Ma 1 Jie Zhang 2 Jie Tang 3 1 Dept. of

SEQUENCE ANALYSIS The term " sequence analysis " in biology implies subjecting a DNA or

Requirements of the Final Rule for Restaurant Menu Labeling Loretta Carey Food Labeling and

Definitions in the Final Rule for Restaurant Menu Labeling Loretta Carey Food Labeling and

Fall Seminar Seed Sampling & Labeling Larry Nees Seed Administrator Office of INDIANA

Hub Labeling Algorithms Andrew V. Goldberg Amazon.com A.V. Goldberg Hub Labeling 6/2/2016 1 /

Myeloi yeloid d Pr Progenit nitor ors: s: An off-the-shel An shelf f cellula ular r

Automatic generation of MedDRA terms groupings using an ontology Gunnar DECLERCK a , Cdric

Capital Markets Day Delivering sustainable growth and value 19 th September 2019 Cautionary

Draft Planning Guidance Public Meeting 16 July 2019 This evenings meeting Agenda

CTP-543, an Oral JAK Inhibitor, Achieves Primary Endpoint in Phase 2 Randomized,

Pediatric Readmissions: Development of a New Federally-Sponsored Measure Mari M. Nakamura, MD,

individualizing treatment choices for people with IBD This is what we all want! We are not there

Sanjeev Arora M.D. Sanjeev Arora M.D. Professor of Medicine (Gastroenterology/Hepatology)

Sambuz

Useful Links

Newsletter

Mail Us

a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of - PowerPoint PPT Presentation

Detecting Adverse Drug Reaction in Drug Labels using a Cascaded Sequence Labeling Approach Hua Xu Ph.D. School of Biomedical Informatics The University of Texas Health Science Center at Houston Introduction TAC 2017 ADR Challenge

Structured Perceptron CMSC 470 Marine Carpuat POS tagging Sequence labeling with the perceptron

Protein Sequence Analysis Protein Sequence Analysis Protein sequence motifs Protein sequence

POS tagging CMSC 723 / LING 723 / INST 725 Marine Carpuat POS tagging Sequence labeling with

Background Sequence labeling MEMMs - ? HMMs you know, right? Structured

EMNLP | 2020 SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup Rongzhi Zhang, Yue

Sequence to Sequence models: Attention Models 1 Sequence-to-sequence modelling Problem:

Sequence to Sequence models: Attention Models 1 Sequence-to-sequence modelling Problem:

Sequence to Sequence models: Connectionist Temporal Classification 1 Sequence-to-sequence

Sequence Labeling Markov Models Many information extraction tasks can be formulated as

Conditional Random Fields Dietrich Klakow Overview Sequence Labeling Bayesian Networks

Learning Cascaded Influence under Partial Monitoring Jiaqi Ma 1 Jie Zhang 2 Jie Tang 3 1 Dept. of

SEQUENCE ANALYSIS The term &quot; sequence analysis &quot; in biology implies subjecting a DNA or

Requirements of the Final Rule for Restaurant Menu Labeling Loretta Carey Food Labeling and

Definitions in the Final Rule for Restaurant Menu Labeling Loretta Carey Food Labeling and

Fall Seminar Seed Sampling &amp; Labeling Larry Nees Seed Administrator Office of INDIANA

Hub Labeling Algorithms Andrew V. Goldberg Amazon.com A.V. Goldberg Hub Labeling 6/2/2016 1 /

Myeloi yeloid d Pr Progenit nitor ors: s: An off-the-shel An shelf f cellula ular r

Automatic generation of MedDRA terms groupings using an ontology Gunnar DECLERCK a , Cdric

Capital Markets Day Delivering sustainable growth and value 19 th September 2019 Cautionary

Draft Planning Guidance Public Meeting 16 July 2019 This evenings meeting Agenda

CTP-543, an Oral JAK Inhibitor, Achieves Primary Endpoint in Phase 2 Randomized,

Pediatric Readmissions: Development of a New Federally-Sponsored Measure Mari M. Nakamura, MD,

individualizing treatment choices for people with IBD This is what we all want! We are not there

Sanjeev Arora M.D. Sanjeev Arora M.D. Professor of Medicine (Gastroenterology/Hepatology)

Sambuz

Useful Links

Newsletter

Mail Us

SEQUENCE ANALYSIS The term " sequence analysis " in biology implies subjecting a DNA or

Fall Seminar Seed Sampling & Labeling Larry Nees Seed Administrator Office of INDIANA