Machine learning vs. knowledge based approaches to ADR - PowerPoint PPT Presentation

Machine learning vs. knowledge based approaches to ADR identification November 2017

Topics Short about us • Iden3fying ADRs • Machine Learning for seman3c rela3ons • iden3fica3on Results • Challenges •

SHORT ABOUT US Focus on text-analy3cs for Pharmaceu3cals. Since 1998 Othe her text xt sou sources s Voi oice of of the he Pa3 Pa3ent nt Electroni Ele onic He Healt alth h Re Recor ords ds Scien3fic literature – FDA – Patents – Business opportuni3es

SHORT ABOUT US Advanced Databases Group, Universidad Carlos III de Madrid § Research lines: • Natural language processing • Accessibility § Resources produced: • Drug-drug-interac-on collec-on (DDI Corpus) • DINTO ontology

Our goal at TAC ADR “ Combine Knowledge Based with Machine Learning Based approaches to leverage ADR identification ”

Iden%fying ADRs

TO TOPI PIC C EXTRA EXTRACTI CTION N NLP and Resource based approach ¡ SI SIDER DER ¡ UMLS UMLS ¡ Tr Trai aini ning ng cor orpus s

TO TOPI PIC C EXTRA EXTRACTI CTION N NLP and Resource based approach ¡ SI SIDER DER ¡ UMLS UMLS ¡ Tr Trai aini ning ng cor orpus s Dictionary #entries Adverse Reactions 21,826 Factor 41 Severity 158 Animal 27 DrugClass 101

TO TOPI PIC C EXTRA EXTRACTI CTION N NLP and Resource based approach And some rules to iden3fy nega3on: • MeaningCloud Insights Engine API supports this rule syntax •

Machine Learning for seman%c rela%ons iden%fica%on

Machine learning for seman3c rela3ons iden3fica3on Represen3ng ADR men3on context through a set of features: ADRMen9on – Other pairs (where Other is Severity, DrugClass, Ø M1TXT, M1TXT, M2TXT, M2TXT, BWTXT BWTXT: the text of both/between men3ons. Nega9on, Animal or Factor) Ø C1BO C1BOW, W, C2BO C2BOW: bag-of-words of both men3ons. Ø C1PO C1POS, S, C2PO C2POS: part of speech of both men3ons. Ø PB1PO PB1POS, S, PA PA1PO 1POS, S, PB2PO PB2POS, S, PA PA2PO 2POS, S, PWPO PWPOS: the PoS tags of the two tokens before/ aZer/between both men3ons. Ø WA WA1TXT, 1TXT, WB2TXT, WB2TXT, WA WA2TXT, 2TXT, WB1TXT WB1TXT: the two tokens aZer/before the men3on. Ø LA LA1LEM, 1LEM, LB2LEM, LB2LEM, LA LA2LEM, 2LEM, LB1LEM LB1LEM: the lemmas of the two tokens aZer/before both men3ons. Ø LWLEM LWLEM: the lemmas between of the two men3ons Ø NTO NTOKB KB: the number of tokens between the two men3ons.

Machine learning for seman3c rela3ons iden3fica3on And the algorithm? § SVM, support vector machines (using scikit-learn on Python) § Specifically, SVC implementa3on: • Default parameter values • Linear kernel But, no deep learning??!! Of course (CNN), but not in the official runs.

Results

Results • Task 1. ADR and related en33es Low precision!! • Task 2. Rela3ons between ADRs and en33es Oh, oh!!

Results • Task 3. Posi3ve ADRs Only a few negated men9ons? PreQy good!! • Task 4. Normaliza3on through MedDRA Using dic9onaries with seman9c informa9on produces nice results

Challenges

Challenges • Nega3on iden3fica3on requires more effort (not only on the ADRs field). Some weird things found in the test set: Eg.: The most frequently observed malignancies other than non- melanoma skin cancer … Negation? • CNNs and the use of syntac3c features improves results P R F1 Other 0.71 0.81 0.76 Negated 0.72 0.40 0.51 Hypothetical 0.75 0.75 0.75 Effect 0.76 0.61 0.68 Avg / total 0.73 0.73 0.73

Challenges • Recall must be improved: o separated mul3word men3ons o ADRs with no MedDRA code, enough lexical resources? • How to approach errors when applying deep learning? • Enough accuracy for prac3cal applica3ons? What does FDA think?

Thanks QUESTIONS?

LabDA Resources Corpus DDI (Drug-Drug Interac3ons) 1,025 annotated documents, 18,502 en33es and } 5,028 DDIs (by expert pharma) MedLine and DrugBank texts } Annota3ns guidelines and interannotator agreement. } Available at labda.inf.uc3m.es } Used at DDIExtrac3on 2011 and DDIExtrac3on 2013 } Semeval Tasks

LabDA Resources DINTO Ontology- knowledge about drugs and intera3ons (11,555 DDIs and 8,786 pharmacological en33es). Available at OBO Foundry Applica3on to Informa3on Extrac3on and Predic3on

Meani Me aning ngClou Cloud d LLC LLC Automa3ng the extrac3on of Meaning from any informa3on source. Addr ddress ss 35-37 36th Street 11106 Astoria NY Cont Contac act Inf nfo o jmar3nez@meaningcloud.com Tele Te lephone hone Phone: +1 (646) 403-3104 meaningcloud.com

Machine learning vs. knowledge based approaches to ADR - PowerPoint PPT Presentation

Machine learning vs. knowledge based approaches to ADR identification November 2017 Topics Short about us Iden3fying ADRs Machine Learning for seman3c rela3ons iden3fica3on Results Challenges SHORT ABOUT US Focus on

ADR in financial services WHAT ADR bodies do WHY ADR is a good idea HOW ADR bodies are

Public consultation EXPERTS WIPO ADR PRESENTATION AND CURRENT STATE OF THE EXPERTS WIPO ADR

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

26:198:722 Expert Systems I Knowledge representation I Knowledge acquisition I Machine learning I

ADR in Qatar - A New Law, A New Regional Seat of Choice, and a New Era ADR in Qatar - A New Law,

The Carriage of Dangerous Goods g g by Road Regulations What is ADR? ADR is the acronym

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Plan for today Knowledge-based systems 1 Explicit knowledge Knowledge Representation Inferred

Plan for today Knowledge-based systems 1 Tacit knowledge Knowledge Representation Inferred

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

effective as a supervised exercise training program for people with CVD? Hazel Mountford &

reduces susceptibility to pulmonary arterial hypertension in bmpr2 deficient mice A Crosby, E

CD38 and Ceramide-Enriched Membrane Platforms in Bovine Coronary Myocytes July 31, 2008 Matt

City of Racine 2020 High Deductible HSA Compatible Health Plan Overview Agenda Benefit

Session 3 : Basis for Regulatory Decision-Making Objectives of paediatric developm ent EMA/ FDA/

Philip Oddie & Abhishek Oswal The clinical scenario The patient & problem Mrs C.

Nutritional Management of Cows Zach McCracken Suther Feeds, Inc. Who We Are Second generation,

Lexaria Bioscience Corp. Investor Presentation LXX:CSE | LXRP:US June 2017

Machine learning vs. knowledge based approaches to ADR - PowerPoint PPT Presentation

Machine learning vs. knowledge based approaches to ADR identification November 2017 Topics Short about us Iden3fying ADRs Machine Learning for seman3c rela3ons iden3fica3on Results Challenges SHORT ABOUT US Focus on

ADR in financial services WHAT ADR bodies do WHY ADR is a good idea HOW ADR bodies are

Public consultation EXPERTS WIPO ADR PRESENTATION AND CURRENT STATE OF THE EXPERTS WIPO ADR

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

26:198:722 Expert Systems I Knowledge representation I Knowledge acquisition I Machine learning I

ADR in Qatar - A New Law, A New Regional Seat of Choice, and a New Era ADR in Qatar - A New Law,

The Carriage of Dangerous Goods g g by Road Regulations What is ADR? ADR is the acronym

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Plan for today Knowledge-based systems 1 Explicit knowledge Knowledge Representation Inferred

Plan for today Knowledge-based systems 1 Tacit knowledge Knowledge Representation Inferred

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

effective as a supervised exercise training program for people with CVD? Hazel Mountford &amp;

reduces susceptibility to pulmonary arterial hypertension in bmpr2 deficient mice A Crosby, E

CD38 and Ceramide-Enriched Membrane Platforms in Bovine Coronary Myocytes July 31, 2008 Matt

City of Racine 2020 High Deductible HSA Compatible Health Plan Overview Agenda Benefit

Session 3 : Basis for Regulatory Decision-Making Objectives of paediatric developm ent EMA/ FDA/

Philip Oddie &amp; Abhishek Oswal The clinical scenario The patient &amp; problem Mrs C.

Nutritional Management of Cows Zach McCracken Suther Feeds, Inc. Who We Are Second generation,

Lexaria Bioscience Corp. Investor Presentation LXX:CSE | LXRP:US June 2017

effective as a supervised exercise training program for people with CVD? Hazel Mountford &

Philip Oddie & Abhishek Oswal The clinical scenario The patient & problem Mrs C.