Extracting drug-drug interactions from pharmacological texts. - - PowerPoint PPT Presentation

extracting drug drug interactions from pharmacological
SMART_READER_LITE
LIVE PREVIEW

Extracting drug-drug interactions from pharmacological texts. - - PowerPoint PPT Presentation

Extracting drug-drug interactions from pharmacological texts. Isabel Segura Bedmar, Cesar de Pablo-Snchez, (joint work with Mario Crespo and Paloma Martnez) CS Department, Universidad Carlos III de Madrid September 2009 Saarbrcken,


slide-1
SLIDE 1

Extracting drug-drug interactions from pharmacological texts.

Isabel Segura Bedmar, Cesar de Pablo-Sánchez, (joint work with Mario Crespo and Paloma Martínez) CS Department, Universidad Carlos III de Madrid September 2009 Saarbrücken, Germany

slide-2
SLIDE 2
  • U. Carlos III de Madrid

http://www.uc3m.es/

Founded on 1989 3 campus in Madrid province

Getafe (Humanities, Social Sciences) Leganés (Engineering) Colmenarejo

slide-3
SLIDE 3

Research on LT – IE on pharmacological and clinical domain – Question Answering – Natural Interaction in real environments

  • E-commerce
  • People with dissabilities

Advanced Databases Group http://basesdatos.uc3m.es/

slide-4
SLIDE 4

http://www.mavir.net

http://twitter.com/mavircm

  • 6 Madrid research group (30 doctors)

UNED, UAM, UC3M, UEM, UPM and CINDOC

  • Network of companies and research groups

academic vs. professional research vs. services resource generation vs. applications

slide-5
SLIDE 5

http://www.mavir.net

http://twitter.com/mavircm

Human Language Technologies: Distributed, Multilingual & Multimedia Information Retrieval Automatic Question & Answer Systems with Natural Language Semantic Web Automatic Document Classification and Document Summarization Representation and Extraction of Linguistic Information Scientific Communication via WWW: Cybermetric indicators, Webometrics Cultural, scientific, technical and business web sites Digital Libraries User-centered design: usability, accessibility and visualization

slide-6
SLIDE 6

Extracting drug-drug interactions from pharmacological texts.

Isabel Segura Bedmar, Cesar de Pablo-Sánchez, (joint work with Mario Crespo and Paloma Martínez) CS Department, Universidad Carlos III de Madrid September 2009 Saarbrücken, Germany

slide-7
SLIDE 7

What is a Drug-Drug Interaction?

slide-8
SLIDE 8

Beneficial

Ritonavir + Lopinavir = Effective antiretroviral Nifedipine + propranolol = Antianginal drug

slide-9
SLIDE 9

Dangerous

Aspirin + Heparin Bleeding → Aspirin + Acetazolamide Death →

slide-10
SLIDE 10

 Medication errors kill

7,000 patients per annum in USA1.

 5% of medication

errors are DDI2.

 High incidence in

certain patient groups.

 Increase the

Healthcare costs

Things can get complicated...

Kohn et al., 2000. “To Err is Human”. Leape et al., 1995. “Systems analysis of adverse drug events”.

slide-11
SLIDE 11

Drug interaction Resources

Most effective source: Medical Literature.

slide-12
SLIDE 12

How Information Extraction helps?

Aspirin may decrease the effects of probenecid, sulfinpyrazone, and phenylbutazone.

DDI MAY DECREASE( ASPIRIN , PROBENECID) DDI MAY DECREASE ( ASPIRIN , SULFINPYRAZONE ) DDI MAY DECREASE ( ASPIRIN , PHENYLBUTAZONE)

slide-13
SLIDE 13

What will you see in this talk?

Construction of an annotated corpus with DDI Information Extraction System for DDI

T ext Analysis Drug Name Recognition Anaphora Resolution DDI Extraction

I

slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16
slide-17
SLIDE 17

DrugBank HTML To Text Wrapper Corpus TXT

Collecting a corpus for DDI

slide-18
SLIDE 18

Annotation of Corpus DrugDDI

Total Avg . per doc Documents 579 Sentences 5806 10.2 Sentences with at least one DDI 2044 3.5 Drugs 14930 25.7 Drug-Drug Interactions 3027 5.2

slide-19
SLIDE 19

Example of annotation

slide-20
SLIDE 20

What will you see in this talk?

Construction an annotated corpus with DDI Information Extraction System for DDI

T ext Analysis Drug Name Recognition Anaphora Resolution DDI Extraction

I

slide-21
SLIDE 21

IE System for DDI

Corpus TXT Text analysis XML annotated with shallow syntactic and semantic information Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions

Biomedical Resources

slide-22
SLIDE 22

IE System for DDI

Corpus TXT Text analysis XML annotated with shallow syntactic and semantic information Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions

Biomedical Resources

slide-23
SLIDE 23

Text Analysis by UMLS MetaMap program (MMTx)

Corpus TXT

XML annotated with shallow syntactic and semantic information from UMLS

UMLS MetaMap (MMTx): T ext analysis

Unified Medical Language System (UMLS)

slide-24
SLIDE 24

Sentence Splitting

slide-25
SLIDE 25

Shallow Syntactic Information

slide-26
SLIDE 26

Tokenization

slide-27
SLIDE 27

Semantic Information from UMLS

slide-28
SLIDE 28

Drug Name Recognition

Corpus TXT Text analysis Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions

WHOINN affixes UMLS

XML annotated with shallow syntactic and semantic information

slide-29
SLIDE 29

Is MMTx enough to recognize Drugs?

New Drugs Date of Approval Valturna (aliskiren and valsartan) Tablets September 17, 2009 Influenza A (H1N1) 2009 Monovalent Vaccine September 15, 2009 Zirgan (ganciclovir) Ophthalmic Gel September 15, 2009 Vibativ (telavancin) Injection September 11, 2009 Bepreve (bepotastine) Ophthalmic Solution September 8, 2009 Metozolv ODT Orally Disintegrating Tablets September 4, 2009 Intuniv (guanfacine) Extended Release Tablets September 2, 2009 Zenpep (pancrelipase) Capsules August 27, 2009 Sabril (vigabatrin) Tablets and Oral Solution August 21, 2009 Hiberix Solution for Intramuscular Injection August 19, 2009 Extavia (interferon beta-1b) August 14, 2009 Saphris (asenapine) Sublingual Tablets August 13, 2009 Embeda Extended Release Capsules August 13, 2009 Livalo (pitavastatin) Tablets August 3, 2009

slide-30
SLIDE 30

Is MMTx enough to classify Drugs?

UMLS Semantic Types for drugs:

Clinical Drug (clnd): a pharmaceutical preparation as produced by the manufacturer. Pharmacological substance (phsu): a substance used in the treatment or prevention of pathologic disorders. Antibiotic (antb): A pharmacologically active compound produced by growing microorganisms which kill or inhibit growth of other microorganisms.

slide-31
SLIDE 31

WHO affixes for identifying and classifying drugs

Affixes WHOINN Drug Family Pattern Drugs

  • pristin

Antibacterials, pristinamycin derivatives

[A-Za-z0-9]*[pristin] Efepristin

  • gatran

Antithrombotic agents

[A-Za-z0-9]*[gatran] Dabigatran

  • tinib

Antineoplastic agents

[A-Za-z0-9]*[tinib]

Dasatinib, Sunitinib, Nilotinib

  • mycin
  • Antibiotics

[A-Za-z0-9]*[mycin]

T anespimycin

 Affix-based classification obtains an accuracy rate of

75%

slide-32
SLIDE 32

Drug name recognition and classification in biomedical texts (Segura-Bedmar et al., 2008)

slide-33
SLIDE 33

Drug Anaphora Resolution

Corpus TXT Text analysis Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions XML annotated with shallow syntactic and semantic information

Biomedical Resources

slide-34
SLIDE 34

Levofloxacin is one of the most commonly prescribed antibiotics in clinical practice. Several case reports have indicated that this drug may signicantly potentiate the anticoagulation effect of warfarin.

DDI MAY POTENTIATE( LEVOFLOXACIN , WARFARIN )

How Anaphora Resolution helps?

slide-35
SLIDE 35

Score-based approach for Anaphora Resolution in Drug- Drug Interacion Documents (Segura-Bedmar et al., 2009a) NLDB 2009 DrugNerAR: Linguistic Rule- Based Anaphora Resolution for DDI Extraction in pharmacological documents (Segura-Bedmar et al., 2009b To appear in DTMBIO 2009

slide-36
SLIDE 36

Anaphora Resolution in DDI Documents

Identification of anaphoric expressions Selection of candidate antecedents Ordering candidate antecedents

slide-37
SLIDE 37

Scoring candidates

Pronominal Nominal

slide-38
SLIDE 38
  • Since the concomitant administration of

[warfarin] with [amiodarone] increases the prothrombin time by 100% after 3 to 4 days, the dose of the anticoagulant should be reduced by one-third to one-half, and prothrombin times should be monitored closely.

Linguistic based selection

slide-39
SLIDE 39
  • Since the concomitant administration of

[warfarin] with [amiodarone] increases the prothrombin time by 100% after 3 to 4 days, the dose of the anticoagulant should be reduced by one-third to one-half, and prothrombin times should be monitored closely.

Linguistic based Selection

slide-40
SLIDE 40

Linguistic based selection

[Quinidine and procainamide] doses should be reduced by one-third when either is administered with amiodarone. Plasma levels of flecainide have been reported to increase in the presence of oral amiodarone; because of this, the dosage of flecainide should be adjusted when these drugs are administered concomitantly.

slide-41
SLIDE 41

Anaphora resolution results

slide-42
SLIDE 42

Drug-Drug Interaction Extraction

Corpus TXT Text analysis Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions XML annotated with shallow syntactic and semantic information

Biomedical Resources

slide-43
SLIDE 43

Approaches for DDI Detection

DDI Extraction Drug interactions

Syntactic Information and Pattern Matching. Subsequence Kernel Method (Giuliano et al., 2006)

slide-44
SLIDE 44

Pharmacological patterns

Patterns defined by our pharmacist.

<DRUG> INTERACT WITH <DRUG>. <DRUG> (INCREASE|DECREASE|...) <DRUG EFFECTS> <DRUG> INTERFERE WITH <DRUG PROPERTIES> CONCURRENT USE OF <DRUG> WITH <DRUG> (INCREASE| DECREASE|...) <DRUG PROPERTIES> <DRUG> INHIBIT <DRUG PROPERTIES> CO-ADMINISTRATION OF <DRUG> AND <DRUG> RESULT IN <DRUG PROPERTIES> <DRUG EFFECTS> OF <DRUG> BE (ENHANCED|REDUCED|...) BY <DRUG>

slide-45
SLIDE 45

The pressor effects of [catecholamines such as dopamine or norepinephrine]_APOS are enhanced by Bretylium T

  • sylate.

which can be interpreted as: 1) The pressor effects of catecholamines are enhanced by Bretylium 2) The pressor effects of dopamine are enhanced by Bretylium 3) The pressor effects of norepinephrine are enhanced by Bretylium

How syntactic information helps?

<DRUG EFFECT> OF (DRUG|APOS) BE <INTERACT_VERB> BY (DRUG|APOS) 1) DDI increase ( BRETYLIUM TOSYLATE, CATECHOLAMINES ) 2) DDI increase (BRETYLIUM TOSYLATE, DOPAMINE) 3) DDI increase (BRETYLIUM TOSYLATE, NOREPINEPHRINE)

Detecting appositive structures

slide-46
SLIDE 46

Aspirin may interact with [probenecid, sulfinpyrazone, and phenylbutazone]_COORD

which can be interpreted as: 1) Aspirin may interact with probenecid 2) Aspirin may interact with sulfinpyrazone. 3) Aspirin may interact with phenylbutazone

How syntactic information helps?

(DRUG|COORD) INTERACT WITH (DRUG|COORD)

Detecting coordinative propositions

1) DDI MAY INTERACT( ASPIRIN , PROBENECID) 2) DDI MAY INTERACT ( ASPIRIN , SULFINPYRAZONE) 3) DDI MAY INTERACT ( ASPIRIN , PHENYLBUTAZONE)

slide-47
SLIDE 47

[The Cmax of norethindrone was 13% higher] when [it was coadministered with gabapentin]

What is the problem?

Complex sentences: Interactions could span several clauses

In a pharmacokinetic substudy in patients with congestive heart failure receiving furosemide or digoxin in whom therapy with FLOLAN was initiated, apparent oral clearance values for furosemide (n = 23) and digoxin (n= 30) were decreased by 13% and 15%, respectively,

  • n the second day of therapy and had returned

to baseline values by day 87.

slide-48
SLIDE 48

How to improve the results?

DDI Extraction Drug interactions

Syntactic Simplification and Pattern Matching. Kernel Method for DDI Extraction.

slide-49
SLIDE 49

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

DDI(TNF antagonist, ORENCIA) DDI(ORENCIA, TNF antagonist) DDI(TNF antagonist, TNF antagonist)

ML and Kernel approach to DDI

  • Classification problem
  • Every drug pair is an instance
  • Relation is reciprocal, drug order is not important
  • Unbalanced dataset: 10% positives
slide-50
SLIDE 50

Shallow sequence kernels

Learning algorithm: SVM Shallow representation of sentences (no syntax)

– Global Context Kernel: whole sentence info

  • Before - Between
  • Between
  • Between - After

– Local Context Kernel: entity info

Claudio Giuliano, Alberto Lavelli, Lorenza Romano. Exploiting Shallow Linguistic Information for Relation Extraction from Biomedical Literature, EACL 2006

KSS(A1,A2)=KGC(A1,A2) + KLC(A1,A2)

slide-51
SLIDE 51

Global Context: Fore-Between

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KGC(A1,A2)=KFB(A1,A2)

slide-52
SLIDE 52

Global Context: Between

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KGC(A1,A2)=KFB(A1,A2) + KB(A1,A2)

slide-53
SLIDE 53

Global Context: Between-After

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KGC(A1,A2)=KFB(A1,A2) + KB(A1,A2) + KBA(A1,A2)

slide-54
SLIDE 54

Global Context: N-gram information

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

concurrent 1 administration 1

  • f

1 a 1 with 1 concurrent_administration 1 administration_of 1

  • f_a

1 concurrent_administration_of 1 administration_of_a 1

Φc(A1)={

}

KGC(A1,A2)=KFB(A1,A2) KGC(A1,A2)=KFB(A1,A2) + KB(A1,A2) + KBA(A1,A2)

slide-55
SLIDE 55

Local Context: Left Entity

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KLC(A1,A2)=KLEFT(A1,A2) + KRIGHT(A1,A2)

slide-56
SLIDE 56

Local Context: Right Entity

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KLC(A1,A2)=KLEFT(A1,A2) + KRIGHT(A1,A2)

slide-57
SLIDE 57

Local Context: Feature based

Concurrent administration of a TNF antagonist with ORENCIA has been associated with an increased risk of serious infections and no significant additional efficacy over use of the TNF antagonists alone.

KLC(A1,A2)=KLEFT(A1,A2) + KRIGHT(A1,A2)

Token(of) 1 Lemma(of) 1 PoS(of) 1 Stem(of) 1 Ortho(of) 1

....

1 Token(with) 1 Lemma(with) 1 PoS(with) 1 Stem(with) 1

Φc(A1)={

}

slide-58
SLIDE 58

Comparative Patterns, Syntactic + Patterns, kernels

slide-59
SLIDE 59

IE System for DDI

Corpus TXT Text analysis XML annotated with shallow syntactic and semantic information Drug Name Recognition Anaphora Resolution DDI Extraction + drugs and other biomedical concepts + anaphoras + Drug interactions

Biomedical Resources

slide-60
SLIDE 60

Drug Drug Interaction detection is a promising application for IE and NLP