TextMed: A Multi-Agent System with Reinforcement Learning Agents - - PowerPoint PPT Presentation

textmed a multi agent system with reinforcement learning
SMART_READER_LITE
LIVE PREVIEW

TextMed: A Multi-Agent System with Reinforcement Learning Agents - - PowerPoint PPT Presentation

TextMed: A Multi-Agent System with Reinforcement Learning Agents for Biomedical Text Mining Michael Camara Janyl Jumadinova Oliver Bonham-Carter September 9, 2015 Big Data Biomedical Research PubMed: U.S. National Library of Medicine


slide-1
SLIDE 1

TextMed: A Multi-Agent System with Reinforcement Learning Agents for Biomedical Text Mining

Michael Camara Janyl Jumadinova Oliver Bonham-Carter September 9, 2015

slide-2
SLIDE 2

Big Data

slide-3
SLIDE 3

Biomedical Research

◮ PubMed: U.S. National Library of Medicine free search engine ◮ 24 million records (abstracts and citations) ◮ Annual growth rate of 4%

slide-4
SLIDE 4

Text Mining

◮ Text summarization ◮ Document retrieval ◮ Document classification

slide-5
SLIDE 5

Text Mining

◮ Text summarization ◮ Document retrieval ◮ Document classification ◮ Information extraction

slide-6
SLIDE 6

Preprocessing: Lister

  • 1. Lister downloads and decompresses data
  • 2. Keyword used to obtain relevant abstracts
  • 3. Abstracts divided into datasets
  • 4. Agent assigned to each dataset
slide-7
SLIDE 7

Preprocessing: Abstract Creation

  • 1. Lister downloads and decompresses data
  • 2. Keyword used to obtain relevant abstracts
  • 3. Abstracts divided into datasets
  • 4. Agent assigned to each dataset
slide-8
SLIDE 8

Preprocessing: Dataset Creation

  • 1. Lister downloads and decompresses data
  • 2. Keyword used to obtain relevant abstracts
  • 3. Abstracts divided into datasets
  • 4. Agent assigned to each dataset
slide-9
SLIDE 9

Preprocessing: Agent Allocation

  • 1. Lister downloads and decompresses data
  • 2. Keyword used to obtain relevant abstracts
  • 3. Abstracts divided into datasets
  • 4. Agent assigned to each dataset
slide-10
SLIDE 10

TextMed: Parsing

  • 1. Scan through each document with keyword
slide-11
SLIDE 11

TextMed: MeSH Keyword List

  • 2. Obtain keyword from MeSH (Medical Subject Heading) list
slide-12
SLIDE 12

TextMed: Match Found?

  • 3. Iterate through list until match found
slide-13
SLIDE 13

TextMed: SentiStrength

  • 4. Perform sentiment analysis on keyword match
slide-14
SLIDE 14

TextMed: SentiStrength (cont.)

Sentiment Analysis Example: ”The penicillin successfully treated the condition, but the patient complained

  • f severe side effects afterwards.”
slide-15
SLIDE 15

TextMed: SentiStrength (cont.)

Sentiment Analysis Example: ”The penicillin successfully [+3] treated the condition, but the patient complained

  • f severe [-2] side effects afterwards.”

Sentiment Score = [+3] + [-2] = 1

slide-16
SLIDE 16

TextMed: Reinforcement Learning

  • 5. Perform reinforcement learning:
slide-17
SLIDE 17

TextMed: Reinforcement Learning (cont.)

  • 1. Give command
  • 2. Dog performs an action
  • 3. Give treat if action matches

command

  • 4. Dog tries to maximize treats
slide-18
SLIDE 18

TextMed: Reinforcement Learning (cont.)

  • 1. Provide list of possible actions
  • 2. Agent performs an action
  • 3. Agent receives reward based on how

sentiment changes Rk =

N

  • i=d

|gsk − lsk,d| N

  • 4. Agent tries to optimize reward for

next time

slide-19
SLIDE 19

TextMed: Continue Parsing

  • 6. Continue parsing all keywords, then begin next document
slide-20
SLIDE 20

TextMed: Multiple Agents

  • 7. Multiple agents working simultaneously
slide-21
SLIDE 21

Experimental Setup

◮ Three primary datasets used for experiments ◮ Each dataset obtained using different keywords with Lister

program and PubMed database

◮ Similar pattern of results for each

slide-22
SLIDE 22

Alzheimer’s Dataset: Reward Data

◮ Smaller reward = more optimal, less sentiment fluctuation ◮ Initially high reward, becomes smaller over time

slide-23
SLIDE 23

Alzheimer’s Dataset: Local Sentiment vs Global Sentiment

◮ Sentiment before learning ◮ Highly variable throughout all

documents

◮ Sentiment after learning ◮ Variable at beginning,

stabilizes near end

slide-24
SLIDE 24

Proximity Parameter

Keyword = penicillin. The penicillin successfully treated the condition, but the patient complained of severe side effects afterwards.

slide-25
SLIDE 25

Proximity Parameter

Keyword = penicillin. Proximity = 1: The penicillin successfully [+3] treated the condition, but the patient complained of severe side effects afterwards. Sentiment Score = [+3] + 0 = 3

slide-26
SLIDE 26

Proximity Parameter

Keyword = penicillin. Proximity = 2: The penicillin successfully [+3] treated the condition, but the patient complained of severe side effects afterwards. Sentiment Score = [+3] + 0 = 3

slide-27
SLIDE 27

Proximity Parameter

Keyword = penicillin. Proximity = 3: The penicillin successfully [+3] treated the condition, but the patient complained of severe side effects afterwards. Sentiment Score = [+3] + 0 = 3

slide-28
SLIDE 28

Proximity Parameter

Keyword = penicillin. Proximity = 13: The penicillin successfully [+3] treated the condition, but the patient complained of severe [-2] side effects afterwards. Sentiment Score = [+3] + [-2] = 1

slide-29
SLIDE 29

Alzheimer’s: Proximity/Reward Heatmap

slide-30
SLIDE 30

Future Work

◮ Optimize SentiStrength for biomedical texts ◮ Modify reinforcement learning algorithm ◮ Incorporate data from multiple databases ◮ Incorporate data from medical records ◮ Compare to other systems

slide-31
SLIDE 31

Thank You:

◮ Professor Jumadinova ◮ Oliver Bonham-Carter ◮ Dr. Michael Thelwall ◮ Dr. Barbara Lotze Research Fellowship Fund