Clinical NLP, PubGene
Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form
November 2017 Dag Are Steenhoff Hov, PubGene AS
1
Clinical NLP, PubGene Clinical trials in Coremine Oncology Text - - PowerPoint PPT Presentation
Clinical NLP, PubGene Clinical trials in Coremine Oncology Text processing and information extraction for surgery planning form November 2017 Dag Are Steenhoff Hov, PubGene AS 1 PubGene, founded 2001 ArrayIt H25K microarray Scientific
November 2017 Dag Are Steenhoff Hov, PubGene AS
1
2
ArrayIt H25K microarray Scientific Literature Coremine Networks
COREMINE Oncology COREMINE Medical COREMINE Platform
Integration of structured and unstructured information
3
Courtesy of DNV-GL (Tore Hartvigsen)
4
quantity
pathways, clinical trials, etc.
considerations have been taken care of
5
– (Somatic) mutations – Copy number changes – gene expression
– Mutation – Gene/Protein – Protein Domains
– Statistics on mutations – Related drugs for targets with change (in progress: also biomarker and sensitivity info) – Pathways for targets with change – Relevant clinical trials for aberrations
6
7
10
11
to relevant clinical trials
mentioned (referred to) in clinical trials
clinicaltrials.gov
and methods for detecting these in trial descriptions
biomarkers related to eligibility, but this is not straightforward
inclusion/exclusion criteria, e.g., negation
and condition for biomarkers
12
– Single-Nucleotide mutations (Cosmic) – Polymorphisms – Fusion genes – Gene regulation (Exp-up/down) – Copy number changes
text:
– Detect explicit mentions – Detect patterns based on gene name and ‘marker’ type, e.g., “GENE amplification” “GENE activating mutation”
10
biomarker
13
14
15
16
Domain knowledge Manual curation Filter
17
BRAF G469A BRAF D594G BRAF V600E EGFR T790M KIF5B/RET CD74/ROS1 KIF5B/ALK BCR/ABL1
18
matching trial to patient
alterations with same/similar effect, e.g., amplification/up-regulation with activating mutation Example: Patient ERBB2 Exp up Trial:
19
20
Courtesy of DNV-GL (Tore Hartvigsen)
Human touch and empathy – with professional skill
Akershus University Hospital (Ahus) Optique project.
Courtesy of DNV-GL (Tore Hartvigsen)
22
Surgery Planning Form (“The Green Form”)
Stage 1: Examination Stage 3: Check/ QA Stage 2: Preparations Structured data Text
DIPS Ahus
Metavision O Metavision I Metavision DKS System System
Metavision Ahus Additional systems To complete the form, data must be collected from a number of systems! This is today done manually.
Courtesy of DNV-GL (Tore Hartvigsen)
23
Metavision O DIPS Metav Ahus production databases Ahus research Databases. Metavision I Metavision DKS
DIPS (EPJ) (EPJ)
DIPS
(EPJ)
(EPJ)
Researchers/ Analysts
Data warehousing is an option
A semantic IT solution and
Health Care Expert users «Ordinary» users
Courtesy of DNV-GL (Tore Hartvigsen)
24 Text mining Solutions provided by the Optique project A semantic IT solution and
Health Care
Structured data Unstructured data (text) Expert users
«Ordinary» users
Courtesy of DNV-GL (Tore Hartvigsen)
25
Fields
26
27
Sentence Status Røyker. Yes Røyker 15-20 om dagen. Yes Ifølge datter er han også storrøyker, 40/ dag siste 50 år. Yes Røykeplaster? Uncertain Tidligere storrøyker. Stopped Ikke røyker og drikker ikke alkohol, tidligere, måteholdent alkoholbruk. No Eks-røyker, lite alkohol. Stopped Text analysis
28
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Page for surgery planning form
Courtesy of DNV-GL (Tore Hartvigsen)
BMI
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Allergy
Courtesy of DNV-GL (Tore Hartvigsen)
Smoking
Courtesy of DNV-GL (Tore Hartvigsen)
Courtesy of DNV-GL (Tore Hartvigsen)
Surgery planning form
42