Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of - PowerPoint PPT Presentation

Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of Information Sciences School of Information Sciences University of Illinois at University of Illinois at Urbana-Champaign Urbana-Champaign Neil R. Smalheiser Matthew Ross Department of Psychiatry Department of Economics University of Illinois at Chicago Ohio State University

Medical Subject Headings Controlled vocabulary created ● MeSH Heading Neuroimaging by NLM for indexing biomedical documents Tree Number E01.370.350.578 ● MeSH is hierarchical Tree Number E01.370.376.537 Divided into 16 top level ● categories (anatomy, Tree Number E05.629 organisms, diseases, etc) Scope Note Non-invasive methods ● A MeSH term can appear in of visualizing the CENTRAL NERVOUS more than one place in the SYSTEM, especially the MeSH hierarchy brain, by various ● About 27,000 terms, 10-12 imaging modalities. terms per paper

‘Neuroimaging’ in the MeSH Hierarchy

Problem Definition + Motivation Medical subject headings (MeSH) are useful but aren’t available everywhere. ● Assigning terms manually is labor intensive; estimated cost of annotating one ● article is ~7.50 GBP (8.70 EUR / 9.40 USD)¹ There are many existing MeSH classification systems (MTI, DeepMeSH, ● MeSHLabeler), but all are optimized for MEDLINE. Our work focuses on building a generalized MeSH classifier that can work with ● many different kinds of documents (patents, grants, etc). Mork, J. et al. (2013) The NLM medical text indexer system for indexing biomedical literature. In: BioASQ@CLEF

MeSH Prediction Challenges ● Multilabel classification problem (each MeSH heading is a class label) The number of headings varies. ● ● MeSH headings have a highly biased distribution. Some terms are extremely common, others very rarely used. Example: ‘Humans’ has about ~13 million occurrences, ‘Portion Size’ ~ 200 occurrences ● The priors of MeSH headings likely to vary across domains. Example: ‘Inventions’ highly common in the patent literature. ● Vocabulary and semantics vary across domains, complicating an NLP approach.

Methodology: Sources of Evidence References “References of Documents by text References of References” similarity of abstract similar documents ● Our method draw on two primary sources of information for any given document: The set of references to MEDLINE The 15 most similar record abstracts within MEDLINE ● We extract, weight and rank all of the MeSH terms in each set ● Experimental tool weights calculates a simple additive score ● Recent work trained weights empirically on MEDLINE records using logistic regression

Methodology: Tools Absim: returns the most similar MEDLINE records by BM25 text similarity to abstracts from an input text: http://abel.lis.illinois.edu/cgi-bin/absim/search.py Patci: a tool for matching patent citations to MEDLINE records. Can look up US patents by ID, or by entering citation string: http://abel.lis.illinois.edu/cgi-bin/patci/search.pl

Methodology: Preliminary Weighting Function

Evaluation 1. Quantitative assessment using MEDLINE records 2. Case study of MEDLINE papers 3. Evaluation of 21 NIH grants 4. Case study of three patents 5. Comparison of MeSHier with MTI ‘MeSH on Demand’

Evaluation: MEDLINE Data: Tested on 1600 papers, selecting 100 papers for every year from 2000 to 2015. For each year, we selected all papers that had an abstract, MeSH terms, and at least one citation. Of these, we randomly selected 100. Methods: We trained three logistic regression classifiers w/ 10-fold cross validation: 1. Using only direct citations and their references 2. Using only similar abstract records and their references 3. Using both together

Evaluation: Model Performance Model Precision Recall F1 Score Citation Only 0.41 0.47 0.44 Absim Only 0.39 0.45 0.42 Combined 0.43 0.50 0.46 Predicted terms that were not direct matches were often conceptually similar to assigned term, or otherwise relevant to the paper.

PMID Title Predicted MeSH Actual MeSH 23894639 Has large-scale Authorship; Patents as Algorithms; Names; named-entity network Topic; Bibliometrics; Publications analysis been resting on Publishing; Models, a flawed assumption? Theoretical; MEDLINE; Algorithms; Names; Cooperative Behavior; Research; Periodicals as Topic; Neural Networks (Computer) ; Computer Simulation; Research Personnel; Nerve Net MEDLINE Case Study: What is a true error?

Findings + Conclusions ● Evaluating accuracy can be challenging; sometimes difficult to differentiate between true error and a plausible MeSH assignment. ● System has limitations with terms related to organisms due to imbalanced class distribution; working on a dedicated model for classifying organisms (“Humans” vs animal models). Key question: how to best quantify performance in non-MEDLINE records? ●

Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of - PowerPoint PPT Presentation

Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of Information Sciences School of Information Sciences University of Illinois at University of Illinois at Urbana-Champaign Urbana-Champaign Neil R. Smalheiser Matthew Ross

Medline Physician Office Presented to: Physician Partners Why Medline? Family owned since

Thomas Hhn 4. Juni 2009 TU-Berlin, Berlin Why to How to Worksheets mesh ? mesh ? Outline

Mesh Basics Mesh Basics 1 Spring 2010 Definitions: Definitions: 1/2 Definitions:

Mesh Networks | Hacking The T3lc0 Model http://arig.org.il What's a Mesh Anyway ? Mesh =

A Service Mesh Is Easy To Swallow In Small Pieces Andrew Jenkins Eng Lead, Aspen Mesh

W ir eless Mesh Netw or k W ir eless Mesh Netw or k Technical Overview Technical Overview Danny

What Makes for a Good Mesh? CS101 Meshing Winter 2007 1 Mesh Quality What makes a mesh

Smoothing Gianpaolo Palma Triangle Mesh List of vertices + List of triangle as triple of vertex

Non-Mesh Treatment of SUI Shachar Aharony MD AUA SUI Guidelines 2017 Shachar Aharony MD,

Best Practices Workshop: Parts & Mesh-Based Operations Overview What are Parts and Mesh

Mesh Network Information ArrowSpan Wireless Mesh Network Solutions (China) Sept 2007 1

What is a mesh network? A mesh network is created when many devices have established

13 Mesh Animation Steve Marschner CS5625 Spring 2020 Basic surface deformation methods Blend

CREATIVE FREEDOM FOR COMPUTATIONAL MESH GENERATION IN DEAL.II Nicola Giuliani , Luca Heltai

Mesh Models (Chapter 8) 1. Overview of Mesh and Related models. a. Diameter: The linear

05 Mesh Animation Steve Marschner CS5625 Spring 2019 Basic surface deformation methods Blend

Stage two Natural selection: -directional change relative to environmental context -acts on

MA111: Contemporary mathematics . Jack Schmidt University of Kentucky October 17, 2012 Entrance

Nanotechnology and Standing as a Christian in the Academy James M. Tour Rice University

The Healing Journey (Healing from within) Alastair Cunningham OCI/PMH/UHN OCI/PMH/UHN Healing:

Todays Presenters Kelli Ham Kelli Ham Consumer Health and Technology Coordinator, National

Cochrane Register of Studies and Cochrane Register of Studies and importing references into

Distilling Conceptual Connections from MeSH Co-Occurrences Padmini Srinivasan, Dimitar Hristovski

Heat Treating Critical manufacturing processes for the Medical Device Industry Bruce Dall,

Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of - PowerPoint PPT Presentation

Predicting MeSH Beyond MEDLINE Adam Kehoe Vetle Torvik School of Information Sciences School of Information Sciences University of Illinois at University of Illinois at Urbana-Champaign Urbana-Champaign Neil R. Smalheiser Matthew Ross

Medline Physician Office Presented to: Physician Partners Why Medline? Family owned since

Thomas Hhn 4. Juni 2009 TU-Berlin, Berlin Why to How to Worksheets mesh ? mesh ? Outline

Mesh Basics Mesh Basics 1 Spring 2010 Definitions: Definitions: 1/2 Definitions:

Mesh Networks | Hacking The T3lc0 Model http://arig.org.il What's a Mesh Anyway ? Mesh =

A Service Mesh Is Easy To Swallow In Small Pieces Andrew Jenkins Eng Lead, Aspen Mesh

W ir eless Mesh Netw or k W ir eless Mesh Netw or k Technical Overview Technical Overview Danny

What Makes for a Good Mesh? CS101 Meshing Winter 2007 1 Mesh Quality What makes a mesh

Smoothing Gianpaolo Palma Triangle Mesh List of vertices + List of triangle as triple of vertex

Non-Mesh Treatment of SUI Shachar Aharony MD AUA SUI Guidelines 2017 Shachar Aharony MD,

Best Practices Workshop: Parts &amp; Mesh-Based Operations Overview What are Parts and Mesh

Mesh Network Information ArrowSpan Wireless Mesh Network Solutions (China) Sept 2007 1

What is a mesh network? A mesh network is created when many devices have established

13 Mesh Animation Steve Marschner CS5625 Spring 2020 Basic surface deformation methods Blend

CREATIVE FREEDOM FOR COMPUTATIONAL MESH GENERATION IN DEAL.II Nicola Giuliani , Luca Heltai

Mesh Models (Chapter 8) 1. Overview of Mesh and Related models. a. Diameter: The linear

05 Mesh Animation Steve Marschner CS5625 Spring 2019 Basic surface deformation methods Blend

Stage two Natural selection: -directional change relative to environmental context -acts on

MA111: Contemporary mathematics . Jack Schmidt University of Kentucky October 17, 2012 Entrance

Nanotechnology and Standing as a Christian in the Academy James M. Tour Rice University

The Healing Journey (Healing from within) Alastair Cunningham OCI/PMH/UHN OCI/PMH/UHN Healing:

Todays Presenters Kelli Ham Kelli Ham Consumer Health and Technology Coordinator, National

Cochrane Register of Studies and Cochrane Register of Studies and importing references into

Distilling Conceptual Connections from MeSH Co-Occurrences Padmini Srinivasan, Dimitar Hristovski

Heat Treating Critical manufacturing processes for the Medical Device Industry Bruce Dall,

Best Practices Workshop: Parts & Mesh-Based Operations Overview What are Parts and Mesh