Effective transfer learning for clinical applications Benjamin van - PowerPoint PPT Presentation

Effective transfer learning for clinical applications Benjamin van der Burgh (LIACS)

OVERVIEW 1. Transfer learning in NLP 2. Experiments on Dutch data 3. Well-being tracking using clinical journals 2

PROJECT BACKGROUND ▰ Physiotherapists keep journals ▰ Can we quantify well-being from text? ▰ Not a conventional task, no labeled data ▰ What can we do about it? 3

1 TRANSFER LEARNING Learning with a head start 4

TRANSFER LEARNING ▰ Deep neural networks ▰ First train model for different but similar task ▰ Learns reusable representation / features ▰ Replace last layer(s) to adjust to target ▰ Continue training the model on target dataset 5

Source: http://ruder.io/nlp-imagenet/ 6

TRANSFER LEARNING IN NLP ▰ Generic task in NLP: language modelling ▰ Example: “I’m not half the man I …” ▰ Dataset source: Wikipedia, CommonCrawl, etc. ▰ Suitable architecture ▻ RNN-based: ULMFiT (AWD-LSTM) ▻ Self-attention models: Transformer, BERT 7

FINE-TUNING LANGUAGE MODEL ▰ Adjust model to idiosyncrasies of target ▰ Example: “Patient has pain in the …“ ▰ Use language model as encoder for target 9

THREE-STAGE PROCESS Generic Fine-tuned Target LM LM Task 10

2 EXPERIMENTS Transfer learning on Dutch data 11

EXPERIMENTS WITH ULMFIT ▰ Language model trained on Dutch Wikipedia ▰ Dataset of 110k Dutch book reviews [1] ▻ {1, 2} → negative ▻ {4, 5} → positive ▻ {3} → neutral ▰ 18836 training examples, 50% pos / 50% neg [1] 110k Dutch Book Reviews Dataset for Sentiment Analysis 12 https://benjaminvdb.github.io/110kDBRD

EXPERIMENTAL RESULTS ▰ Training language model took days ▰ Fine-tuning encoder took an hour ▰ Training classifier took minutes ▰ Accuracy 94% ▰ Off-the-shelf software and hardware 13

ADVANTAGES 1. Improved data efficiency 2. Models can be shared 3. Or even collaboratively trained → Federated Learning [1] [1] Federated Learning: Collaborative Machine Learning without Centralized Training Data 15 https://ai.googleblog.com/2017/04/federated-learning-collaborative.html

3 WELL-BEING TRACKING Learning from subjective data 16

WELL-BEING TRACKING ▰ Well-being tracking using journal text (SOAP) ▰ Multivariate regression: positive and negative ▰ No labeled data available 17

LABEL DATA Experts quantify the contents of a journal entry on a positive and negative axis. 18

4 TAKEAWAYS … no, not that kind of takeaway 20

SUMMARY ▰ Transfer learning in NLP possible ▰ State-of-the-art while easy-to-use ▰ Unlock knowledge in subjective data ▰ Models can be shared 21

RELATED WORK ▰ Bert-as-a-service [1] ▰ Self-supervised learning for image data [2] ▰ Sentiment analysis using text in psychiatry [3] [1] bert-as-a-service: https://github.com/hanxiao/bert-as-service [2] Selfie: Self-supervised Pretraining for Image Embedding: https://arxiv.org/abs/1906.02940 [3] Distinguishing Clinical Sentiment: The Importance of Domain Adaptation in Psychiatric 22 Patient Health Records: https://arxiv.org/abs/1904.03225

FURTHER RESEARCH ▰ Can privacy be preserved when models are shared? ▰ How can we make machine learning more accessible? ▰ What can be learned from subjective data? ▰ How to explain ‘deep results’? 23

SHARE MODELS Help patients while preserving privacy You can download mine from: https://github.com/benjaminvdb/110kDBRD 24

THANKS! Any questions? You can find me at @BenjaminBurgh & b.van.der.burgh@liacs.leidenuniv.nl 25

Effective transfer learning for clinical applications Benjamin van - PowerPoint PPT Presentation

Effective transfer learning for clinical applications Benjamin van der Burgh (LIACS) OVERVIEW 1. Transfer learning in NLP 2. Experiments on Dutch data 3. Well-being tracking using clinical journals 2 PROJECT BACKGROUND Physiotherapists

Industrial Transfer Learning Introduction to Industrial Transfer Learning Industrial Transfer

Radiative Transfer Radiative Transfer Radiative transfer is a branch of atmospheric physics. We

Heat Transfer Heat Transfer Introduction Practical occurrences, applications, factors

Transfer United: Partnerships to Foster Transfer Student Success Tuesday, November 5 th

Transfer Learning Eu Wern Teh What are we covering? Why transfer learning? Fine

EFFECTIVE EFFECTIVE EFFECTIVE EFFECTIVE COMMUNICATIONS COMMUNICATIONS People First Language

Why e Learning can actually be effective for learning an understanding from psycho

Technology Transfer or Knowledge Transfer? Russ Somma, Ph.D. SommaTech,LLC Affiliate of IPS

Technology Transfer and Commercialisation 1 05/06/2015 1 Tech Transfer and Commercialisation

Transfer! The VIEWS of Practitioners The RESULTS from ROI Dr Paul Donovan NUIM TRANSFER THAT

Transfer Transfer Transitions: Transitions: First Semester First Semester Persistence and

Remit #2 Elimination of Transfer and Settlement What does transfer mean? Transfer

Transfer of transfert Transfer principles Thomas Hales and Julia Gordon December 2015 The

Regional STEMI Transfer Systems: Regional STEMI Transfer Systems: Regional STEMI Transfer

TRANSFER: MYTHS & FACTS ANNE HABERKERN, DIVISION DIRECTOR TRANSFER & CURRICULAR

What Is Multicast? Key: Unicast transfer Broadcast transfer Unicast Multicast transfer

A Bayesian test of the lineage-specificity of word-order correlations Gerhard Jger Tbingen

Status of DUNE DAQ Hardware/Firmware Development Status David Cussans DUNE DAQ Meeting 15 th

Stability Analysis For Unsupervised Learning Dr. Derek Greene Insight @ UCD April 2014

Cluster Validity Hypothesis Random Graph Hypothesis Random Label Hypothesis Relative Criteria

15-780: Graduate AI Lecture 19. Learning Geoff Gordon (this lecture) Tuomas Sandholm TAs Sam

Word order variation in Dutch and German verb clusters Liesbeth Augustinus HeadLex16 - 29 July,

Logic for Computer Science 01 Intro Wouter Swierstra University of Utrecht 1 Today

MARINUS A. KAASHOEK: half a century of operator theory in Amsterdam Opening Lecture IWOTA 2017

Effective transfer learning for clinical applications Benjamin van - PowerPoint PPT Presentation

Effective transfer learning for clinical applications Benjamin van der Burgh (LIACS) OVERVIEW 1. Transfer learning in NLP 2. Experiments on Dutch data 3. Well-being tracking using clinical journals 2 PROJECT BACKGROUND Physiotherapists

Industrial Transfer Learning Introduction to Industrial Transfer Learning Industrial Transfer

Radiative Transfer Radiative Transfer Radiative transfer is a branch of atmospheric physics. We

Heat Transfer Heat Transfer Introduction Practical occurrences, applications, factors

Transfer United: Partnerships to Foster Transfer Student Success Tuesday, November 5 th

Transfer Learning Eu Wern Teh What are we covering? Why transfer learning? Fine

EFFECTIVE EFFECTIVE EFFECTIVE EFFECTIVE COMMUNICATIONS COMMUNICATIONS People First Language

Why e Learning can actually be effective for learning an understanding from psycho

Technology Transfer or Knowledge Transfer? Russ Somma, Ph.D. SommaTech,LLC Affiliate of IPS

Technology Transfer and Commercialisation 1 05/06/2015 1 Tech Transfer and Commercialisation

Transfer! The VIEWS of Practitioners The RESULTS from ROI Dr Paul Donovan NUIM TRANSFER THAT

Transfer Transfer Transitions: Transitions: First Semester First Semester Persistence and

Remit #2 Elimination of Transfer and Settlement What does transfer mean? Transfer

Transfer of transfert Transfer principles Thomas Hales and Julia Gordon December 2015 The

Regional STEMI Transfer Systems: Regional STEMI Transfer Systems: Regional STEMI Transfer

TRANSFER: MYTHS &amp; FACTS ANNE HABERKERN, DIVISION DIRECTOR TRANSFER &amp; CURRICULAR

What Is Multicast? Key: Unicast transfer Broadcast transfer Unicast Multicast transfer

A Bayesian test of the lineage-specificity of word-order correlations Gerhard Jger Tbingen

Status of DUNE DAQ Hardware/Firmware Development Status David Cussans DUNE DAQ Meeting 15 th

Stability Analysis For Unsupervised Learning Dr. Derek Greene Insight @ UCD April 2014

Cluster Validity Hypothesis Random Graph Hypothesis Random Label Hypothesis Relative Criteria

15-780: Graduate AI Lecture 19. Learning Geoff Gordon (this lecture) Tuomas Sandholm TAs Sam

Word order variation in Dutch and German verb clusters Liesbeth Augustinus HeadLex16 - 29 July,

Logic for Computer Science 01 Intro Wouter Swierstra University of Utrecht 1 Today

MARINUS A. KAASHOEK: half a century of operator theory in Amsterdam Opening Lecture IWOTA 2017

TRANSFER: MYTHS & FACTS ANNE HABERKERN, DIVISION DIRECTOR TRANSFER & CURRICULAR