effective transfer learning for clinical applications
play

Effective transfer learning for clinical applications Benjamin van - PowerPoint PPT Presentation

Effective transfer learning for clinical applications Benjamin van der Burgh (LIACS) OVERVIEW 1. Transfer learning in NLP 2. Experiments on Dutch data 3. Well-being tracking using clinical journals 2 PROJECT BACKGROUND Physiotherapists


  1. Effective transfer learning for clinical applications Benjamin van der Burgh (LIACS)

  2. OVERVIEW 1. Transfer learning in NLP 2. Experiments on Dutch data 3. Well-being tracking using clinical journals 2

  3. PROJECT BACKGROUND ▰ Physiotherapists keep journals ▰ Can we quantify well-being from text? ▰ Not a conventional task, no labeled data ▰ What can we do about it? 3

  4. 1 TRANSFER LEARNING Learning with a head start 4

  5. TRANSFER LEARNING ▰ Deep neural networks ▰ First train model for different but similar task ▰ Learns reusable representation / features ▰ Replace last layer(s) to adjust to target ▰ Continue training the model on target dataset 5

  6. Source: http://ruder.io/nlp-imagenet/ 6

  7. TRANSFER LEARNING IN NLP ▰ Generic task in NLP: language modelling ▰ Example: “I’m not half the man I …” ▰ Dataset source: Wikipedia, CommonCrawl, etc. ▰ Suitable architecture ▻ RNN-based: ULMFiT (AWD-LSTM) ▻ Self-attention models: Transformer, BERT 7

  8. 8

  9. FINE-TUNING LANGUAGE MODEL ▰ Adjust model to idiosyncrasies of target ▰ Example: “Patient has pain in the …“ ▰ Use language model as encoder for target 9

  10. THREE-STAGE PROCESS Generic Fine-tuned Target LM LM Task 10

  11. 2 EXPERIMENTS Transfer learning on Dutch data 11

  12. EXPERIMENTS WITH ULMFIT ▰ Language model trained on Dutch Wikipedia ▰ Dataset of 110k Dutch book reviews [1] ▻ {1, 2} → negative ▻ {4, 5} → positive ▻ {3} → neutral ▰ 18836 training examples, 50% pos / 50% neg [1] 110k Dutch Book Reviews Dataset for Sentiment Analysis 12 https://benjaminvdb.github.io/110kDBRD

  13. EXPERIMENTAL RESULTS ▰ Training language model took days ▰ Fine-tuning encoder took an hour ▰ Training classifier took minutes ▰ Accuracy 94% ▰ Off-the-shelf software and hardware 13

  14. 14

  15. ADVANTAGES 1. Improved data efficiency 2. Models can be shared 3. Or even collaboratively trained → Federated Learning [1] [1] Federated Learning: Collaborative Machine Learning without Centralized Training Data 15 https://ai.googleblog.com/2017/04/federated-learning-collaborative.html

  16. 3 WELL-BEING TRACKING Learning from subjective data 16

  17. WELL-BEING TRACKING ▰ Well-being tracking using journal text (SOAP) ▰ Multivariate regression: positive and negative ▰ No labeled data available 17

  18. LABEL DATA Experts quantify the contents of a journal entry on a positive and negative axis. 18

  19. 19

  20. 4 TAKEAWAYS … no, not that kind of takeaway 20

  21. SUMMARY ▰ Transfer learning in NLP possible ▰ State-of-the-art while easy-to-use ▰ Unlock knowledge in subjective data ▰ Models can be shared 21

  22. RELATED WORK ▰ Bert-as-a-service [1] ▰ Self-supervised learning for image data [2] ▰ Sentiment analysis using text in psychiatry [3] [1] bert-as-a-service: https://github.com/hanxiao/bert-as-service [2] Selfie: Self-supervised Pretraining for Image Embedding: https://arxiv.org/abs/1906.02940 [3] Distinguishing Clinical Sentiment: The Importance of Domain Adaptation in Psychiatric 22 Patient Health Records: https://arxiv.org/abs/1904.03225

  23. FURTHER RESEARCH ▰ Can privacy be preserved when models are shared? ▰ How can we make machine learning more accessible? ▰ What can be learned from subjective data? ▰ How to explain ‘deep results’? 23

  24. SHARE MODELS Help patients while preserving privacy You can download mine from: https://github.com/benjaminvdb/110kDBRD 24

  25. THANKS! Any questions? You can find me at @BenjaminBurgh & b.van.der.burgh@liacs.leidenuniv.nl 25

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend