nlp research group mit
play

NLP Research Group: MIT Wuwei Lan, Wei Sun NLP@MIT Introduction - PowerPoint PPT Presentation

NLP Research Group: MIT Wuwei Lan, Wei Sun NLP@MIT Introduction group @ CSAIL 2 Professors + 9 Ph.D. + 4 Masters + other undergraduates Faculty Regina Barzilay and Tommi S. Jaakkola Research Focus very broad :


  1. NLP Research Group: MIT Wuwei Lan, Wei Sun

  2. NLP@MIT • Introduction • group @ CSAIL • 2 Professors + 9 Ph.D. + 4 Masters + other undergraduates • Faculty • Regina Barzilay and Tommi S. Jaakkola • Research Focus • very broad : Information retrieval, deep reinforcement learning, recommender systems, Computational biology, Semantic representation and so on. • Productivity • •6~7 top conference papers / year

  3. Regina Barzilay • Reliable Information Extraction • Reinforcement learning by acquiring external evidence (EMNLP 2016) • Interpretable Neural Models • Rationalizing Neural Predictions (EMNLP 2016)

  4. Tommi S. Jaakkola Biography 1992, M.S in theoretical physics from Helsinki University of Technology 1997, PhD in computational neuroscience from MIT 1998-now Professor at MIT

  5. Research Synopsis • On the theoretical side statistical inference and estimation • On the applied side NLP , computational biology, recommender, information retrieval

  6. On-going projects and papers 1. Perturbation models Structured prediction: From gaussian perturbations to linear-time principled algorithms. In Uncertainty in Artificial Intelligence (UIA), 2016 1. Syntactic and semantic parsing word embeddings as metric recovery in semantic spaces. TACL 2016 1. Recommender systems Controlling privacy in recommender systems. NIPS 2014 1. computational biology Learning population-level diffusions with generative {RNN}s. ICML 2016 1. information retrieval/extraction Food adulteration detection using neural networks. EMLP, 2016

  7. What’s interesting? Topic Modeling in Twitter: aggregating tweets by conversations ICWSM 2016 1. Background: Topic Modeling Techniques: Latent Dirichlet Allocation(LDA) and Author-Topic Model (ATM) -> For sufficient long documents with regular vocabulary and grammatical structure 2. what’s about the tweets? (short document and noisy data) -> preprocessing tweets for ungrammatical structure and informal language -> pooling techniques to aggregate tweets into long documents: User-pooling, Hashtag-pooling and conversation-pooling 3. Can we build a model solve the topic modeling problem in twitter directly?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend