SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP - - PowerPoint PPT Presentation

si425 nlp
SMART_READER_LITE
LIVE PREVIEW

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP - - PowerPoint PPT Presentation

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly Most top-tier universities now have NLP faculty (Stanford, Cornell, Berkeley, MIT, UPenn, CMU, Hopkins, etc) Commercial NLP hiring: Google,


slide-1
SLIDE 1

SI425 : NLP

Missing Topics and the Future

slide-2
SLIDE 2

Who cares about NLP?

  • NLP has expanded quickly
  • Most top-tier universities now have NLP faculty (Stanford,

Cornell, Berkeley, MIT, UPenn, CMU, Hopkins, etc)

  • Commercial NLP hiring: Google, Amazon, Microsoft,

IBM, LinkedIn

  • Web startups in Silicon Valley are eating up NLP

students

  • Navy, DoD, NSA, NIH: all funding NLP research

2

slide-3
SLIDE 3

What NLP topics did we miss?

  • Speech Recognition

3

slide-4
SLIDE 4

What NLP topics did we miss?

  • Speech Recognition

4

slide-5
SLIDE 5

What NLP topics did we miss?

  • Machine Translation

5

slide-6
SLIDE 6

What NLP topics did we miss?

  • Machine Translation
  • IBM Models (1 through 5)
  • Neural Network Translation

6

slide-7
SLIDE 7

Machine Translation

7

slide-8
SLIDE 8

Learning Translations

  • Huge corpus of “aligned sentences”.
  • Europarl
  • Corpus of European Parliamant proceedings
  • The EU is mandated to translate into all 21 official languages
  • 21 languages, (semi-) aligned to each other
slide-9
SLIDE 9

Machine Translation Technology

  • Hand-held devices for military
  • Speak english -> recognition -> translation -> generate Urdu
  • Translate web documents
  • Education technology?
  • Doesn’t yet receive much of a focus
slide-10
SLIDE 10

Information Extraction

slide-11
SLIDE 11

What NLP topics did we miss?

  • Dialogue Systems

11

Do you think Anakin likes me?

I don’t care.

slide-12
SLIDE 12

Dialogue Systems

  • Dialogue Systems
  • Why? Heavy interest in human-robot communication.
  • UAVs require teams of 5+ people for each operating

machine

  • Goal: reduce the number of people
  • Give computer high-level dialogue commands, rather than low-level

system commands

12

slide-13
SLIDE 13

Dialogue Systems

  • Dialogue Systems
  • Dialogue is a fascinating topic. Not only do we need to

understand language, but now discourse cues:

  • Questions require replies
  • Imperatives/Commands
  • Acknowledgments: “ok”
  • Back-channels: “uh huh”, “mm hmm”
  • 13
slide-14
SLIDE 14

Dialogue Systems

  • BERT-like models
  • Input:
  • [CLS] how are you ? [SEP] great thanks [END]
  • [CLS] hello [SEP] hi what’s up [END]

14

slide-15
SLIDE 15

El Fin

  • Secret 1:

15

slide-16
SLIDE 16

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous

16

slide-17
SLIDE 17

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

17

slide-18
SLIDE 18

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

  • Secret 2:

18

slide-19
SLIDE 19

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

  • Secret 2:

I tried to teach you skills that have nothing to do with NLP

19

slide-20
SLIDE 20

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

  • Secret 2:

I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis

20

slide-21
SLIDE 21

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

  • Secret 2:

I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis

  • Secret 3:

21

slide-22
SLIDE 22

El Fin

  • Secret 1:

I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results

  • Secret 2:

I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis

  • Secret 3:

I appreciate the hard work you put into the class

22

slide-23
SLIDE 23

23

slide-24
SLIDE 24

What NLP topics did we miss?

Unsupervised Learning

24

slide-25
SLIDE 25

What NLP topics did we miss?

Unsupervised Learning

  • Most of this semester used data

that had human labels.

  • Bootstrapping was our main counter-

example: it is mostly unsupervised.

  • Many many algorithms being

researched to learn language and knowledge without humans, only using text.

25