CMSC 473/673 Natural Language Processing Fall 2019 Instructor: - - PowerPoint PPT Presentation

cmsc 473 673 natural language processing fall 2019
SMART_READER_LITE
LIVE PREVIEW

CMSC 473/673 Natural Language Processing Fall 2019 Instructor: - - PowerPoint PPT Presentation

CMSC 473/673 Natural Language Processing Fall 2019 Instructor: Frank Ferraro Natural language processing ITE 358 ferraro@umbc.edu Semantics Monday: 2:15-3 Tuesday: 11:00-11:30 Vision & language processing by appointment Learning with


slide-1
SLIDE 1

CMSC 473/673 Natural Language Processing Fall 2019

slide-2
SLIDE 2

Instructor: Frank Ferraro

ITE 358 ferraro@umbc.edu Monday: 2:15-3 Tuesday: 11:00-11:30 by appointment Natural language processing Semantics Vision & language processing Learning with low-to-no supervision

slide-3
SLIDE 3

TA: Devajit Asem

Location TBD devajit.asem@umbc.edu Wednesday: 4-5pm Friday: 2-3pm by appointment Databases NLP IR (information retrieval) Web development

slide-4
SLIDE 4

Q: What is NLP (natural language processing?)

slide-5
SLIDE 5

Natural Language Processing tensorflow

slide-6
SLIDE 6

August 2018

slide-7
SLIDE 7

Potential Applications

ASR (automatic speech recognition) Machine translation Natural language generation Document labeling/classification Document summarization Corpus exploration Relation/information extraction Entity identification

slide-8
SLIDE 8

Potential Applications

ASR (automatic speech recognition) Machine translation Natural language generation Document labeling/classification Document summarization Corpus exploration Relation/information extraction Entity identification

Q: What’s an example?

slide-9
SLIDE 9

Automatic speech recognition

slide-10
SLIDE 10

Potential Applications

ASR (automatic speech recognition) Machine translation Natural language generation Document labeling/classification Document summarization Corpus exploration Relation/information extraction Entity identification

Q: What’s an example?

slide-11
SLIDE 11

SPORTS

Document classification

slide-12
SLIDE 12

Machine translation Document classification

slide-13
SLIDE 13

Potential Applications

ASR (automatic speech recognition) Machine translation Natural language generation Document labeling/classification Document summarization Corpus exploration Relation/information extraction Entity identification

Q: What’s an example?

slide-14
SLIDE 14

https://cdn.arstechnica.net/wp-content/uploads/2015/11/Screen-Shot-2015-11-02-at-9.11.40-PM-640x543.png

Natural language generation

slide-15
SLIDE 15

Document summarization

slide-16
SLIDE 16

Course Goals

Be introduced to some of the core problems and solutions of NLP (big picture)

slide-17
SLIDE 17

Course Goals

Be introduced to some of the core problems and solutions of NLP (big picture) Learn different ways that success and progress can be measured in NLP

slide-18
SLIDE 18

Course Goals

Be introduced to some of the core problems and solutions of NLP (big picture) Learn different ways that success and progress can be measured in NLP Relate to statistics, machine learning, and linguistics Implement NLP programs

slide-19
SLIDE 19

Course Goals

Be introduced to some of the core problems and solutions of NLP (big picture) Learn different ways that success and progress can be measured in NLP Relate to statistics, machine learning, and linguistics Implement NLP programs Read and analyze research papers Practice your (written) communication skills

slide-20
SLIDE 20

Administrivia

slide-21
SLIDE 21

Web Presence

https://piazza.com/umbc/fall2019/cmsc473673 https://www.csee.umbc.edu/courses/undergraduate/473/f19

www

Schedule, slides, assignments, readings, materials, syllabus here Course announcements, Q&A, discussion board here

slide-22
SLIDE 22

Please Read the Syllabus (On the Website)

https://www.csee.umbc.edu/courses/undergraduate/473/f19/content/materials/syllabus.pdf

slide-23
SLIDE 23

Grading

Component 473 673 Assignments 45% 30% Midterm 10% 10% Graduate Paper

  • 30%

Course Project 45% 30%

slide-24
SLIDE 24

Computation of Component Grades

Each component (e.g., “Assignment” component) is: max(micro-average, macro-average)

slide-25
SLIDE 25

Computation of Component Grades

Each component (e.g., “Assignment” component) is: max(micro-average, macro-average)

65/90 95/100 95/110 100/110

Assignment grades (not representative)

slide-26
SLIDE 26

Computation of Component Grades

Each component (e.g., “Assignment” component) is: max(micro-average, macro-average)

65/90 95/100 95/110 100/110

Assignment grades (not representative) microaverage = 65 + 95 + 95 + 100 90 + 100 + 110 + 110 ≈ 86.59% macroaverage = 1 4 65 90 + 95 100 + 95 110 + 100 110 ≈ 86.12% We’ll learn what these are in the semester

slide-27
SLIDE 27

Computation of Component Grades

Each component (e.g., “Assignment” component) is: max(micro-average, macro-average)

65/90 95/100 95/110 100/110

microaverage = 65 + 95 + 95 + 100 90 + 100 + 110 + 110 ≈ 86.59% macroaverage = 1 4 65 90 + 95 100 + 95 110 + 100 110 ≈ 86.12%

slide-28
SLIDE 28

Final Grades

≥ Letter 90 A 80 B 70 C 65 D F ≥ Letter 90 A- 80 B- 70 C- 65 D F

473 673

slide-29
SLIDE 29

Running the Assignments

A "standard" x86-64 Linux machine, like gl A passable amount of memory (2GB-4GB) Modern but not necessarily cutting edge software Don’t assume a GPU (if you want to write CUDA yourself, talk to me)

If in doubt, ask first

slide-30
SLIDE 30

Running the Project

An x86-64 Linux machine Memory and hardware constraints lifted (somewhat)

If in doubt, ask first

slide-31
SLIDE 31

Programming Languages for Assignments

Use the tools you feel comfortable with Python+numpy, C, C++, Java, Matlab, …: OK (straight Python may not cut it) Libraries: Generally OK, as long as you don’t use their implementation of what you need to implement Math accelerators (blas, numpy, etc.): OK

If in doubt, ask first

slide-32
SLIDE 32

Programming Languages for the Project

Use the tools you feel comfortable with Python+numpy, C, C++, Java, Matlab, …: OK (straight Python may not cut it) Libraries: Use what you want Math accelerators (blas, numpy, etc.): OK

slide-33
SLIDE 33

Late Policy

Everyone has a budget of 10 late days

slide-34
SLIDE 34

Late Policy

Everyone has a budget of 10 late days If you have them left: assignments turned in after the deadline will be graded and recorded, no questions asked

slide-35
SLIDE 35

Late Policy

Everyone has a budget of 10 late days If you have them left: assignments turned in after the deadline will be graded and recorded, no questions asked If you don’t have any left: still turn assignments

  • in. They could count in your favor in borderline

cases

slide-36
SLIDE 36

Late Policy

Everyone has a budget of 10 late days Use them as needed throughout the course They’re meant for personal reasons and emergencies Do not procrastinate

slide-37
SLIDE 37

Late Policy

Everyone has a budget of 10 late days Contact me privately if an extended absence will occur

You must know how many you’ve used