SLIDE 1
Natural Language Processing Basics
Yingyu Liang University of Wisconsin-Madison
Natural Language Processing (NLP)
The processing of human languages by computers
One of the oldest AI tasks
One of the most important AI tasks
Preprocess Zipf’s Law
Bag-of-Words tf-idf
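Term frequency: $\mathrm{tf}(w) = \frac{c(w)}{\max_v c(v)}$, where $c(w)$ is the number of times word $w$ occurs in the document
Inverse document frequency: $\mathrm{idf}(w) = \log \frac{N}{\mathrm{df}(w)}$, where $N$ is the number of documents and $\mathrm{df}(w)$ is the number of documents containing $w$
$\text{tf-idf}(w) = \mathrm{tf}(w) \cdot \mathrm{idf}(w)$
Document representation: $[\text{tf-idf}(w_1), \text{tf-idf}(w_2), \dots, \text{tf-idf}(w_m)]$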
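The tf-idf representation can be sketched in a few lines of Python. This is a minimal illustration, not the slides' own code. It assumes one common variant of the definitions: tf is the word count divided by the largest count in the document, and idf is the log of the number of documents divided by the number of documents containing the word. The function name tf_idf_vector and the toy corpus are hypothetical.

```python
import math
from collections import Counter

def tf_idf_vector(doc, corpus, vocab):
    """Return the tf-idf vector of `doc` over `vocab` (assumed definitions)."""
    counts = Counter(doc)
    max_count = max(counts.values()) if counts else 1
    n_docs = len(corpus)
    vec = []
    for w in vocab:
        # Assumed tf: count of w normalized by the most frequent word's count.
        tf = counts[w] / max_count
        # Document frequency: number of documents that contain w.
        df = sum(1 for d in corpus if w in d)
        # Assumed idf: log(N / df); words seen in no document get 0.
        idf = math.log(n_docs / df) if df else 0.0
        vec.append(tf * idf)
    return vec

# Hypothetical toy corpus of already-tokenized documents.
corpus = [
    ["the", "dog", "ran"],
    ["the", "cat", "sat"],
    ["the", "dog", "sat", "down"],
]
vocab = sorted({w for d in corpus for w in d})
vec = tf_idf_vector(corpus[0], corpus, vocab)
```

A word that appears in every document, such as "the" here, gets idf 0 and therefore contributes nothing to the vector, which is the intended effect of the idf weight.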
Statistical language model N-gram Smoothing
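Language model: $P(w_1, \dots, w_\tau) = \prod_{t=1}^{\tau} P(w_t \mid w_1, \dots, w_{t-1})$
N-gram approximation: $P(w_1, \dots, w_\tau) \approx \prod_{t=n}^{\tau} P(w_t \mid w_{t-n+1}, \dots, w_{t-1})$
Count-based estimate: $\prod_{t=n}^{\tau} \frac{c(w_{t-n+1}, \dots, w_t)}{c(w_{t-n+1}, \dots, w_{t-1})}$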
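A count-based n-gram model can be sketched as follows. This is an illustrative sketch under stated assumptions, not the slides' implementation: each conditional probability is estimated as the count of the n-gram divided by the count of its (n-1)-word context, with no smoothing, and the product starts at the n-th word. The function names and the toy token sequence are hypothetical.

```python
from collections import Counter

def train_ngram(tokens, n):
    """Count n-grams and the (n-1)-word contexts that precede a word."""
    positions = range(len(tokens) - n + 1)
    ngrams = Counter(tuple(tokens[i:i + n]) for i in positions)
    contexts = Counter(tuple(tokens[i:i + n - 1]) for i in positions)
    return ngrams, contexts

def ngram_prob(word, context, ngrams, contexts):
    """Unsmoothed estimate of P(word | context) from raw counts."""
    context = tuple(context)
    if contexts[context] == 0:
        return 0.0  # unseen context: the unsmoothed estimate is undefined, use 0
    return ngrams[context + (word,)] / contexts[context]

def sequence_prob(tokens, n, ngrams, contexts):
    """Product of n-gram conditionals, starting at the n-th word."""
    p = 1.0
    for t in range(n - 1, len(tokens)):
        p *= ngram_prob(tokens[t], tokens[t - n + 1:t], ngrams, contexts)
    return p

# Hypothetical training text, already tokenized.
train = "the dog ran away the dog sat".split()
ngrams, contexts = train_ngram(train, 2)
p_ran = ngram_prob("ran", ["dog"], ngrams, contexts)
p_seq = sequence_prob("the dog ran away".split(), 2, ngrams, contexts)
```

Any n-gram that never occurs in the training text gets probability 0 here, which zeroes out the whole product. That weakness is what smoothing addresses.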
If P[away | do ran] does not work (for example, because the longer context was never observed in the training data), use P[away | ran] as a replacement
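The idea of replacing a long context with a shorter one is commonly called back-off. The sketch below is a simplified illustration: it drops the earliest context word whenever the longer n-gram has a zero count, and ends at the unigram frequency. It applies no discounting, so the returned values are scores rather than a properly normalized distribution. The function names and the toy text are hypothetical.

```python
from collections import Counter

def count_ngrams(tokens, max_n):
    """Count every n-gram of length 1..max_n in one Counter."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts

def backoff_score(word, context, counts, total):
    """Score P(word | context), shortening the context until a count exists."""
    context = tuple(context)
    while context:
        ngram = context + (word,)
        if counts[ngram] > 0:
            # The longer context was observed with this word: use it.
            return counts[ngram] / counts[context]
        # Otherwise drop the earliest context word and try again.
        context = context[1:]
    # No context left: fall back to the unigram relative frequency.
    return counts[(word,)] / total

# Hypothetical training text, already tokenized.
train = "the dog ran away and the cat ran off".split()
counts = count_ngrams(train, 3)
total = len(train)

seen = backoff_score("away", ["dog", "ran"], counts, total)     # trigram observed
backed = backoff_score("away", ["cat", "ran"], counts, total)   # falls back to bigram
unigram = backoff_score("the", ["ran", "away"], counts, total)  # falls back to unigram
```

In this toy text the trigram "cat ran away" never occurs, so its score comes from the bigram "ran away" instead. Practical back-off schemes also discount the higher-order estimates so that the probabilities still sum to one.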