INF4820: Algorithms for AI and NLP Classification
Milen Kouylekov & Stephan Oepen
Language Technology Group University of Oslo
- Sep. 18, 2014
INF4820: Algorithms for AI and NLP Classification Milen Kouylekov - - PowerPoint PPT Presentation
INF4820: Algorithms for AI and NLP Classification Milen Kouylekov & Stephan Oepen Language Technology Group University of Oslo Sep. 18, 2014 Today Vector spaces Quick recap Vector space models for Information Retrieval (IR)
◮ Quick recap ◮ Vector space models for Information Retrieval (IR)
◮ Representing classes and membership ◮ Rocchio classifiers ◮ kNN classifiers 2
3
4
◮ Represent the search query as a vector, just like for the documents. ◮ The relevance of documents relative to the query can be ranked
5
◮ Recognize Entities ◮ Assign them a class (ex. Person Location and Organization)
6
◮ Classify Sentences into classes Positive, Negative Neutral
7
◮ Classify pair of sentences A and B into 2 classes: YES (A implies B) and
8
9
10
11
12
13
14
15
16
17
◮ The decision boundary defined by the Voronoi tessellation. 18
19
20
21
22
◮ Both are instances of finding nearest neighbors.
23
◮ Many ways to do this. . .
24
25
26
◮ The ratio of correct predictions. ◮ Not suitable for unbalanced numbers of positive / negative examples.
◮ The number of detected class members that were correct.
◮ The number of actual class members that were detected. ◮ Trade-off: Positive predictions for all examples would give 100% recall
◮ Balanced measure of precision and recall (harmonic mean). 27
28
29