Term Extraction and Clustering



SLIDE 1

COMP 762

Term Extraction and Clustering

Course Supervisor: Jin Guo
Presented by: Shruti Bhanderi

SLIDE 2

Overview

SLIDE 3

Text Chunking

STS subcontractors shall track the progress of development activities in the progress report.
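A minimal sketch of the chunking step on the sentence above. The part-of-speech tags are assigned by hand (an assumption, since the slides do not name a tagger), and `noun_phrase_chunks` and `PRE_MODIFIERS` are hypothetical names used only for illustration.

```python
# Hand-assigned Penn Treebank tags (assumption: a real pipeline would run a POS tagger).
TAGGED = [
    ("STS", "NNP"), ("subcontractors", "NNS"), ("shall", "MD"), ("track", "VB"),
    ("the", "DT"), ("progress", "NN"), ("of", "IN"), ("development", "NN"),
    ("activities", "NNS"), ("in", "IN"), ("the", "DT"), ("progress", "NN"),
    ("report", "NN"), (".", "."),
]

# Tags allowed before the nouns of a chunk (hypothetical, simplified grammar).
PRE_MODIFIERS = {"DT", "CD", "PRP$", "JJ"}


def noun_phrase_chunks(tagged):
    """Return noun phrases: optional pre-modifiers followed by one or more nouns."""
    chunks, current, has_noun = [], [], False

    def flush():
        nonlocal current, has_noun
        if has_noun:  # keep only runs that actually contain a noun
            chunks.append(" ".join(current))
        current, has_noun = [], False

    for word, tag in tagged:
        if tag.startswith("NN"):
            current.append(word)
            has_noun = True
        elif tag in PRE_MODIFIERS and not has_noun:
            current.append(word)
        else:
            flush()
            if tag in PRE_MODIFIERS:  # a modifier after a noun starts a new chunk
                current.append(word)
    flush()
    return chunks


chunks = noun_phrase_chunks(TAGGED)
```

Under these assumptions, `chunks` holds four noun phrases, which are the candidate terms passed on to the next step.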

SLIDE 4

Processing of Extracted Terms

Extracted terms:
STS subcontractors | the progress | development activities | the progress report

Remove determiners, cardinal numbers and possessive pronouns:
STS subcontractors | progress | development activities | progress report

Lemmatization:
STS subcontractor | progress | development activity | progress report
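The two processing steps can be sketched as follows. This is an illustration under stated assumptions: the chunks are already POS-tagged by hand, and `LEMMAS` is a toy lookup table covering only this example (a real pipeline would use a lemmatizer). `DROP_TAGS`, `LEMMAS` and `clean_term` are hypothetical names.

```python
# Penn Treebank tags for determiners, cardinal numbers and possessive pronouns.
DROP_TAGS = {"DT", "CD", "PRP$"}

# Toy lemma table for this example only (assumption, not a general lemmatizer).
LEMMAS = {"subcontractors": "subcontractor", "activities": "activity"}


def clean_term(tagged_chunk):
    """Drop unwanted tags, then map each remaining word to its lemma."""
    kept = [word for word, tag in tagged_chunk if tag not in DROP_TAGS]
    return " ".join(LEMMAS.get(word, word) for word in kept)


tagged_chunks = [
    [("STS", "NNP"), ("subcontractors", "NNS")],
    [("the", "DT"), ("progress", "NN")],
    [("development", "NN"), ("activities", "NNS")],
    [("the", "DT"), ("progress", "NN"), ("report", "NN")],
]

terms = [clean_term(chunk) for chunk in tagged_chunks]
```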

SLIDE 5

Compute Similarities between Terms

1. Syntactic Similarity Measures
2. Semantic Similarity Measures

SLIDE 6

Syntactic Similarity

Syntactic measures:

❖ Distance-based: Levenshtein
❖ Token-based: Cosine
❖ Corpus-based: SoftTFIDF
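A rough sketch of two of these measures on the example terms: Levenshtein edit distance and a token-based cosine similarity over word counts. Normalising the edit distance by the longer string length is an assumption made here to obtain a score in [0, 1]. SoftTFIDF is omitted because it needs corpus statistics.

```python
import math
from collections import Counter


def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def levenshtein_similarity(a, b):
    """Edit distance scaled to [0, 1] (normalisation is an assumption)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def cosine_similarity(a, b):
    """Cosine of the angle between the word-count vectors of two terms."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[token] * vb[token] for token in va)
    norm_a = math.sqrt(sum(count * count for count in va.values()))
    norm_b = math.sqrt(sum(count * count for count in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


edit_sim = levenshtein_similarity("progress", "progress report")
token_sim = cosine_similarity("progress", "progress report")
```

The two scores differ for the same pair because one compares characters and the other compares whole tokens.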

SLIDE 7

Semantic Similarity

❖ Is-a (vertical) and has-a (horizontal) relation chains (WordNet).

Measures: HSO, LESK, LCH, PATH, LIN, JCN, RES, WUP
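To illustrate how is-a chains give a similarity score, here is a sketch of PATH and WUP on a tiny hand-made taxonomy. The `HYPERNYM` table is hypothetical and is not WordNet data. The formulas used are the common definitions: PATH = 1 / (1 + shortest path length in edges) and WUP = 2 · depth(LCS) / (depth(a) + depth(b)), with the root at depth 1.

```python
# Toy is-a hierarchy: child -> parent (hypothetical, not taken from WordNet).
HYPERNYM = {
    "entity": None,
    "document": "entity",
    "report": "document",
    "plan": "document",
    "person": "entity",
    "contractor": "person",
}


def ancestors(concept):
    """Chain from the concept itself up to the root."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = HYPERNYM[concept]
    return chain


def lowest_common_subsumer(a, b):
    """Deepest concept that lies on the is-a chains of both a and b."""
    chain_b = set(ancestors(b))
    return next(c for c in ancestors(a) if c in chain_b)


def path_similarity(a, b):
    lcs = lowest_common_subsumer(a, b)
    edges = ancestors(a).index(lcs) + ancestors(b).index(lcs)
    return 1.0 / (1 + edges)


def wup_similarity(a, b):
    def depth(c):
        return len(ancestors(c))  # root has depth 1

    lcs = lowest_common_subsumer(a, b)
    return 2.0 * depth(lcs) / (depth(a) + depth(b))


close = path_similarity("report", "plan")        # siblings under "document"
far = path_similarity("report", "contractor")    # related only through the root
```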

SLIDE 8
SLIDE 9

Clustering

SLIDE 10

Clustering algorithms

❖ K-means
❖ Hierarchical Clustering
❖ Expectation Maximization

SLIDE 11

K-means
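A minimal K-means sketch in plain Python. The 2-D points are made-up stand-ins for term feature vectors, and seeding the centroids with the first `k` points is a simplifying assumption (the result depends on input order; practical implementations use random or k-means++ seeding).

```python
def squared_distance(p, q):
    return sum((x - y) ** 2 for x, y in zip(p, q))


def kmeans(points, k, iterations=100):
    """Alternate assignment and centroid update until assignments stop changing."""
    centroids = [tuple(p) for p in points[:k]]  # naive deterministic seeding
    assignment = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignment, centroids


# Hypothetical feature vectors forming two visibly separate groups.
points = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (5.2, 4.9), (0.9, 1.1), (4.8, 5.1)]
assignment, centroids = kmeans(points, k=2)
```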

SLIDE 12

Hierarchical Clustering

❖ Agglomerative: bottom up
❖ Divisive: top down
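The bottom-up (agglomerative) variant can be sketched as follows: every term starts as its own cluster and the two closest clusters are merged repeatedly. Average linkage, the token-overlap (Jaccard) distance and the stopping `threshold` are all assumptions chosen for illustration; the slides do not specify them.

```python
def token_distance(a, b):
    """Jaccard distance between the word sets of two terms (illustrative choice)."""
    ta, tb = set(a.split()), set(b.split())
    return 1.0 - len(ta & tb) / len(ta | tb)


def agglomerative(terms, distance, threshold):
    """Merge the closest pair of clusters until none is closer than threshold."""
    clusters = [[term] for term in terms]

    def linkage(c1, c2):
        # Average linkage: mean pairwise distance between members.
        return sum(distance(a, b) for a in c1 for b in c2) / (len(c1) * len(c2))

    while len(clusters) > 1:
        best, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if best > threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


terms = ["STS subcontractor", "progress", "development activity", "progress report"]
clusters = agglomerative(terms, token_distance, threshold=0.6)
```

With this threshold only "progress" and "progress report" share enough tokens to be merged; the other terms stay in singleton clusters.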

SLIDE 13

Expectation Maximization
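A small sketch of Expectation Maximization for a one-dimensional Gaussian mixture, assuming that is the model being clustered with (the slide does not say). The data values, the initial means and the `min_var` floor are made up for illustration.

```python
import math


def gaussian_pdf(x, mean, var):
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_gmm_1d(data, means, iterations=50, min_var=1e-6):
    """Fit mixture weights, means and variances by alternating E and M steps."""
    k = len(means)
    means = list(means)
    variances = [1.0] * k
    weights = [1.0 / k] * k
    for _ in range(iterations):
        # E-step: responsibility of each component for each data point.
        resp = []
        for x in data:
            scores = [weights[c] * gaussian_pdf(x, means[c], variances[c]) for c in range(k)]
            total = sum(scores)
            resp.append([s / total for s in scores])
        # M-step: re-estimate parameters from the soft assignments.
        for c in range(k):
            nc = sum(r[c] for r in resp)
            weights[c] = nc / len(data)
            means[c] = sum(r[c] * x for r, x in zip(resp, data)) / nc
            var = sum(r[c] * (x - means[c]) ** 2 for r, x in zip(resp, data)) / nc
            variances[c] = max(var, min_var)  # floor avoids a collapsed component
    return weights, means, variances


# Hypothetical 1-D data with two groups, near 1 and near 5.
data = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
weights, means, variances = em_gmm_1d(data, means=[0.0, 6.0])
```

Unlike K-means, each point receives a soft membership (`resp`) in every cluster rather than a single hard label.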

SLIDE 14

Thank you!