< > - +
Intersection Graphs for Text Analysis
Elizabeth Leeds leedsem@nswc.navy.mil David Marchette marchettedj@nswc.navy.mil Naval Surface Warfare Center Code B10
Interface 2004 – p.1/16
Intersection Graphs for Text Analysis Elizabeth Leeds David - - PowerPoint PPT Presentation
Intersection Graphs for Text Analysis Elizabeth Leeds David Marchette leedsem@nswc.navy.mil marchettedj@nswc.navy.mil Naval Surface Warfare Center Code B10 < > - + Interface 2004 p.1/16 Overview bag-of-words approach to document
< > - +
Interface 2004 – p.1/16
< > - +
Interface 2004 – p.2/16
< > - +
Interface 2004 – p.3/16
< > - +
Interface 2004 – p.4/16
< > - +
Interface 2004 – p.5/16
< > - +
Interface 2004 – p.6/16
< > - +
Interface 2004 – p.7/16
< > - +
Graph Size = 500 Mutual Information Threshold = 1 141 edges between classes
ANTHRO ASTRO BEHAVIOR EARTH LIFE MATH&COMP MEDICINE PHYSICS
Interface 2004 – p.8/16
< > - +
ANTHRO ASTRO BEHAVIOR EARTH LIFE MATH&COMP MEDICINE PHYSICS
Interface 2004 – p.9/16
< > - +
Interface 2004 – p.10/16
< > - +
−2 −1 1 2 3 0.2 0.3 0.4 0.5 0.6 MI Threshold fraction of edges out of class
graph size 300 graph size 400 graph size 500
Interface 2004 – p.11/16
< > - +
Interface 2004 – p.12/16
< > - +
Interface 2004 – p.13/16
< > - +
Graph Size = 300 Mutual Information Threshold = 0.5
31 edges between classes
ANTHRO ASTRO
Interface 2004 – p.14/16
< > - +
Graph Size = 300 Mutual Information Threshold = 0.5
31 edges between classes
ANTHRO ASTRO
Interface 2004 – p.14/16
< > - +
Graph Size = 300 Mutual Information Threshold = 0.5
31 edges between classes
ANTHRO ASTRO
Interface 2004 – p.14/16
< > - +
Graph Size = 300 Mutual Information Threshold = 0.5
27 edges between classes
ANTHRO ASTRO MATH&COMP
Interface 2004 – p.14/16
< > - +
−2 −1 1 2 3 0.0 0.1 0.2 0.3 0.4 0.5 0.6 MI Threshold fraction of edges out of class
ANTHRO, ASTRO ANTHRO, ASTRO, MED ANTHRO, ASTRO, MED, EARTH ALL 8 CLASSES
Interface 2004 – p.15/16
< > - +
Interface 2004 – p.16/16