SLIDE 24 Introduction The distributional hypothesis
Semantic distances
◮ the main result of distributional analysis is a set of “semantic” distances between words
◮ typical applications
  ◮ nearest neighbours
  ◮ clustering of related words
  ◮ constructing a semantic map
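As a rough illustration of the first application, the sketch below computes cosine distances between a few nouns and lists each noun's nearest neighbours. The co-occurrence counts, the verb dimensions, and the helper names (`cosine_distance`, `nearest_neighbours`) are assumptions made up for this example. They are not taken from the BNC, and cosine distance is only one of several possible distance measures.

```python
import math

# Hypothetical toy co-occurrence counts (noun x verb). The verbs and the
# numbers are invented for illustration; they are NOT BNC data.
VERBS = ["eat", "feed", "drive", "hold"]
VECTORS = {
    "banana": [8, 0, 0, 2],
    "pear":   [6, 0, 0, 1],
    "dog":    [1, 9, 0, 2],
    "cat":    [1, 7, 0, 3],
    "car":    [0, 0, 9, 0],
    "truck":  [0, 0, 7, 1],
}


def cosine_distance(u, v):
    """Return 1 - cos(u, v); values near 0 mean similar contexts."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def nearest_neighbours(word, vectors, n=2):
    """Return the n words closest to `word`, as (word, distance) pairs."""
    target = vectors[word]
    distances = [
        (cosine_distance(target, vec), other)
        for other, vec in vectors.items()
        if other != word
    ]
    distances.sort()  # smallest distance first
    return [(other, dist) for dist, other in distances[:n]]


if __name__ == "__main__":
    for noun in VECTORS:
        neighbours = nearest_neighbours(noun, VECTORS)
        print(noun, [(w, round(d, 3)) for w, d in neighbours])
```

With these toy counts, nouns that occur as objects of the same verbs end up close together (for example `dog` and `cat`). The same pairwise distances could also serve as input for clustering or for a two-dimensional semantic map.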
[Figure: dendrogram "Word space clustering of concrete nouns (V−Obj from BNC)"; axis label "Cluster size", ticks from 0.0 to 1.2. Labels: potato, cat, banana, chicken, mushroom, corn, dog, pear, cherry, lettuce, penguin, swan, eagle, duck, elephant, pig, cow, lion, helicopter, peacock, turtle, car, pineapple, boat, rocket, truck, motorcycle, snail, ship, chisel, scissors, screwdriver, pencil, hammer, telephone, knife, spoon, pen, kettle, bottle, cup, bowl]
[Figure: scatter plot "Semantic map (V−Obj from BNC)"; axis ticks from −0.4 to 0.8 and from −0.4 to 0.6. Legend: groundAnimal, fruitTree, green, tool, vehicle. Point labels: chicken, eagle, duck, swan, owl, penguin, peacock, dog, elephant, cow, cat, lion, pig, snail, turtle, cherry, banana, pear, pineapple, mushroom, corn, lettuce, potato, bottle, pencil, pen, cup, bowl, scissors, kettle, knife, screwdriver, hammer, spoon, chisel, telephone, boat, car, ship, truck, rocket, motorcycle, helicopter]
Stefan Evert (U Osnabrück) Making Sense of DSM wordspace.collocations.de 13 / 115