JoBimText Framework for Distributional Semantics
Alexander Panchenko TU Darmstadt — FG Language Technology
JoBimText Framework for Distributional Semantics Alexander - - PowerPoint PPT Presentation
JoBimText Framework for Distributional Semantics Alexander Panchenko TU Darmstadt FG Language Technology Most slides by Martin Riedl & Eugen Ruppert from TU Darmstadt Plan Distributional
Alexander Panchenko TU Darmstadt — FG Language Technology
Martin Riedl & Eugen Ruppert from TU Darmstadt
Advantages: ¡
“give ¡a ¡man ¡a ¡fish ¡and ¡you ¡ feed ¡him ¡for ¡a ¡day…”
Disadvantages: ¡
words, ¡new ¡meanings ¡…
Word sense — a word cluster http://www.serelex.org
bar#NN paper#NN
IS-A relations (~hypernyms)
cougar is-a {animal, cat, speices}
toyota is-a {car, company}
Hearst patterns
Matches in text
{[cream soda=hypo]}
{[burger=hypo]}, and {[fry=hypo]}
Sense hypernyms — frequent IS-A relations in a word cluster
Context clues of a sense — frequent context features in a word cluster
Lion Porsche Corvette Leopard
assign the correct word sense based on the given context.
a great community”
python is a programming language with a great community
hyper [language, languages, programming_language, programming_languages, scripting_language, technology, …]
hyper [forms, groups, people, topics, …]
hyper []
hyper []
hyper [item, items]
python snake is very dangerous
…] hyper [animals, animal, species, specie, wildlife, creature, …]
hyper [animals, animal, species, specie, …]
hyper []
hyper []
Mouse0 Mouse1 Mouse2 Mouse3 finger rodent software malignant thumb guy circuitry embryonic brain baboon users fetal skin horse screen cancerous
Input: sentence, target words, proto-ontology Output: senses for target words for targetWord in sentence:
similarBims = getSimilarBims(bim) for senseCluster in senseClusters(targetWord): for clusterTerm in senseCluster: for bim in {originalBim, similarBims}: if clusterTerm has bim: addScore(senseCluster) assignedSense = maxScore(senseClusters) return { (targetWord, assignedSense) }