Leong & Mihalcea: Measuring the Semantic Relatedness Between - PowerPoint PPT Presentation

Leong & Mihalcea: Measuring the Semantic Relatedness Between Words and Images Seminar: Distributionelle Semantik jenseits der Wortbedeutung (Matthias Hartung) Michael Haas, haas@cl.uni-heidelberg.de 22-07-2013

Overview ◮ Introduction Multimodal Semantics ◮ Algorithm: Text + Pictures ◮ Results ◮ Questions? Too fast? Ask!

Multimodal Semantics ◮ Distributional Semantics on text corpora: uni-modal ◮ Integrate different modalities: multi-modal ◮ Feature Norms ◮ Pictures ◮ Why: ◮ Obvious things go un-mentioned ◮ Human cognition is situated → Distributional semantics is like ”learning meaning by listening to the radio” 1 1 McClelland, cited according to Johns &Jones, 2011

Algorithm: Text + Pictures ◮ Task: measure semantic relatedness between words and images ◮ Data Set: ImageNet, extension of WordNet ◮ Select 167 synsets ◮ Select nouns from synsets and glosses ◮ Select one image at random from synset ◮ How to compare images and words?

Algorithm: Representation ◮ For text: build term-document matrix ◮ Vector length: 167 documents ◮ For images: represent image as bag of visual words

Algorithm: Bag of visual words ◮ General approach for feature extraction from images ◮ Feature Detection: split image into partitions ◮ Feature Description: represent image as set of vectors ◮ Visual Codeword Generation: cluster vectors

Algorithm: Bag of visual words ◮ Extract 20px square patches at every 10px boundary ◮ Represent using SIFT descriptors: Scale-Invariant Feature Transform ◮ Cluster into 1000 code words → Image is now represented as a bag of visual code words

CMSM for Sentiment Analysis: Eval Results Figure : Bruni et al., 2012

Algorithm: Map images into document space ◮ Represent each code word as vector: distribution over document space → Image is represented as set of vectors ◮ Flatten image represention: sum over all vectors → Image is now represented as a single vector in document space

Algorithm: Compare images and words ◮ Words and images are mapped into document space ◮ Reduce dimensions using LSA ◮ Measure similarity: cosine similarity → Direct comparison of vectors in term-document and codeword-document space

Evaluation ◮ Image-Centered Scenario → Given 12 associated words, rank according to relatedness to image ◮ Arbitrary-Image Scenario → Measure similarity between arbitrary images and words irregardless of synset membership ◮ Gold Standard: extract 12 words from synset, relatedness rated by MTurkers

Evaluation: Baselines ◮ Random baseline ◮ Vector-based baseline w/o LSA ◮ Upper bound: human performance based on annotator data

Evaluation: Results ◮ Image-Centered ◮ Vector-based baseline: 0 . 262 correlation to gold standard ◮ LSA-based: 0 . 339 ◮ Human upper bound: 0 . 687 ◮ Arbitrary-Image ◮ Vector-based: 0 . 291 ◮ LSA-Based: 0 . 353 ◮ Human upper bound: 0 . 764 ◮ Adding more synsets brings correlation values to ∼ 0 . 45

Summary ◮ Comparing images to text: it works! ◮ More data is better data ◮ How can we enrich textual data with image data? → For starters, just concatenate textual vector and pictoral vector (Bruni et al., 2012)

References I Leong, C. W., & Mihalcea, R. (2011, January). Measuring the semantic relatedness between words and images. In Proceedings of the Ninth International Conference on Computational Semantics (pp. 185-194). Association for Computational Linguistics. Bruni, E., Boleda, G., Baroni, M., & Tran, N. K. (2012, July). Distributional semantics in technicolor. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1 (pp. 136-145). Association for Computational Linguistics.

Leong & Mihalcea: Measuring the Semantic Relatedness Between - PowerPoint PPT Presentation

Leong & Mihalcea: Measuring the Semantic Relatedness Between Words and Images Seminar: Distributionelle Semantik jenseits der Wortbedeutung (Matthias Hartung) Michael Haas, haas@cl.uni-heidelberg.de 22-07-2013 Overview Introduction

Compositional Distributional Semantic Models for Semantic Relatedness and Entailment Sidharth

Multi-Relational Semantic Similarity Li Harry Zhang, Steven R. Wilson, Rada Mihalcea

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Breaking the Rules of Game Design: when to go against Autonomy, Competence, and Relatedness

Optimizing User Adoption Speaker: Laurence Leong February, 2019 Laurence Leong Vice President

Yin Xu, Wai Kay Leong, Ali Razeen Ben Leong Duke University National University of Singapore 1

To Attend or not to Attend: A Case Study on Syntactic Structures for Semantic Relatedness

Semantic Documents Relatedness using Concept Graph Representation Date : 2016/07/12 Author :

An Annotation of Semantic Change based on Usage Relatedness October 28, 2017 Dominik

Semantic relatedness and cross-lingual passage retrieval Eneko Agirre 1 , Olatz Ansa 1 , Xabier

Corpus-based Semantic Relatedness for the Construction of Polish WordNet Bartosz Broda 1 ,

ACL 2012 Multilingual Sentiment and Subjectivity Analysis Rada Mihalcea, University of North

Algorithm Analysis Rada Mihalcea http://www.cs.unt.edu/~rada/CSCE3110 Queues Reading: Chap. 3

Relatedness between NZ native chrysomeline beetles and Paropsis NZ chrysomeline fauna: 41 spp.

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Unique representation Recall idea of coordinate system for a vector space V : Generators a 1 ,

Differential Vector Calculus Steve Rotenberg CSE169: Computer Animation UCSD Winter 2020

CS675: Convex and Combinatorial Optimization Spring 2018 Introduction to Matroid Theory

Insights from the FMA John Botica and Derek Grantham Insights from the FMA- whats

Atacking Split Manufacturing from a Deep Learning Perspective Haocheng Li 1 , Satwik Patnaik 2 ,

CSSE463: Image Recognition Day 14 Lab due Weds. These solutions assume that you don't

Linear Transformations Marco Chiarandini Department of Mathematics & Computer Science

Personal Photo Enhancement using Example Images Neel Joshi Wojciech Matusik, Edward H. Adelson,

Leong & Mihalcea: Measuring the Semantic Relatedness Between - PowerPoint PPT Presentation

Leong & Mihalcea: Measuring the Semantic Relatedness Between Words and Images Seminar: Distributionelle Semantik jenseits der Wortbedeutung (Matthias Hartung) Michael Haas, haas@cl.uni-heidelberg.de 22-07-2013 Overview Introduction

Compositional Distributional Semantic Models for Semantic Relatedness and Entailment Sidharth

Multi-Relational Semantic Similarity Li Harry Zhang, Steven R. Wilson, Rada Mihalcea

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Breaking the Rules of Game Design: when to go against Autonomy, Competence, and Relatedness

Optimizing User Adoption Speaker: Laurence Leong February, 2019 Laurence Leong Vice President

Yin Xu, Wai Kay Leong, Ali Razeen Ben Leong Duke University National University of Singapore 1

To Attend or not to Attend: A Case Study on Syntactic Structures for Semantic Relatedness

Semantic Documents Relatedness using Concept Graph Representation Date : 2016/07/12 Author :

An Annotation of Semantic Change based on Usage Relatedness October 28, 2017 Dominik

Semantic relatedness and cross-lingual passage retrieval Eneko Agirre 1 , Olatz Ansa 1 , Xabier

Corpus-based Semantic Relatedness for the Construction of Polish WordNet Bartosz Broda 1 ,

ACL 2012 Multilingual Sentiment and Subjectivity Analysis Rada Mihalcea, University of North

Algorithm Analysis Rada Mihalcea http://www.cs.unt.edu/~rada/CSCE3110 Queues Reading: Chap. 3

Relatedness between NZ native chrysomeline beetles and Paropsis NZ chrysomeline fauna: 41 spp.

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Unique representation Recall idea of coordinate system for a vector space V : Generators a 1 ,

Differential Vector Calculus Steve Rotenberg CSE169: Computer Animation UCSD Winter 2020

CS675: Convex and Combinatorial Optimization Spring 2018 Introduction to Matroid Theory

Insights from the FMA John Botica and Derek Grantham Insights from the FMA- whats

Atacking Split Manufacturing from a Deep Learning Perspective Haocheng Li 1 , Satwik Patnaik 2 ,

CSSE463: Image Recognition Day 14 Lab due Weds. These solutions assume that you don't

Linear Transformations Marco Chiarandini Department of Mathematics &amp; Computer Science

Personal Photo Enhancement using Example Images Neel Joshi Wojciech Matusik, Edward H. Adelson,

Linear Transformations Marco Chiarandini Department of Mathematics & Computer Science