SLIDE 1

22K categories and 15M images

www.image-net.org

Deng et al. 2009, Russakovsky et al. 2015

  • Animals
  • Bird
  • Fish
  • Mammal
  • Invertebrate
  • Plants
  • Tree
  • Flower
  • Food
  • Materials
  • Structures
  • Artifact
  • Tools
  • Appliances
  • Person
  • Scenes
  • Indoor
  • Geological Formations
  • Sport Activity

Slide credit: Fei-Fei Li

SLIDE 2

What is WordNet?

  • Original paper [George Miller et al., 1990] cited over 5,000 times
  • Organizes over 150,000 words into 117,000 categories called synsets
  • Establishes ontological and lexical relationships, used in NLP and related tasks
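
To make synsets concrete, here is a minimal sketch using NLTK's WordNet interface (an illustration, assuming nltk is installed and its wordnet corpus has been downloaded; the synset names are examples):

    # Query WordNet synsets and walk the hypernym (IS-A) chain with NLTK.
    # Requires: pip install nltk; then nltk.download('wordnet') once.
    from nltk.corpus import wordnet as wn

    # A synset groups synonymous word senses under one concept.
    for syn in wn.synsets("dog")[:3]:
        print(syn.name(), "-", syn.definition())

    # Ontological IS-A relationships: climb from a leaf toward the root.
    node = wn.synset("german_shepherd.n.01")
    while node.hypernyms():
        print(node.name())
        node = node.hypernyms()[0]
    print(node.name())  # entity.n.01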

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 3

Individually Illustrated WordNet Nodes

  • German shepherd: breed of large shepherd dogs used in police work and as a guide for the blind
  • microwave: kitchen appliance that cooks food by passing an electromagnetic wave through it
  • mountain: a land mass that projects well above its surroundings; higher than a hill
  • jacket: a short coat

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 4

Step 1: Ontological structure based on WordNet

Entity → Mammal → Dog → German Shepherd

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 5

Step 2: Populate categories with thousands of images from the Internet

[Example categories: Dog, German Shepherd]

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 6

Step 3: Clean results by hand

[Example categories: Dog, German Shepherd]

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 7

Three Attempts at Launching

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 8

1st Attempt: The Psychophysics Experiment

ImageNet: PhD students + miserable undergrads

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 9

1st Attempt: The Psychophysics Experiment

  • # of synsets: 40,000 (subject to imageability analysis)
  • # of candidate images to label per synset: 10,000
  • # of people needed to verify each image: 2-5
  • Speed of human labeling: 2 images/sec (one fixation: ~200 msec)
  • Massive parallelism (N ~ 10^2-10^3)

40,000 synsets × 10,000 images × 3 verifications / (2 images/sec) = 600,000,000 sec ≈ 19 years
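
As a sanity check, the estimate can be reproduced with the slide's own numbers (a sketch; the constants come straight from the bullets above):

    # Back-of-the-envelope labeling-cost estimate for the first attempt.
    synsets = 40_000            # candidate synsets after imageability analysis
    images_per_synset = 10_000  # candidate images to label per synset
    verifications = 3           # 2-5 people per image; take ~3
    images_per_second = 2       # one fixation is ~200 msec

    total_sec = synsets * images_per_synset * verifications / images_per_second
    print(total_sec)                      # 600000000.0 seconds
    print(total_sec / (3600 * 24 * 365))  # ~19 years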

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 10

2nd Attempt: Human-in-the-Loop Solutions

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 11

2nd Attempt: Human-in-the-Loop Solutions

Human-generated datasets transcend algorithmic limitations, leading to better machine perception. Machine-generated datasets can only match the best algorithms of the time.

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 12

3rd Attempt: Crowdsourcing

ImageNet: PhD students + crowdsourced labor (49K workers from 167 countries, 2007-2010)

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 13

The Result: Goes Live in 2009

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 14

Others Targeted Detail

  • LabelMe: per-object regions and labels [Russell et al., 2005]
  • Lotus Hill: hand-traced parse trees [Yao et al., 2007]

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 15

ImageNet Targeted Scale

  • ImageNet: 15M images [Deng et al. '09]
  • SUN: 131K [Xiao et al. '10]
  • LabelMe: 37K [Russell et al. '07]
  • PASCAL VOC: 30K [Everingham et al. '06-'12]
  • Caltech101: 9K [Fei-Fei, Fergus, Perona '03]

Slide credit: Fei-Fei Li and Jia Deng

SLIDE 16

Challenge procedure every year

1. Training data released: images and annotations
  • For classification, 1000 synsets with ~1K images/synset
2. Test data released: images only (annotations hidden)
  • For classification, ~100 images/synset
3. Participants train their models on the training data
4. Participants submit a text file with predictions on the test images
5. Organizers evaluate and release the results, and run a workshop at ECCV/ICCV to discuss them

SLIDE 17

ILSVRC image classification task

[Example image: Steel drum]

Objects: 1000 classes
Training: 1.2M images
Validation: 50K images
Test: 100K images

SLIDE 18

ILSVRC image classification task

Ground truth: Steel drum

Output A (✔): Scale, T-shirt, Steel drum, Drumstick, Mud turtle
Output B (✗): Scale, T-shirt, Giant panda, Drumstick, Mud turtle

SLIDE 19

ILSVRC image classification task

Ground truth: Steel drum

Output A (✔): Scale, T-shirt, Steel drum, Drumstick, Mud turtle
Output B (✗): Scale, T-shirt, Giant panda, Drumstick, Mud turtle

Error = (1/100,000) × Σ over the 100,000 test images of 1[incorrect on image i]
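
In code, the error measure is a simple average (a sketch; it treats an image as correct when the ground-truth label appears anywhere in the model's list of guesses, e.g. the five outputs above):

    # Flat classification error: fraction of test images whose
    # ground-truth label is missing from the model's guesses.
    def classification_error(predictions, ground_truth):
        """predictions: one list of guessed labels per image."""
        incorrect = sum(gt not in guesses
                        for guesses, gt in zip(predictions, ground_truth))
        return incorrect / len(ground_truth)

    # Toy usage mirroring the slide's two outputs:
    preds = [["Scale", "T-shirt", "Steel drum", "Drumstick", "Mud turtle"],
             ["Scale", "T-shirt", "Giant panda", "Drumstick", "Mud turtle"]]
    print(classification_error(preds, ["Steel drum", "Steel drum"]))  # 0.5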

SLIDE 20

[Figure: a grid of binary (+/-) labels over object classes (Table, Chair, Bowl, Dog, Cat, …) and test images]

Labels: 1000 objects × 100K test images = 100 million questions

Why not all objects?

SLIDE 21

ILSVRC Task 2: Classification + Localization

[Example image: Steel drum]

Objects: 1000 classes
Training: 1.2M images, 500K with bounding boxes
Validation: 50K images, all 50K with bounding boxes
Test: 100K images, all 100K with bounding boxes

SLIDE 22

Data annotation cost

Draw a tight bounding box around the moped


SLIDE 24

Data annotation cost

"Draw a tight bounding box around the moped." This took 14.5 seconds.
(Compare: 7 sec [JaiGra ICCV'13], 10.2 sec [RusLiFei CVPR'15], 25.5 sec [SuDenFei AAAIW'12].)

SLIDE 25

[Hao Su et al. AAAI 2010]

SLIDE 26

ILSVRC Task 2: Classification + Localization

Ground truth: Steel drum

Output: five labeled boxes (Folding chair, Persian cat, Loud speaker, Steel drum, Picket fence)

SLIDE 27

ILSVRC Task 2: Classification + Localization

Ground truth: Steel drum

Output (✔): Folding chair, Persian cat, Loud speaker, Steel drum, Picket fence
Output (bad localization): the same five labels, but the Steel drum box is off target
Output (bad classification): Folding chair, Persian cat, Loud speaker, Picket fence, King penguin

SLIDE 28

ILSVRC Task 2: Classification + Localization

Ground truth: Steel drum

Output (✔): Folding chair, Persian cat, Loud speaker, Steel drum, Picket fence

Error = (1/100,000) × Σ over the 100,000 test images of 1[incorrect on image i]
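
The bad-localization and bad-classification examples above suggest a two-part correctness check: some guessed (label, box) pair must both name the right class and overlap the ground-truth box sufficiently. A minimal sketch, assuming an intersection-over-union threshold of 0.5 as in PASCAL VOC:

    # Classification + localization correctness check (sketch).
    def iou(a, b):
        """Intersection-over-union of boxes given as (x1, y1, x2, y2)."""
        ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
        ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
        inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
        union = ((a[2] - a[0]) * (a[3] - a[1])
                 + (b[2] - b[0]) * (b[3] - b[1]) - inter)
        return inter / union if union else 0.0

    def cls_loc_correct(guesses, gt_label, gt_box, thr=0.5):
        """guesses: list of (label, box) pairs, e.g. the five outputs."""
        return any(label == gt_label and iou(box, gt_box) >= thr
                   for label, box in guesses)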

SLIDE 29

From classification+localization to segmentation…

Segmentation propagation in ImageNet (in a few minutes)

SLIDE 30

ILSVRC Task 3: Detection

Allows evaluation of generic object detection in cluttered scenes at scale.

[Example labels: Person, Car, Motorcycle, Helmet]

Objects: 200 classes
Training: 450K images, 470K bounding boxes
Validation: 20K images, all bounding boxes annotated
Test: 40K images, all bounding boxes annotated

SLIDE 31

ILSVRC Task 3: Detection

[Example labels: Person, Car, Motorcycle, Helmet]

Evaluation modeled after PASCAL VOC:

  • Algorithm outputs a list of bounding-box detections with confidences
  • A detection is considered correct if its overlap with the ground truth is large enough (intersection-over-union above a threshold)
  • Evaluated by average precision per object class
  • The winner of the challenge is the team that wins the most object categories

All instances of all target object classes are expected to be localized in all test images.

Everingham, Van Gool, Williams, Winn and Zisserman. The PASCAL Visual Object Classes (VOC) Challenge. IJCV 2010.
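
To ground the bullets above, here is a simplified per-class average precision sketch in the spirit of the PASCAL VOC protocol (the official evaluation differs in details such as precision interpolation and "difficult" objects; iou() is the helper from the previous sketch):

    # Simplified average precision for one object class (sketch).
    def average_precision(dets, gts, thr=0.5):
        """dets: (image_id, confidence, box) tuples for this class;
        gts: dict mapping image_id -> list of ground-truth boxes."""
        matched = {img: [False] * len(boxes) for img, boxes in gts.items()}
        n_gt = sum(len(boxes) for boxes in gts.values())
        hits = []
        for img, _, box in sorted(dets, key=lambda d: -d[1]):  # by confidence
            cands = gts.get(img, [])
            best = max(range(len(cands)),
                       key=lambda j: iou(box, cands[j]), default=None)
            ok = (best is not None and iou(box, cands[best]) >= thr
                  and not matched[img][best])
            if ok:
                matched[img][best] = True
            hits.append(ok)  # False: low overlap, duplicate, or stray box
        ap, tp = 0.0, 0
        for k, hit in enumerate(hits, 1):
            tp += hit
            if hit:
                ap += tp / k  # precision at each recall step
        return ap / n_gt if n_gt else 0.0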

SLIDE 32

[Figure: a grid of binary (+/-) labels over object classes (Table, Chair, Bowl, Dog, Cat, …) and images]

Labels: 200 objects × 120K images = 24 million questions

Multi-label annotation [Deng et al. CHI'14]

SLIDE 33

[Figure: the same label grid, now organized by a semantic hierarchy (Animals, Furniture, Man-made objects) over the 200 objects × 120K images]

[Deng et al. CHI'14]

SLIDE 34

Large-scale object detection benchmark

Goal: get as much utility (new labels) as possible, for as little cost (human time) as possible, given a desired level of accuracy.
Result: 6.2x savings in human cost.

ImageNet object detection challenge: 200 object classes, 120,931 images
[Example labels: person, dog, chair; person, hammer, flower pot, power drill; person, car, helmet, motorcycle]

Compare to PASCAL VOC [EveVanWilWinZis '12]: 20 object classes, 22,591 images

[Deng et al. CHI'14] [Russakovsky et al. IJCV'15]

SLIDE 35

Annotation research

In-house annotation: Caltech 101, PASCAL [FeiFerPer CVPR'04, EveVanWilWinZis IJCV'10]


SLIDE 45

Annotation research: computer vision community and HCI community

Computer vision community:

  • In-house annotation: Caltech 101, PASCAL [FeiFerPer CVPR'04, EveVanWilWinZis IJCV'10]
  • Decentralized annotation: LabelMe, SUN [RusTorMurFre IJCV'07, XiaHayEhiOliTor CVPR'10]
  • AMT annotation with quality control: ImageNet [SorFor CVPR'08, DenDonSocLiLiFei CVPR'09]
  • Probabilistic models of annotators [WelBraBelPer NIPS'10]
  • Iterative bounding box annotation [SuDenFei AAAIW'10]
  • Reconciling segmentations [VitHay BMVC'11]
  • Efficient video annotation: VATIC [VonPatRam IJCV'12]
  • Building an attribute vocabulary [ParGra CVPR'11]

HCI community:

  • Estimating quality of crowd workers [SheProIpe KDD'08]
  • ESP Game, Peekaboom: gamification of image labeling [AhnDab CHI'04, AhnLiuBlu CHI'06]
  • Iterative workflow for handwriting recognition [DaiMauWel AAAI'10]
  • Galaxy Zoo: predictive models for consensus tasks [KamHacHor AAMAS'12]
  • Clowder: optimizing/personalizing workflows [WelMauDai AAAI'11]
  • Crowdsourcing taxonomy creation [ChiLitEdgWelLan CHI'13, BraMauWel HCOMP'13]

Sharing of insights between the two communities.

Scalable multi-label annotation [RusDenSuKraSatEtal IJCV'15, DenRusKraBerBerFei CHI'14]