1
play

1 Examples The ETH-80 Dataset (Bastian Leibe and Bernt Schiele) - PDF document

Outline Datasets and Dataset Creation Importance of datasets Existing datasets Issues with current datasets New ways of acquiring large and diverse datasets Visual Recognition and Search LabelMe: a database and web-based tool


  1. Outline Datasets and Dataset Creation • Importance of datasets • Existing datasets • Issues with current datasets • New ways of acquiring large and diverse datasets Visual Recognition and Search • LabelMe: a database and web-based tool Maysam Moussalem • Conclusion 2 Importance of datasets Existing datasets • Caltech 101 • Datasets needed at all stages of object recognition • Caltech 256  Learning visual models  Detecting and localizing instances of these models • PASCAL Visual Object Classes challenges  Evaluating performance • Oxford buildings, flowers datasets • A good dataset must be • CMU Face databases  Very large • MIT Objects and Scenes  Very diverse • Photo-tourism patches  Well-annotated • … • Drive research by providing common ground 3 4 Issues with current datasets… Examples • Unfortunately, most of these offer limited range of image variability!  Similar viewpoints and orientations  Sizes and image positions normalized The Oxford Flowers Dataset (Maria-Elena Nilsback  Little or no occlusion and background clutter and Andrew Zisserman)  Often only one instance of object in image  … 5 6 1

  2. Examples The ETH-80 Dataset (Bastian Leibe and Bernt Schiele) The Caltech 101 average image 8 (constructed by A. Torralba) 7 Problems with existing datasets • Some algorithms may exploit restrictions in datasets  E.g. those lacking scale, rotation invariance… A bit better… • Images are not challenging enough The Pascal 2006 average image (constructed by T. Malisiewicz)  More sophisticated algorithms might not show better results  Results tend to converge around 100% accuracy 9 10 Outline New ways of acquiring large and diverse datasets • Web-based annotation tools • Importance of datasets  Rely on collaborative effort of large population online • Existing datasets • Examples • Issues with current datasets  ESP • New ways of acquiring large and diverse datasets  Peekaboom • LabelMe: a database and web-based tool  LabelMe • Conclusion 11 12 2

  3. ESP (von Ahn and Dabbish) Peekaboom (von Ahn, Liu, and Blum) Two-player online game • Rules of the game •  Partners don’t know each other  Partners can’t communicate  Only thing in common: image  Objective is to type in same word Since 2003, 34, 334, 076 images • have been labeled this way! 13 14 LabelMe LabelMe: a database and web-based tool • Online annotation tool • Allows sharing of images and annotations • Provides many functionalities  Drawing polygons  Querying images  Browsing the database 15 16 LabelMe (technical specs) Browsing the images online • Runs on (almost) any web browser • Includes standard Javascript drawing interface • Stores resulting labels in XML file  Portable, annotations easy to extend • Provides Matlab toolbox for manipulating database  Database queries  Communication with online tool  Image transformations  … 17 18 3

  4. Downloading the dataset, or a part of it… Labeling the images (much slower!) User has to draw boundary • around image by placing polygon control points  How many control points should there be? Then, a popup balloon • comes up an user needs to give a name to the object  How to choose the label? 19 20 LabelMe: Examples of annotated scenes LabelMe: Issues and Concerns • Quality control  Provided by users who go over and correct labeling • Complexity of polygons drawn by users  Simple or convex polygons • Choice of objects to label  E.g. crowd of people: do you label individuals or all together  User decides • Labels themselves  Level of precision, specificity 21 22 LabelMe: Issues and Concerns Issues with polygons 23 24 4

  5. Issues with labels As a result of this extension What to do when users • choose labels such as  Car  Cars  Red car  Car frontal  Taxi  …? Analysis and retrieval hard • LabelMe + WordNet! •  Electronic dictionary Synonyms return (almost) the same results •  Tree with semantic  Here, motorcycle (left) and motorbike (right) categories 25 26 Interesting… Statistics • Description  Raw description entered by user; single or multiple words • Average  Average intensity of object patches with same description  Shown when at least 10 instances of object available • Occupied area  Percentage of pixels occupied relative to image size • Boundary points  Number of points used • Object location If you enter as query “apple”, first few entries are actually  Distribution of locations occupied by each instance • “pineapple”!!  Helps understand photographers’ biases 27 28 Summary and Conclusion • Importance of datasets • Existing datasets • Issues with current datasets • New ways of acquiring large and diverse datasets • LabelMe: a database and web-based tool • Conclusion 29 30 5

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend