CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search - - PowerPoint PPT Presentation

crowdsearch exploiting crowds for accurate real time
SMART_READER_LITE
LIVE PREVIEW

CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search - - PowerPoint PPT Presentation

CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search on Mobile Phones Michael Fusaro Multimedia Search Modern mobile phones are powerful Most have powerful built-in cameras Effective search capabilities for multimedia are a


slide-1
SLIDE 1

CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search on Mobile Phones

Michael Fusaro

slide-2
SLIDE 2

Worcester Polytechnic Institute

Multimedia Search

Modern mobile phones are powerful Most have powerful built-in cameras Effective search capabilities for multimedia are a necessity Problems Image searching is a tough nut to crack Video search even harder

slide-3
SLIDE 3

Worcester Polytechnic Institute

Idea: Crowdsourcing

Crowdsourcing: outsourcing tasks to a undefined group of people Improve image search Humans are good at recognizing images How did CrowdSearch harness this?

slide-4
SLIDE 4

Worcester Polytechnic Institute

Amazon Mechanical Turk

Crowdsourcing Internet marketplace that enables programmers to coordinate tasks that are usually not feasible with a computer Accessible through an open API Users need to be paid

slide-5
SLIDE 5

Worcester Polytechnic Institute

What Is CrowdSearch

Accurate search system for mobile phones Consists of 3 parts

  • 1. Mobile phone application

submit queries display results

  • 2. Back-end server

automated image search submit AMT tasks

  • 3. Crowdsourcing system
  • 1. validate automated image search results
slide-6
SLIDE 6

Worcester Polytechnic Institute

CrowdSearch Application

slide-7
SLIDE 7

Worcester Polytechnic Institute

Harnessing Amazon Mechanical Turk Efficiently

Realities Tasks cost money Significant delays Optimize for cost Post tasks serially pro: least expensive con: takes longer Optimize for delay Post tasks in parallel pro: faster con: expensive

slide-8
SLIDE 8

Worcester Polytechnic Institute

Harnessing Amazon Mechanical Turk Effectively

slide-9
SLIDE 9

Worcester Polytechnic Institute

CrowdSearch: Algorithm

CrowdSearch tries to strike a balance between the serial and parallel posting schemes Goal of Algorithm Return at least one positive result within the predefined deadline

slide-10
SLIDE 10

Worcester Polytechnic Institute

The Algorithm

For all current validation tasks For each partial sequence received Traverse all possible sequences that lead to a majority 'Yes' answer Calculate probability of sequence occurring under the deadline If the sum of all these probabilities is greater or equal to the threshold: return true Otherwise: return false Two important functions DelayPredict() ResultPredict()

slide-11
SLIDE 11

Worcester Polytechnic Institute

Example

slide-12
SLIDE 12

Worcester Polytechnic Institute

Probability of 'YNYY' occurring after 'YNY' is 0.16 / 0.25 = 0.64

How ResultPredict() Works

slide-13
SLIDE 13

Worcester Polytechnic Institute

AMT validation delay has two parts acceptance delay submission delay

How DelayPredict() Works

slide-14
SLIDE 14

Worcester Polytechnic Institute

Back-end Image Search Engine

Two major steps happen during a search

  • 1. Extract local features from

image Uses a modified form of Scale-invariant feature transform (SIFT)

  • 2. Identify closest matching image

using these features

slide-15
SLIDE 15

Worcester Polytechnic Institute

Experiment: Does it work?

Back-end server was trained on thousands of images Separated into 4 categories Human faces Buildings Flowers Book covers 500 test images used for experiment Three performance characteristics measured precision recall cost

slide-16
SLIDE 16

Worcester Polytechnic Institute

Results - Precision

slide-17
SLIDE 17

Worcester Polytechnic Institute

Results - Recall

slide-18
SLIDE 18

Worcester Polytechnic Institute

Results - Cost

slide-19
SLIDE 19

Worcester Polytechnic Institute

Conclusions

CrowdSearch algorithm was able to optimize for delay and money constraints Achieved > 95% search precision for several categories of images

slide-20
SLIDE 20

Worcester Polytechnic Institute

Questions?

slide-21
SLIDE 21

Worcester Polytechnic Institute

Bibliography

CrowdSearch: Exploiting Crowds for Accurate Real-time Image Search

  • n Mobile Phones. Yan, T., Kumar, V., Ganesan, D. In Proceedings of the

8th International Conference on Mobile Systems, Applications, and Services (MobiSys). San Francisco, CA, June, 2010. Amazon Mechanical Turk. 5 February 2011. <http://en.wikipedia.

  • rg/wiki/Amazon_Mechanical_Turk>

Scale-invariant Feature Transform. 5 February 2011. <http://en.wikipedia.

  • rg/wiki/Scale-invariant_feature_transform>