Improving People Search Using Query Expansions
How Friends Help To Find People
Thomas Mensink & Jakob Verbeek LEAR Team, INRIA Rhˆ
- ne-Alpes
Grenoble, France
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 1 / 21
Improving People Search Using Query Expansions How Friends Help To - - PowerPoint PPT Presentation
Improving People Search Using Query Expansions How Friends Help To Find People Thomas Mensink & Jakob Verbeek LEAR Team, INRIA Rh one-Alpes Grenoble, France Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 1 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 2 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 2 / 21
◮ generative mixture model ◮ linear discriminant model
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 3 / 21
Angela Merkel Hu Jintao German Chancellor Angela Merkel shakes hands with Chinese President Hu Jintao (.. .) Kate Hudson Naomi Watts Kate Hudson and Naomi Watts, Le Divorce, Venice Film Festival - 8/31/2003.
◮ Manual construction of labeled training sets costly ◮ Continued labeling effort needed for new people
◮ Averaged over our set of 23 people with ground truth annotation
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 4 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 5 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 5 / 21
(Berg et al. CVPR ’04), (Everingham et al. BMVC ’06), (Stone et al. CVPR ’08)
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 6 / 21
(Ozkan & Duygulu CVPR’06), (Guillaumin, Mensink, Verbeek & Schmid CVPR’08)
◮ Given query name X ◮ Select all images with X in caption ◮ Analyze faces in those images to rank or classify them
◮ Text filtering makes queried person the most frequent ◮ Task is reduced to finding the big mode among clutter
◮ If text-filtering yields a precision < 40% ◮ Mode finding might return wrong person Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 6 / 21
◮ Find names co-occuring with the queried person: “friends” ◮ Query database for images with friends in caption, but not X ◮ Adds “negative” examples, different from typical query expansion in retrieval Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 7 / 21
◮ Find names co-occuring with the queried person: “friends” ◮ Query database for images with friends in caption, but not X ◮ Adds “negative” examples, different from typical query expansion in retrieval
Initial situation (left), models based on queries for friends (middle), simplified person identification (right).
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 7 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 8 / 21
◮ Hand labeling of all faces in images with one of the 23 query names in caption
◮ Detector of facial features: mouth, nose, eyes, . . .
(Everingham et al. BMVC ’06)
◮ Concatenate SIFT descriptors of all facial feature
Image gradients Keypoint descriptor
Examples of facial feature detection SIFT descriptor
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 9 / 21
◮ generative mixture model ◮ linear discriminant model
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 10 / 21
◮ Coded in assignment variable γ ∈ {0, 1, . . . , F}
◮ A-priori over γ: equal for γ = 0 ◮ Gaussian density for faces of X ◮ generic “background model” for other faces
F
F
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 11 / 21
◮ Coded in assignment variable γ ∈ {0, 1, . . . , F}
◮ A-priori over γ: equal for γ = 0 ◮ Gaussian density for faces of X ◮ generic “background model” for other faces
F
F
◮ Background model fixed, only foreground Gaussian and prior updated ◮ After convergence evaluate p(γ|F) Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 11 / 21
◮ Using images with friend in the caption but without X ◮ At most 15 friends, at least 5 images per friend
N
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 12 / 21
◮ Using images with friend in the caption but without X ◮ At most 15 friends, at least 5 images per friend
N
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 12 / 21
7 16 29 30 39 39 40 42 44 45 45 46 47 50 51 51 52 53 53 54 55 55 55 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Precision Percentage of faces representing queried person
Comparing mixture model without (green), and with (yellow) query expansion
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 13 / 21
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.4 0.5 0.6 0.7 0.8 0.9 1 Recall Precision
Precision averaged over the 23 queries at different levels of recall
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 14 / 21
◮ Too little data to allow learning of richer model
◮ Laplace prior for sparsity in the weight vector
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 15 / 21
◮ Too little data to allow learning of richer model
◮ Laplace prior for sparsity in the weight vector
◮ random set of faces without X in caption ◮ faces in query expansion Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 15 / 21
◮ on average only 44% correct
◮ most of the time
◮ Learn initial classifier from all faces after text search ◮ Re-label most suspicious faces as negative ◮ Re-train classifier using new labels ◮ Repeat until one face per positive image is left Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 16 / 21
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.4 0.5 0.6 0.7 0.8 0.9 1 Recall Precision
Precision averaged over the 23 queries at different levels of recall
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 17 / 21
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.4 0.5 0.6 0.7 0.8 0.9 1 Recall Precision
Precision averaged over the 23 queries at different levels of recall
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 18 / 21
◮ public data set, 13.000 hand labeled faces, no captions
In each row: top 10 ranked faces for one person
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 19 / 21
◮ Generative model benefits most from expansion ◮ Discriminative model yields best performance
◮ These remain the most difficult cases
◮ +10% precision compared to our CVPR’08 work ◮ +20% precision compared to Ozkan & Duygulu CVPR’06 Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 20 / 21
Mensink & Verbeek (INRIA) Improving People Search Using Query Expansions ECCV 2008 21 / 21