Adaptation for Objects and Attributes Kristen Grauman Department of - PowerPoint PPT Presentation

Adaptation for Objects and Attributes Kristen Grauman Department of Computer Science University of Texas at Austin With Adriana Kovashka (UT Austin), Boqing Gong (USC), and Fei Sha (USC)

Learning-based visual recognition Last 10+ years: impressive strides by learning appearance models (usually discriminative). CAR! CAR New image CAR Image features Annotator NOT CAR Training images

Typical assumptions 1. Test set will look like the training set. 2. Human labelers “see” the same thing.

Mismatched domains TRAIN TEST Flickr YouTube

Mismatched domains TRAIN TEST Catalog images Mobile phone photos

Mismatched domains TRAIN TEST ImageNet PASCAL VOC

Mismatched domains “It is worthwile to note that, even with 140K training ImageNet images, we do not perform as well as with 5K PASCAL VOC training images.” – Perronnin et al. CVPR 2010 TRAIN TEST ImageNet PASCAL VOC

Mismatched domains Problem: Poor cross-domain generalization • Different underlying distributions • Overfit to datasets’ idiosyncrasies Possible solution: Unsupervised domain adaptation

Unsupervised domain adaptation Setup Source domain (with labeled data) Target domain (no labels for training) Different distributions Objective Learn classifier to work well on the target

Much recent research Correcting sampling bias + - + + [This work] - - [Sethy et al., ’09] - [Sugiyama et al., ’08] [Muandet et al., ’13] [Pan et al., ’09] [Huang et al., Bickel et al., ’07] [Gong et al., ’12] Inferring [Argyriou et al, ’08] [Sethy et al., ’06] [Chen et al., ’12] [Daumé III, ’07] domain- [Shimodaira, ’00] [Gopalan et al., ’11] [Blitzer et al., ’06] invariant [Evgeniou and Pontil, ’05] features + - ++ [Duan et al., ’09] -- -+ + [Duan et al., Daumé III et al., Saenko et al., ’10] - + - + - + + [Kulis et al., Chen et al., ’11] - Adjusting mismatched models

Problem Existing methods attempt to adapt all source data points, including “hard” ones. Source Target

Problem Existing methods attempt to adapt all source data points, including “hard” ones. Our idea Automatically identify the “most adaptable” instances Use them to create series of easier auxiliary domain adaptation tasks [Gong et al., ICML 2013]

Landmarks Landmarks are labeled source instances distributed similarly to the target domain. Source Target [Gong et al., ICML 2013]

Landmarks Landmarks are labeled source instances distributed similarly to the target domain. Source Roles: Ease adaptation difficulty Provide discrimination (biased to target) Target [Gong et al., ICML 2013]

Key steps Coarse Source Landmarks Target Fine- grained 1 Identify landmarks at multiple scales. [Gong et al., ICML 2013]

Key steps 3 Obtain domain- invariant features 4 Construct auxiliary domain 2 adaptation tasks Predict target labels [Gong et al., ICML 2013]

Identifying landmarks Objective Source Target [Gong et al., ICML 2013]

Maximum mean discrepancy (MMD) Empirical estimate [Gretton et al. ’06] a universal RKHS kernel function induced by the l -th landmark (from the source domain) [Gong et al., ICML 2013]

Method for identifying landmarks Integer programming where [Gong et al., ICML 2013]

Method for identifying landmarks Convex relaxation [Gong et al., ICML 2013]

Scale for landmark similarity? Gaussian kernels How to choose the bandwidth? Our solution: Examine distributions at multiple granularities Multiple bandwidths  multiple sets of landmarks [Gong et al., ICML 2013]

Landmarks at multiple scales Headphone Mug Target target σ=2 6 0 σ=2 Source 22 -3 σ=2 Unselected [Gong et al., ICML 2013]

Key steps Construct auxiliary domain 2 adaptation tasks [Gong et al., ICML 2013]

Constructing easier auxiliary tasks Source Landmarks Target At each scale σ Intuition: distributions are closer (cf. Theorem 1) [Gong et al., ICML 2013]

Constructing easier auxiliary tasks New source Landmarks New target At each scale σ Intuition: distributions are closer (cf. Theorem 1) [Gong et al., ICML 2013]

Constructing easier auxiliary tasks Each task provides new basis of features via geodesic flow kernel (GFK): - Integrate out domain changes - Obtain domain-invariant representation [Gong, et al. ’12] [Gong et al., CVPR 2012]

Key steps MKL 3 Obtain domain- invariant features Construct auxiliary domain 2 adaptation tasks

Combining features discriminatively Multiple kernel learning on the labeled landmarks Arriving at domain-invariant feature space Discriminative loss biased to the target

Key steps 3 Obtain domain- invariant features 4 Construct auxiliary domain 2 adaptation tasks Predict target labels

Experiments Four vision datasets/domains on visual object recognition [Griffin et al. ’07, Saenko et al. 10’] Four types of product reviews on sentiment analysis Books, DVD, electronics, kitchen appliances [Biltzer et al. ’07]

Cross-dataset object recognition

Datasets as domains? ASSUMED Domain 2 Domain 1 Domain 3 Domain 4 Domain 5

Datasets as domains? REALITY Domain 2 Domain 1 Domain 3 Domain 8 Domain 7 Domain 6 Domain 9 Domain 5 Domain 10 Domain 4 Domain 5

Datasets as domains? REALITY Domain 2 Domain 1 Domain 3 Dataset != Domain Domain 8 Domain 7 Cross- dataset adaptation is suboptimal Domain 6 Domain 9 Domain 5 Domain 10 Domain 4 Domain 5

How to define a domain? NLP : Language -specific domains Speech : Speaker -specific domains Vision : ?? pose -specific? illumination -specific? occlusion ? image resolution ? background ? Challenges: Many continuous factors vs. few discrete Factors overlap and interact

Discovering latent visual domains We propose to discover domains – “reshaping” them to cross dataset boundaries Maximum distinctiveness MMD where Maximum learnability Determine K with domain-wise cross-validation [Gong et al., NIPS 2013]

Results: discovering domains Discovered Discovered domain II domain I [Gong et al., NIPS 2013]

Results: discovering domains Cross-dataset Cross-viewpoint action recognition object recognition 42 50 41 49 40 48 39 Accuracy 47 Accuracy 38 46 37 45 36 44 35 43 34 42 33 Domains= Hoffman et Domains= Hoffman et Discovered Discovered datasets al. 2012 datasets al. 2012 domains (ours) domains (ours) Domain I Domain II

Summary so far landmarks labeled source instances distributed similarly to the target auxiliary tasks provably easier to solve discriminative loss despite unlabeled target reshaping datasets to latent domains discover cross-dataset domains maximally distinct & learnable

Typical assumptions 1. Test set will look like the training set. 2. Human labelers “see” the same thing.

Visual attributes • High-level semantic properties shared by objects • Human-understandable and machine-detectable high outdoors flat metallic heel brown red indoors has- four-legged ornaments [Oliva et al. 2001, Ferrari & Zisserman 2007, Kumar et al. 2008, Farhadi et al. 2009, Lampert et al. 2009, Endres et al. 2010, Wang & Mori 2010, Berg et al. 2010, Branson et al. 2010, Parikh & Grauman 2011, …]

Standard approach Learn one monolithic model per attribute “formal” Vote on Annotator A labels “not formal” Annotator B Annotator C

Problem There may be valid perceptual differences within an attribute. Formal? More ornamented? User labels: User labels: 50% “yes” 50% “first” or 20% “second” 50% “no” 30% “equally” Binary attribute Relative attribute

Imprecision of attributes Fine-grained meaning Overweight? or just Chubby?

Imprecision of attributes Context Is formal? = formal wear for a conference? OR = formal wear for a wedding?

Imprecision of attributes Cultural Is blue or green ? English : “blue” Russian : “neither” (“голубой” vs. “синий”) Japanese : “both” (“ 青 ” = blue and green)

But do we need to be that precise? Yes. Applications like image search require that user’s perception matches system’s predictions. “white high heels” “less formal than these” [WhittleSearch, Kovashka et al. CVPR 2012]

Our idea • Treat learning perceived attributes as an adaptation problem. • Adapt generic attribute model with minimal user-specific labeled examples. • Obtain implicit user-specific labels from user’s search history [Kovashka and Grauman, ICCV 2013]

Our idea “formal” Vote on labels “not formal” “formal” “formal” “not formal” “not formal” [Kovashka and Grauman, ICCV 2013]

Learning adapted attributes • Adapting binary attribute classifiers: Given user-labeled data and generic model , J. Yang et al. ICDM 2007.

Learning adapted attributes • Adapting relative attribute rankers: Given user-labeled data and generic model , B. Geng, et al. TKDE 2010.

Adaptation for Objects and Attributes Kristen Grauman Department of - PowerPoint PPT Presentation

Adaptation for Objects and Attributes Kristen Grauman Department of Computer Science University of Texas at Austin With Adriana Kovashka (UT Austin), Boqing Gong (USC), and Fei Sha (USC) Learning-based visual recognition Last 10+ years:

61A Lecture 16 Terminology: Python object system: Functions are objects. Wednesday, October 3

Data Examples Announcements Examples: Objects Land Owners Instance attributes are found before

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Objects and Classes Objects with attributes Objects are the basis of object-oriented programming.

Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Minema Minema

Introduction to Data Science: Principles ordered categorical data do not have magnitude

From E/R Diagrams to Relations Entity set relation Attributes attributes

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Coastal Adaptation Kellie Fisher FCERM Senior Advisor Why Adaptation? Adaptation to a

61A Lecture 16 Wednesday, October 3 Terminology: Attributes, Functions, and Methods 2

1 Attributes, Functions, and Methods Looking Up Attributes by Name All objects have attributes,

Data Examples Announcements Examples: Objects Land Owners Instance attributes are found before

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

ReNoun Fact Extraction for Nominal Attributes Mohamed Yahya, Steven Whang, Rahul Gupta, and

Descriptor Codes with Attributes Descriptor Codes with Attributes Oscar R. Cantu August 2009

Community Based Climate Change Adaptation: a Case of Community Forestry Programme of Nepal.

Software Evolvability: An industrys view 2 nd Open Workshop on Resilience in Computing Systems

Causal Embeddings For Recommendation Stephen Bonner & Flavian Vasile Criteo Research

Following the Energy Sectors Roadmap Carol Hawk CEDS R&D Program Manager Energy Sector

Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit

Welcome to the Section of: Engineering Design & Product Development And now, welcome

Injective stabilization in categories Alex Sorokin Northeastern University, Boston P arnu,

M -coextensivity and the strict refinement property Michael Hoefnagel University of

Adaptation for Objects and Attributes Kristen Grauman Department of - PowerPoint PPT Presentation

Adaptation for Objects and Attributes Kristen Grauman Department of Computer Science University of Texas at Austin With Adriana Kovashka (UT Austin), Boqing Gong (USC), and Fei Sha (USC) Learning-based visual recognition Last 10+ years:

61A Lecture 16 Terminology: Python object system: Functions are objects. Wednesday, October 3

Data Examples Announcements Examples: Objects Land Owners Instance attributes are found before

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Objects and Classes Objects with attributes Objects are the basis of object-oriented programming.

Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Minema Minema

Introduction to Data Science: Principles ordered categorical data do not have magnitude

From E/R Diagrams to Relations Entity set relation Attributes attributes

Objects &amp; Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Coastal Adaptation Kellie Fisher FCERM Senior Advisor Why Adaptation? Adaptation to a

61A Lecture 16 Wednesday, October 3 Terminology: Attributes, Functions, and Methods 2

1 Attributes, Functions, and Methods Looking Up Attributes by Name All objects have attributes,

Data Examples Announcements Examples: Objects Land Owners Instance attributes are found before

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

ReNoun Fact Extraction for Nominal Attributes Mohamed Yahya, Steven Whang, Rahul Gupta, and

Descriptor Codes with Attributes Descriptor Codes with Attributes Oscar R. Cantu August 2009

Community Based Climate Change Adaptation: a Case of Community Forestry Programme of Nepal.

Software Evolvability: An industrys view 2 nd Open Workshop on Resilience in Computing Systems

Causal Embeddings For Recommendation Stephen Bonner &amp; Flavian Vasile Criteo Research

Following the Energy Sectors Roadmap Carol Hawk CEDS R&amp;D Program Manager Energy Sector

Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit

Welcome to the Section of: Engineering Design &amp; Product Development And now, welcome

Injective stabilization in categories Alex Sorokin Northeastern University, Boston P arnu,

M -coextensivity and the strict refinement property Michael Hoefnagel University of

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Causal Embeddings For Recommendation Stephen Bonner & Flavian Vasile Criteo Research

Following the Energy Sectors Roadmap Carol Hawk CEDS R&D Program Manager Energy Sector

Welcome to the Section of: Engineering Design & Product Development And now, welcome