SLIDE 1

LTR at GetYourGuide Marketplace

A Journey through our experience

Ashraf Aaref and Felipe Besson June 13th 2018

MICES 2018

MIX-CAMP E-COMMERCE SEARCH

SLIDE 2

Who are we?

We work on the search team at GetYourGuide (GYG)

  • Ashraf
    ○ Software Engineer
  • Felipe
    ○ Data Engineer

SLIDE 3

Agenda

  • What is GetYourGuide and our challenges?
  • V1: Our first try to apply LTR
  • Lessons learned
  • Next step, V2?
  • Questions
SLIDE 4

What is GetYourGuide?

GetYourGuide is a marketplace for activities: guided tours, ticketed attractions, airport transfers, and many other experiences…

  • +33K Activities
  • +20 Languages
  • +7K destinations
  • +400 Employees
SLIDE 5

Full-text Search

  • Location driven
  • Discovery

Rank = Business metrics + Text relevance

SLIDE 6

Location pages (LPs)

  • Location driven
  • Dates are very important
  • High-intent customers
  • Paid traffic

Rank = Business metrics

SLIDE 7

Problems with LP Ranking

  • Focus on business metrics
  • Customer intent (search keywords) is ignored
    ○ "Eiffel Tower ticket" = "Eiffel Tower restaurant"

  • Difficult to introduce new and diverse products
  • We needed to learn how to rank activities in LPs!
SLIDE 8

Let the machine do it for you! (LTR)

[Figure: LTR overview, extracted from the ACML 2009 tutorial, Nov. 2, 2009, Nanjing]

slide-9
SLIDE 9

First iteration (V1): Scope and decisions

slide-10
SLIDE 10

Learning to Rank (LTR) at GYG

Apply machine learning to introduce relevance factors into our ranking formula. Use our user-intent data to make LP ranking dynamic.

slide-11
SLIDE 11

V1 Focus

"Statue of Liberty boat tour"

location intention

  • Vertical: Points of Interest

○ Ticket, Tour, Museum, Historic site, park, …

  • Only in English (we have 22 languages)
  • Location pages have no explicit user query

○ Search Keywords:

SLIDE 12

MVP mindset

Follow the standard steps of an LTR solution:

Collect the judgements → Extract features → Train & validate the model → Run an A/B experiment → Analyse results → Define the next iteration

SLIDE 13

We started the journey!

SLIDE 14

Judgement List

[Figure: example judgement list for q = "Eiffel Tower restaurant", showing three documents with judgements 3, 2, and 1]
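For illustration, one might represent such a judgement list as a simple mapping before feature extraction; the query, document ids, and grades below are made-up examples on the 0-3 scale described on the next slide.

```python
# Hypothetical in-memory representation of a judgement list
judgements = {
    "eiffel tower restaurant": {
        "activity-101": 3,  # e.g. dinner at the Eiffel Tower
        "activity-202": 2,
        "activity-303": 1,
    },
}
```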

SLIDE 15
Human labeling: judgement list

  • Judgements were collected from domain experts
    ○ Internal stakeholders of GYG
  • Judgement scale
    ○ 0 - 3
  • ~30k judgements
  • Pre-analysis of the current rank
    ○ NDCG@7 = 0.55
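As context for that 0.55 baseline, here is a minimal sketch of NDCG@k over graded judgements (0-3). The exponential-gain variant below is one common convention; the deck does not specify the exact formula, and the example grades are made up.

```python
import math

def dcg_at_k(relevances, k):
    # Discounted cumulative gain with exponential gain (2^rel - 1)
    return sum(
        (2 ** rel - 1) / math.log2(i + 2)
        for i, rel in enumerate(relevances[:k])
    )

def ndcg_at_k(relevances, k):
    # Normalize by the DCG of the ideal (descending) ordering
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0

# Grades (0-3) of the current top-7 results for one query, in rank order
print(ndcg_at_k([2, 3, 0, 1, 3, 0, 2], k=7))
```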

SLIDE 16

Human labeling: judgement list

Pros
  • Good approach when data is incomplete/inconsistent
  • Works when what counts as a relevant result is still unclear
  • No need to normalize queries deeply

Cons
  • Relevance is subjective from user to user
  • Hard to scale
  • Crowdsourcing is expensive

SLIDE 17

Enriching Judgements with features

SLIDE 18

Feature Engineering

Query-document features
  • BM25 of single text fields
  • Multi-match combinations

Business-metric features
  • Raw metrics: clicks, bookings, impressions
  • Rates: CTR, CR

Document features
  • Activity attributes: price, duration, # reviews

SLIDE 19

How to collect these features?

SLIDE 20
Our stack

  • Elasticsearch
    ○ LTR Plugin by OpenSource Connections
  • RankLib
  • Databricks to run our data pipelines
    ○ Collect features
    ○ Train and validate models
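To make the stack concrete: a minimal sketch of registering a feature set with the Elasticsearch LTR plugin (its documented `_ltr/_featureset` API). The feature names, fields, and URL are illustrative, not GYG's actual configuration.

```python
import requests

# Assumes the LTR feature store was initialised once with: PUT /_ltr
featureset = {
    "featureset": {
        "features": [
            {
                # Query-document feature: BM25 score of the keywords on the title field
                "name": "title_bm25",
                "params": ["keywords"],
                "template": {"match": {"title": "{{keywords}}"}},
            },
            {
                # Business-metric feature: a precomputed CTR stored on the document
                "name": "ctr",
                "params": [],
                "template": {
                    "function_score": {
                        "field_value_factor": {"field": "ctr", "missing": 0}
                    }
                },
            },
        ]
    }
}

resp = requests.put(
    "http://localhost:9200/_ltr/_featureset/featureset_v1", json=featureset
)
resp.raise_for_status()
```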

SLIDE 21

New pipeline to collect features

[Diagram: judgement list + configuration (features, queries) → training set → model training and validation → LTR plugin ("Eiffel Tower" example: model v1 + featureset v1)]
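Feature collection for the judged (query, document) pairs can then use the plugin's logging extension: filter to the judged documents, attach the feature set as a named `sltr` query, and read the computed feature values from each hit. Index name, ids, and parameters below are illustrative.

```python
import requests

log_query = {
    "query": {
        "bool": {
            "filter": [
                {"terms": {"_id": ["activity-101", "activity-202"]}},  # judged docs
                {
                    "sltr": {
                        "_name": "logged_features",
                        "featureset": "featureset_v1",
                        "params": {"keywords": "eiffel tower ticket"},
                    }
                },
            ]
        }
    },
    # Ask the plugin to return each feature's value per hit
    "ext": {
        "ltr_log": {
            "log_specs": {"name": "log_entry", "named_query": "logged_features"}
        }
    },
}

resp = requests.post("http://localhost:9200/activities/_search", json=log_query)
for hit in resp.json()["hits"]["hits"]:
    print(hit["_id"], hit["fields"]["_ltrlog"])
```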

SLIDE 22

Training and validating Models

SLIDE 23
Goals

  • Have a model suitable for location pages
    ○ relevance + business metrics
  • Evaluation metric: NDCG@10
  • Success metric (business): CTR (click-through rate)
  • Constraints
    ○ Do not include user features

SLIDE 24
Best V1 Model

  • LambdaMART
  • NDCG@10 = 0.9282

Query-document features
  • Title
  • Highlight
  • Description
  • Best-field multi-match

Business-metric features
  • Clicks
  • Bookings
  • Impressions
  • CR

Document features
  • #Reviews
  • Review rating
  • Deal price
  • Best seller
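For the training step, a sketch of how a LambdaMART model is typically trained with RankLib (ranker type 6), optimising NDCG@10 as on the slide; file names are illustrative and the deck does not state the exact flags used.

```python
import subprocess

# train.txt uses the LETOR/SVMrank format RankLib expects, e.g.:
#   3 qid:1 1:9.8 2:0.031 ... # activity-101
#   0 qid:1 1:1.2 2:0.004 ... # activity-202
subprocess.run(
    [
        "java", "-jar", "RankLib.jar",
        "-train", "train.txt",
        "-ranker", "6",           # 6 = LambdaMART
        "-metric2t", "NDCG@10",   # metric to optimise during training
        "-save", "model_v1.txt",  # serialized model for the LTR plugin
    ],
    check=True,
)
```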
SLIDE 25

We got a model; we just need to run it in production!
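Deploying then means uploading the serialized model under the feature set and applying it in a rescore phase at query time. This follows the LTR plugin's documented flow; the model name, index, and window size are illustrative.

```python
import requests

# Upload the trained RankLib model under the feature set
with open("model_v1.txt") as f:
    definition = f.read()

requests.post(
    "http://localhost:9200/_ltr/_featureset/featureset_v1/_createmodel",
    json={
        "model": {
            "name": "model_v1",
            "model": {"type": "model/ranklib", "definition": definition},
        }
    },
).raise_for_status()

# Query time: cheap first-pass match, then rescore the top 100 with the model
search = {
    "query": {"match": {"title": "eiffel tower ticket"}},
    "rescore": {
        "window_size": 100,
        "query": {
            "rescore_query": {
                "sltr": {
                    "model": "model_v1",
                    "params": {"keywords": "eiffel tower ticket"},
                }
            }
        },
    },
}
requests.post("http://localhost:9200/activities/_search", json=search)
```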

SLIDE 26

Best V1 model didn't work

[Figure: current rank vs. model rank, side by side, for "Eiffel tower skip-the-line ticket"]

SLIDE 27

We couldn't put it in production. Shall we give up?

SLIDE 28

No, we never give up

SLIDE 29
Main lessons learned

  • Relevance of results for LPs
  • Judgement list extraction
  • Quality of our queries
  • Distribution of judgements

SLIDE 30
What is relevance for your business?

  • Our use case: location pages
    ○ First point of contact for many visitors
    ○ Few rank positions to change
    ○ Business metrics matter (e.g., revenue)
  • Expert labeling
    ○ Is this document relevant for this query? (0 - 3)
    ○ Is this document a potential conversion?

SLIDE 31
Another approach

  • Data-driven approach for e-commerce
    ○ Perceived utility of:
      ■ search results (click-through rate)
      ■ product page (add-to-cart)
    ○ Overall user satisfaction (conversion)
    ○ Business value (revenue)
  • Experts could refine judgements collected from data

Reference: "On Application of Learning to Rank for E-Commerce Search" by Santu, Sondhi, and Zhai (2017)
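A hypothetical sketch of such a data-driven grading scheme, in the spirit of Santu et al.: map behavioural signals of increasing commitment to the 0-3 scale. The mapping below is an assumption for illustration, not the paper's or GYG's actual rule.

```python
def implicit_judgement(impressions, clicks, add_to_carts, bookings):
    """Derive a 0-3 grade for a (query, document) pair from behaviour
    (hypothetical mapping, strongest signal wins)."""
    if impressions == 0:
        return None   # no evidence for this pair
    if bookings > 0:
        return 3      # booked: conversion / business value
    if add_to_carts > 0:
        return 2      # added to cart: perceived product-page utility
    if clicks > 0:
        return 1      # clicked: perceived result utility
    return 0          # shown but ignored
```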

SLIDE 32
Quality of our queries

  • Didn't consider the real user query, only the keywords the search engine matched
  • The location part is not relevant for scoring in many queries
    ○ Example: "Statue of Liberty boat tour": all results already contain this location; the intention part ("boat tour") is what discriminates

SLIDE 33

Distribution of our Judgements per page

[Chart: percentage of judgements (%) per location page id]

SLIDE 34

Everything is connected

[Diagram: insufficient criteria to judge → expert judgements (not balanced, no business metrics considered) and queries (low diversity, location noise) → bad judgements fed into the LTR pipeline → bad scoring → model problems]

SLIDE 35
Next steps for V2

  • Collect judgements from data
  • Redefine our criteria for measuring relevance
  • Apply LTR to other GYG search features
  • Extract the intentions from the keywords
    ○ Query understanding might help
  • Judge the judgements very often

SLIDE 36

We hope to turn on V2 and fly

Thank you

SLIDE 37

Questions

@AshrafAaref @fmbesson