Applying Deep Learning To Airbnb Search Unique Challenges High - - PowerPoint PPT Presentation

applying deep learning to airbnb search unique challenges
SMART_READER_LITE
LIVE PREVIEW

Applying Deep Learning To Airbnb Search Unique Challenges High - - PowerPoint PPT Presentation

MALAY HALDAR / April 2019 / QCon Applying Deep Learning To Airbnb Search Unique Challenges High Value Transactions Sparse Listing Data Sparse User Data Users are picky when spending Each listing can be booked at Travel is not a daily use


slide-1
SLIDE 1

Applying Deep Learning To Airbnb Search

MALAY HALDAR / April 2019 / QCon

slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4

Unique Challenges

High Value Transactions Sparse Listing Data Sparse User Data Each listing can be booked at most 365 times a year. Travel is not a daily use case. Users are picky when spending hundreds of dollars.

slide-5
SLIDE 5

How Does Airbnb Search Work?

slide-6
SLIDE 6
slide-7
SLIDE 7

Evolution Of Search Ranking At Airbnb

2014 2015 2016 2017 2018 2019 Scoring Functions GBDT DNN

slide-8
SLIDE 8
slide-9
SLIDE 9

First Neural Net : April 2017

{ Continuous features with heavy feature engineering } 31 Relus L2 loss

slide-10
SLIDE 10

Key Learnings

  • Keep the first launch as simple as

possible

  • Manage expectations, aim for

neutral

slide-11
SLIDE 11

Second Neural Net: Lambdarank NN June 2017

  • sigmoid_cross_entropy_with_logits X lambdarank_weights

{ Booked listing features } { Unbooked listing features }

slide-12
SLIDE 12

Key Learnings

  • Pairwise preference
  • Tensorflow
slide-13
SLIDE 13

Third Neural Net: Hybrid Mar 2018

slide-14
SLIDE 14

DNN : June 2018

{ Raw query and listing stats } 127 Relus 83 Relus

slide-15
SLIDE 15
slide-16
SLIDE 16

Key Learnings

  • Transitioning to deep learning is

about scaling the system.

  • Complexity is easy to build,

simplicity is hard.

slide-17
SLIDE 17

Gains

slide-18
SLIDE 18

Problem Formulation

slide-19
SLIDE 19

Pointwise

slide-20
SLIDE 20

Pairwise

slide-21
SLIDE 21

Feature Engineering

Do Not Cook Your Features

slide-22
SLIDE 22

Continuous Features

slide-23
SLIDE 23

Categorical Features

Embedding Categorical Feature (0x3fc90)

  • 1.203
  • 1.862

1.701

  • 2.003

0.192

:

slide-24
SLIDE 24

System Engineering

  • I/O
  • I/O
  • I/O
slide-25
SLIDE 25
slide-26
SLIDE 26

Feature Importance

...This problem is easily solved by showing a rug (indicators for data points on the x-axis) or a histogram. The assumption of independence is the biggest issue with PD plots. It is assumed that the feature(s) for which the partial dependence is computed are not correlated with other features. For example, suppose you want to predict ...

https://christophm.github.io/interpretable-ml-book/pdp.html

slide-27
SLIDE 27

TopBot Analysis

slide-28
SLIDE 28

Hyperparameter Tuning Optional, not a necessity.

slide-29
SLIDE 29
slide-30
SLIDE 30

Many more interesting topics...

Failed models!

slide-31
SLIDE 31

Applying Deep Learning To Airbnb Search

Malay Haldar, Mustafa Abdool, Prashant Ramanathan, Tao Xu, Shulin Yang, Huizhong Duan, Qing Zhang, Nick Barrow-Williams, Bradley C. Turnbull, Brendan M. Collins, Thomas Legrand

https://arxiv.org/abs/1810.09591