Transferring Deep Learning into a Recommender System
By Christophe Duong, Data Scientist
Big Data Paris 2017
Transferring Deep Learning into a Recommender System By Christophe - - PowerPoint PPT Presentation
Transferring Deep Learning into a Recommender System By Christophe Duong, Data Scientist Big Data Paris 2017 High-end Product -> Appearance is crucial Fewer recurrent buyers -> Web visit patterns are essential -> Short visit
By Christophe Duong, Data Scientist
Big Data Paris 2017
sales)
recommender systems
summer)
Six steps to a predictive model
Data Exploration & Understandin g Data Preparation Model Creation Evaluation Deployment Data Acquisition
Dataset 1
Scored dataset Scored dataset
Iteration 1 Iteration 2 Iteration n
Creating a predictive model is a highly iterative process. Data Science Studio enables its users to create and manage these projects from end-to-end. This process is not industry specific, and can be applied to many use cases.
Dataset 2 Dataset n
Business Understandin g
Adapted from the CRISP-DM methodology
Recommenders as features Machine learning to optimize purchasing probability Combine Recommend Describe
Cleaning, combining and enrichment of data Recommendation Engines Optimization of home display
the application automatically runs and compiles heterogeneous data Generation of recommendations based on user behaviour
Every customer is shown the 10 sales he is the most likely to buy Customer visits Purchases
Meta model combine recommendations to directly optimize purchasing probability
Meta Model
+7% revenue
Sales information Sales Images
(A/B testing)
Sales Images
Labelling Model
Pool + Palm Trees Hotel + Mountains Pool + Forest + Hotel + Sea Sea + Beach +Forest + Hotel Sales descriptions
CONTENT BASED
Recommender System
“I don’t have GPUs server” “I don’t have a deep leaning expert” “I don’t have labelled data” (or too few) “I don’t have the time to wait for model training ” I don’t want to pay for private apis” / “I’m afraid their labelling will change over time”
“I don’t have (or few) labelled data”
PLACES DATABASE VPG SUN DATABASE 205 categories 2.5 M images 307 categories 110 K images
tower: 0.53 skyscraper: 0.26 swimming_pool/outdoor: 0.65 inn/outdoor: 0.06
If there is open data, there is an open pre trained model !
Example with Places (Caffe Model Zoo) :
If you know how to recognize… after a little bit of training… you will be able to recognize Transfer Learning
PLACES DATABASE VOYAGE PRIVE SUN DATABASE Training (optional) Pre-trained model VGG16 tower: 0.53 skyscraper: 0.26 Re-Training Transferred Data : Last convolutional layer features Re-trained model TensorFlow 2 fully connected layers Caffe Model Zoo GPU CPU GPU
Leverage existing knowledge !
Accuracy: 72%, Top-5 Acc: 90 % > state of the art on dataset alone
Issue with our approach:
Topic scores determine the importance of topics in an image
TOPIC TOPIC SCORE (%) Golf course – Fairway – Putting green 31 Hotel – Inn – Apartment building outdoor 30 Swimming pool – Lido Deck – Hot tub outdoor 22 Beach – Coast - Harbor 17 TOPIC TOPIC SCORE (%) Tower – Skyscraper – Office building 62 Bridge – River – Viaduct 38
All Visits :
First Recommendation
Only Images ?
Third column = Mix