Chocolate Chip Pancakes Ajay Mandlekar Anand Poozhikunnel - PowerPoint PPT Presentation

Chocolate ¡Chip ¡ Pancakes ¡ Ajay ¡Mandlekar ¡ Anand ¡Poozhikunnel ¡ Rahul ¡Bachal ¡

What ¡we ¡Tried ¡ • SVD ¡ • SVD++ ¡ • TimeSVD++ ¡ • KNN ¡ • KNN ¡on ¡Residuals ¡of ¡SVDs ¡ • RBM ¡ • Blending ¡on ¡Probe ¡ • Blending ¡on ¡Quiz ¡ ¡

General ¡Notes ¡on ¡Models ¡ • Models ¡were ¡trained ¡using ¡StochasEc ¡Gradient ¡Descent ¡(SGD), ¡with ¡ no ¡shuffling ¡ • For ¡most ¡part, ¡stuck ¡with ¡60 ¡epochs ¡for ¡training ¡ • Used ¡a ¡parameter ¡scan ¡to ¡cycle ¡through ¡many ¡different ¡ permutaEons ¡of ¡all ¡the ¡parameters ¡ • Used ¡MulEthreading, ¡by ¡spliQng ¡each ¡model ¡into ¡8 ¡different ¡ threads, ¡to ¡speed ¡up ¡Eme ¡per ¡epoch ¡

Latent ¡Factor ¡Models ¡ Ø SVD ¡ Ø Originally ¡just ¡used ¡movie ¡and ¡user ¡factors, ¡then ¡added ¡bias ¡ terms ¡which ¡improved ¡performance ¡ Ø Reduced ¡Eme ¡per ¡epoch ¡to ¡under ¡10 ¡seconds ¡using ¡mulE-‑ threading ¡ Ø Performance ¡increased ¡with ¡number ¡of ¡factors ¡used, ¡however, ¡ with ¡diminishing ¡returns ¡ Ø Best ¡performance ¡of ¡a ¡single ¡model ¡was ¡5.2% ¡using ¡2000 ¡factors ¡

Latent ¡Factor ¡Models ¡ v SVD++ ¡ v Began ¡with ¡a ¡naïve ¡implementaEon ¡which ¡took ¡on ¡the ¡order ¡of ¡10 ¡ hours ¡per ¡epoch ¡ v Moved ¡to ¡a ¡smarter ¡implementaEon ¡which ¡reduced ¡Eme ¡to ¡on ¡ the ¡order ¡of ¡10 ¡seconds ¡per ¡epoch ¡ v ¡At ¡first, ¡the ¡performance ¡of ¡this ¡model ¡was ¡only ¡5.3% ¡above ¡ water ¡(on ¡a ¡good ¡day) ¡ v Then, ¡a ¡few ¡well-‑hidden ¡bugs ¡were ¡exterminated, ¡giving ¡a ¡ performance ¡boost ¡up ¡to ¡about ¡6.3% ¡above ¡water ¡ v AddiEonal ¡parameter ¡scans ¡resulted ¡in ¡around ¡6.6% ¡above ¡water ¡

Latent ¡Factor ¡Models ¡ • AutomaEc ¡Parameter ¡Tuner ¡2 ¡-‑ ¡APT2 ¡ • Implemented ¡from ¡paper ¡by ¡Toscher ¡and ¡Jahrer ¡ • For ¡a ¡parameter ¡p, ¡try ¡new ¡values ¡p*e ¡and ¡p/e ¡for ¡e ¡< ¡1. ¡ • Increase ¡e ¡by ¡e^0.9 ¡if ¡new ¡values ¡result ¡in ¡higher ¡RMSE ¡ • Running ¡parameter ¡scan ¡for ¡6 ¡parameters ¡with ¡2 ¡passes ¡ through ¡all ¡parameters ¡and ¡5 ¡tries ¡per ¡parameter ¡took ¡~ ¡10 ¡ hours ¡ • New ¡parameters ¡improved ¡score ¡by ¡~10-‑20 ¡bps ¡

Latent ¡Factor ¡Models ¡ • TimeSVD++ ¡ • Implemented ¡three ¡variants ¡of ¡TimeSVD++. ¡ • Model ¡A) ¡Only ¡uElized ¡movie ¡bias ¡Eme ¡effects ¡and ¡linear ¡user ¡bias ¡ Eme ¡effects. ¡ ¡ • Movies ¡were ¡binned ¡by ¡date ¡and ¡each ¡bin ¡had ¡its ¡own ¡parameter. ¡ • DeviaEon ¡funcEon ¡over ¡a ¡user’s ¡raEng ¡date ¡from ¡the ¡user’s ¡mean ¡ date. ¡ ¡ • Each ¡user ¡had ¡an ¡alpha ¡parameter ¡scaling ¡the ¡deviaEon. ¡ ¡ • Performance ¡– ¡6.75% ¡above ¡water ¡ • Model ¡T) ¡Added ¡constant ¡bias ¡terms ¡for ¡every ¡date, ¡for ¡every ¡ user. ¡ ¡ • Performance ¡– ¡7.2% ¡above ¡water ¡

Latent ¡Factor ¡Models ¡ • TimeSVD++ ¡ • Model ¡C) ¡Added ¡ ¡linear ¡user ¡factor ¡Eme ¡effects ¡to ¡user ¡factors. ¡ • Performance ¡– ¡7.05% ¡above ¡water ¡ • Worse ¡than ¡Model ¡B, ¡so ¡did ¡not ¡pursue ¡this. ¡ ¡ • Did ¡not ¡implement ¡full ¡TimeSVD++ ¡(has ¡constant ¡bias ¡terms ¡for ¡ every ¡user ¡factor) ¡due ¡to ¡memory ¡issues. ¡

K ¡Nearest ¡Neighbors ¡(KNN) ¡ • Used ¡Pearson ¡CorrelaEon ¡Coefficients ¡as ¡the ¡similarity ¡metric. ¡ • Used ¡fisher ¡/ ¡inverse ¡fisher ¡transform ¡to ¡get ¡lower ¡bound ¡on ¡ 95% ¡confidence ¡interval ¡on ¡Pearson ¡correlaEon ¡ ¡ • Weight ¡for ¡movie ¡m2 ¡as ¡a ¡neighbor ¡of ¡movie ¡m1 ¡is ¡simply ¡ their ¡correlaEon ¡squared ¡Emes ¡the ¡logarithm ¡of ¡the ¡number ¡ of ¡common ¡viewers. ¡ ¡ ¡ • Pure ¡kNN ¡model ¡predicEon ¡-‑ ¡a ¡weighted ¡average ¡over ¡raEngs ¡ of ¡the ¡K ¡neighbors ¡with ¡highest ¡weights. ¡ ¡ ¡ • Performance ¡– ¡roughly ¡1% ¡above ¡water. ¡

KNN ¡ • The ¡best ¡use ¡of ¡KNN ¡was ¡to ¡feed ¡it ¡the ¡residuals ¡of ¡another ¡ model ¡in ¡order ¡to ¡generate ¡modified ¡qual ¡predicEons. ¡ ¡ • Generally ¡improved ¡performance ¡by ¡about ¡0.3% ¡for ¡most ¡ models. ¡ ¡

Restricted ¡Boltzmann ¡Machines ¡(RBM) ¡ • Used ¡graphchi ¡package ¡to ¡run ¡RBM. ¡ ¡ ¡ • Performance ¡– ¡2.5% ¡above ¡water ¡aoer ¡some ¡fine ¡tuning ¡of ¡ parameters. ¡ ¡ • Wanted ¡to ¡apply ¡kNN ¡to ¡residuals ¡of ¡RBM ¡but ¡couldn’t ¡find ¡ out ¡how ¡to ¡get ¡residuals ¡out ¡of ¡graphchi. ¡

Blending ¡ • First ¡blended ¡on ¡probe. ¡ • Worked ¡well, ¡but ¡wanted ¡to ¡train ¡our ¡models ¡on ¡probe. ¡ ¡ • Moved ¡to ¡blending ¡on ¡quiz. ¡ • Allowed ¡us ¡to ¡use ¡probe ¡in ¡training, ¡but ¡added ¡danger ¡of ¡ overfiQng ¡to ¡quiz ¡set. ¡ ¡ • We ¡also ¡tried ¡clipping ¡values ¡below ¡1 ¡and ¡above ¡5 ¡before ¡and ¡ aoer ¡blending ¡models ¡– ¡worked ¡well. ¡ ¡ • Also ¡added ¡constant ¡model ¡which ¡consisted ¡of ¡quiz ¡avg ¡ raEngs. ¡ ¡

Blending ¡ • RBMs ¡blended ¡well ¡with ¡latent ¡factor ¡ ¡ ¡ ¡ ¡ ¡models ¡despite ¡bad ¡performance. ¡ • kNN ¡blended ¡poorly ¡ L ¡ ¡ • However, ¡kNN ¡on ¡residuals ¡of ¡SVD, ¡SVD++, ¡EmeSVD++ ¡all ¡ contributed ¡nicely ¡to ¡the ¡blend. ¡ ¡ • Ironically, ¡blending ¡in ¡our ¡incorrect ¡SVD++ ¡implementaEon ¡ helped ¡our ¡score. ¡ ¡

The ¡Mystery ¡Machine ¡ ¡ • Constant ¡Average ¡Model ¡ • 3 ¡SVD ¡models ¡ • 8 ¡SVD++ ¡models ¡(some ¡were ¡incorrect) ¡ • 3 ¡EmeSVD++ ¡models ¡ • 4 ¡kNN ¡on ¡SVD++ ¡models ¡ • 5 ¡kNN ¡on ¡EmeSVD++ ¡models ¡ • 1 ¡ALS ¡model ¡(alternaEng ¡least ¡squares) ¡ • 6 ¡RBM ¡models ¡ • 2 ¡pure ¡kNN ¡models ¡ • Total ¡– ¡32 ¡models ¡ ¡

The ¡Mystery ¡Machine ¡ • We ¡clipped ¡each ¡model ¡before ¡we ¡added ¡it ¡to ¡the ¡blend ¡ • We ¡used ¡the ¡RMSE ¡of ¡each ¡model ¡pre-‑clipping, ¡however ¡ • The ¡model ¡which ¡was ¡simply ¡just ¡a ¡constant ¡guess ¡for ¡all ¡ tuples ¡improved ¡the ¡blend ¡surprisingly ¡well, ¡despite ¡being ¡a ¡ bad ¡standalone ¡model ¡

What ¡We ¡Learned ¡ • ImplemenEng ¡algorithms ¡from ¡papers. ¡ • Machine ¡Learning ¡on ¡large ¡data ¡sets. ¡ • OpEmizing ¡code ¡to ¡speed ¡up ¡performance. ¡ • Memory ¡used ¡(available ¡RAM, ¡cache ¡performance ¡and ¡locality) ¡ • Run ¡Eme ¡and ¡smart ¡ways ¡to ¡group ¡computaEons. ¡ • Thinking ¡conservaEvely ¡about ¡the ¡limitaEons ¡of ¡hardware ¡and ¡what ¡ is ¡absolutely ¡needed ¡for ¡the ¡algorithm. ¡ ¡ • Lesson ¡in ¡debugging. ¡ ¡ • Things ¡may ¡appear ¡to ¡be ¡working, ¡but ¡they ¡might ¡not ¡actually ¡be ¡ working. ¡ ¡ • SVD++ ¡-‑ ¡Thought ¡it ¡was ¡working ¡(doing ¡beser ¡than ¡SVD), ¡then ¡talked ¡ to ¡other ¡groups ¡and ¡realized ¡that ¡ours ¡was ¡doing ¡much ¡worse ¡than ¡it ¡ should. ¡ • 2 ¡hours ¡of ¡intense ¡debugging ¡later, ¡found ¡the ¡problem. ¡ ¡

Thank ¡you ¡ Professor ¡Yaser ¡Abu-‑Mostafa ¡ The ¡TA’s ¡ ¡Especially ¡Joseph ¡Choi ¡ Yehuda ¡Koren ¡and ¡the ¡Neulix ¡Forums ¡ Avi, ¡Nand ¡and ¡Sarthak ¡of ¡3 ¡Shades ¡of ¡Brown ¡ ¡ ¡

Chocolate Chip Pancakes Ajay Mandlekar Anand Poozhikunnel - PowerPoint PPT Presentation

Chocolate Chip Pancakes Ajay Mandlekar Anand Poozhikunnel Rahul Bachal What we Tried SVD SVD++ TimeSVD++ KNN KNN on Residuals of SVDs

Is Chocolate a Personality Is Chocolate a Personality Predictor? Predictor? Susan C. Sharpe,

FOR CHOCOLATE MILK By:Branihya WHAT CHOCOLATE MILK HAS Saturated fat of 0% Trans fat 0%

Chocolate is our business, finest chocolate We are never satisfied! We continue to that can be

Chocolate. Recipe from Antonio Colmenero de Ledesma, Chocolate, or An Indian drink ...

Calibration des Microroc (II) Alex, Cyril, Giom, Jean, Max 09 Mai 2011, Annecy 1 Reminder 2

15-252 More Great Ideas in Theoretical Computer Science Lecture 1: Sorting Pancakes September

15-252 More Great Ideas in Theoretical Computer Science Lecture 1: Sorting Pancakes January

Dynamic Memory Allocation CS 3410 Computer System Organization & Programming Note: these

Chocolate & Confectionery Fats Market in scope Chocolate USD billion 1 0 7 MT million 7 .5

CHOCOLATE Chocolate is made from the fermented and roasted ground beans of Theobroma cacao.

1 Brief History and Introduction of Ghirardelli Chocolate Company 2 Brief History of

CHOCOLATE PLANTS Introducing MAZZETTI RENATO SRL The laboratory refining machines WAFA 20 use

Mysteries of Conching Revealed Carly Meck Blommer Chocolate START SLIDESHOW > Conching is

Current & Future Trends influencing the Chocolate Industry Chocoa - The Chocolate Makers Forum

April 1, 2014 Who We Are: The Super Premium Brand at the Right Price The Chocolate for Experts

DIWALI GIFTING PRICE RANGE RS. 500-750 PER UNIT LE PURE CHOCOLATE 15 chocolate box Offer

C O M P A N Y P R O F I L E 2020 The company begins exporting its products and is soon a regular

Design and Designing Chocolate Packaging Learning Objective To understand how colour can

Joseph Barnes, Ben Jenkins, Montana Wells Mission Statement Partnering with others to help

About Me This is Barry Callebaut Shaping the world of chocolate and cocoa 2 5/1/2019 From bean

Godiva Case: Should Godiva launch a new category? Yinping Hu Sam Roux Pedro Varo Chocolate

COMPANY- PRESENTATION January 2016 KEY FIGURES BARONIE GROUP Turnover approx . 565

Assessment By Hannah Clayton What we have to measure Exercise 1 To measure different types of

A Parent Workshop Third Grade Team Creek View Elementary School February 26, 2020 Georgia

Chocolate Chip Pancakes Ajay Mandlekar Anand Poozhikunnel - PowerPoint PPT Presentation

Chocolate Chip Pancakes Ajay Mandlekar Anand Poozhikunnel Rahul Bachal What we Tried SVD SVD++ TimeSVD++ KNN KNN on Residuals of SVDs

Is Chocolate a Personality Is Chocolate a Personality Predictor? Predictor? Susan C. Sharpe,

FOR CHOCOLATE MILK By:Branihya WHAT CHOCOLATE MILK HAS Saturated fat of 0% Trans fat 0%

Chocolate is our business, finest chocolate We are never satisfied! We continue to that can be

Chocolate. Recipe from Antonio Colmenero de Ledesma, Chocolate, or An Indian drink ...

Calibration des Microroc (II) Alex, Cyril, Giom, Jean, Max 09 Mai 2011, Annecy 1 Reminder 2

15-252 More Great Ideas in Theoretical Computer Science Lecture 1: Sorting Pancakes September

15-252 More Great Ideas in Theoretical Computer Science Lecture 1: Sorting Pancakes January

Dynamic Memory Allocation CS 3410 Computer System Organization &amp; Programming Note: these

Chocolate &amp; Confectionery Fats Market in scope Chocolate USD billion 1 0 7 MT million 7 .5

CHOCOLATE Chocolate is made from the fermented and roasted ground beans of Theobroma cacao.

1 Brief History and Introduction of Ghirardelli Chocolate Company 2 Brief History of

CHOCOLATE PLANTS Introducing MAZZETTI RENATO SRL The laboratory refining machines WAFA 20 use

Mysteries of Conching Revealed Carly Meck Blommer Chocolate START SLIDESHOW &gt; Conching is

Current &amp; Future Trends influencing the Chocolate Industry Chocoa - The Chocolate Makers Forum

April 1, 2014 Who We Are: The Super Premium Brand at the Right Price The Chocolate for Experts

DIWALI GIFTING PRICE RANGE RS. 500-750 PER UNIT LE PURE CHOCOLATE 15 chocolate box Offer

C O M P A N Y P R O F I L E 2020 The company begins exporting its products and is soon a regular

Design and Designing Chocolate Packaging Learning Objective To understand how colour can

Joseph Barnes, Ben Jenkins, Montana Wells Mission Statement Partnering with others to help

About Me This is Barry Callebaut Shaping the world of chocolate and cocoa 2 5/1/2019 From bean

Godiva Case: Should Godiva launch a new category? Yinping Hu Sam Roux Pedro Varo Chocolate

COMPANY- PRESENTATION January 2016 KEY FIGURES BARONIE GROUP Turnover approx . 565

Assessment By Hannah Clayton What we have to measure Exercise 1 To measure different types of

A Parent Workshop Third Grade Team Creek View Elementary School February 26, 2020 Georgia

Dynamic Memory Allocation CS 3410 Computer System Organization & Programming Note: these

Chocolate & Confectionery Fats Market in scope Chocolate USD billion 1 0 7 MT million 7 .5

Mysteries of Conching Revealed Carly Meck Blommer Chocolate START SLIDESHOW > Conching is

Current & Future Trends influencing the Chocolate Industry Chocoa - The Chocolate Makers Forum