analytics for an online retailer
play

Analytics for an Online Retailer: Demand Forecasting and Price - PowerPoint PPT Presentation

Analytics for an Online Retailer: Demand Forecasting and Price Optimization Kris Johnson MIT, Operations Research Center Alex Lee MIT, Systems Design & Management Murali Narayanaswamy Rue La La, VP Pricing & Operations Strategy


  1. Analytics for an Online Retailer: Demand Forecasting and Price Optimization Kris Johnson – MIT, Operations Research Center Alex Lee – MIT, Systems Design & Management Murali Narayanaswamy – Rue La La, VP Pricing & Operations Strategy Philip Roizin – Rue La La, Chief Financial Officer David Simchi-Levi – MIT, Operations Research Center Jonathan Waggoner – Rue La La, Chief Operating Officer

  2. Online Retailing: Online Fashion Sample Sales Industry • Offers extremely limited- time discounts (“flash sales”) on designer apparel & accessories • Emerged in mid-2000s and has had nearly 50% annual growth in last 5 years • Key players – Rue La La (US) – Gilt Groupe (US) – Markafoni (Turkish) – Trendyol (Turkish) Page 2

  3. Snapshot of Rue La La’s Website Page 3

  4. “Style” Page 4

  5. “SKU” Page 5

  6. Flash Sales Operations Merchants purchase items from designers Designers First event that style is ship items to sold = “1 st exposure” warehouse* Merchants decide when to sell items (create “event”) During event, Sell out Yes customers of item? purchase items No End *Sometimes designer will hold inventory

  7. https://www.youtube.com/watch?v=ahOHAsECeIw&feature=youtu.be

  8. 1st Exposure Sell-Through Distribution 70% 60% 50% Department 1 % of Items Department 2 40% Department 3 30% Department 4 Department 5 20% 10% 0% 0%-25% 25%-50% 50%-75% 75%-100% SOLD OUT (100%) % Inventory Sold (Sell-Through) *Data disguised to protect confidentiality

  9. Approach Goal: Maximize expected revenue from 1 st exposure styles Demand Forecasting Price Optimization Challenges: Challenges:   Predicting demand for Structure of demand forecast  items that have never Demand of each style is been sold before dependent on price of  competing styles  Estimating lost sales exponential # variables Techniques: Techniques:   Clustering Novel reformulation of price  Machine learning models optimization problem  for regression Creation of efficient algorithm to solve daily Page 9

  10. Example Sales Curve for an Item that Doesn’t Sell Out (sales < inventory) 100% 90% Percent of Total Sales 80% 70% 60% 50% demand = actual sales 40% 30% 20% 10% 0% 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 Hours Into Event Page 10

  11. Example Sales Curve for an Item that Does Sell Out (sales = inventory) 100% 90% stock out 10 hours into event Percent of Total Sales 80% 70% 60% demand = actual sales + 50% estimated lost sales during 40% period after stock out 30% 20% 10% 0% 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 Hours Into Event Page 11

  12. Estimating Lost Sales • Use data from items that did not stock out to predict lost sales of items that did stock out • For each event length… – Aggregate hourly sales given set of characteristics, i.e. event start time of day – Create sales curve for each set of characteristics • Results in hundreds of sales curves • Use clustering to help further aggregate Page 12

  13. Example Clustering Results: Demand Curves for 2-Day Events 8PM event with 100 units inventory sells out after 5 hours Page 13

  14. Forecasting Model: Explanatory Variables Included Each input is calculated for a unique {style, event} pair. Page 14

  15. Forecasting Model Approach • Separate data by department; for each department… – Randomly divide into training & testing data sets – Apply several machine learning techniques to training data • Linear regression • Power regression • Semi-logarithmic regression • Regression trees – Use cross-validation to choose best model Page 15

  16. Regression Tree – Illustration If condition is true, move left; otherwise, move right Demand prediction Page 16

  17. Approach Goal: Maximize expected revenue from 1 st exposure styles Demand Forecasting Price Optimization Challenges: Challenges:   Predicting demand for Structure of demand forecast  items that have never Demand of each style is been sold before dependent on price of  competing styles  Estimating lost sales exponential # variables Techniques: Techniques:   Clustering Novel reformulation of price  Machine learning models optimization problem  for regression Creation of efficient algorithm to solve daily Page 17

  18. Complexity • Three of the features used to predict demand are associated with pricing – Price – % Discount = – Relative Price of Competing Styles = • Pricing must be optimized concurrently for all competing styles Page 18

  19. Key Observation • Demand depends only on average price of competing styles • Let N = # competing styles (to be priced concurrently), and let k = the sum of prices of all styles – Average price = – Relative price of competing styles = • Finite set of possible prices – Prices must end in $4.90 or $9.90 – Consists of lower bound, upper bound, and every increment of $5.00 between the bounds – Ex: {$24.90, $29.90, $34.90, $39.90} Page 19

  20. Key Idea for Algorithm • Formulate integer optimization problem for each value of k, (IP k ) Maximize Revenue 1) Each style must be assigned exactly one price s.t. 2) Sum of prices of all styles must = k • Can show that optimal objective of (IP k ) and its linear relaxation only differ by the revenue associated with a single style! – Independent of problem size • Use this to develop efficient algorithm to solve on daily basis Page 20

  21. IMPLEMENTATION & IMPACT

  22. Pricing Decision Support Tool Rue La La Enterprise Resource Planning System Retail Price Optimizer Statistics Transact- Products Tool - R ions Impending Event Data ETL Rue La La Process Database Events Planning Regression Optimizer Tree Database Prediction (Rscript) Reports and Visualization Inventory R Optimizer Information Predictions Database Query / Drill Down Visualizer Optimal Price Inventory- Ad hoc Reports Optimization LP Bound Recommendat Constrained Input Algorithm -ions Demand Prediction Standard Reports LP_Solve API-based Optimizer

  23. https://www.youtube.com/watch?v=lc4wV6O_YDA&feature=youtu.be

  24. Live Tests • Motivated by historical analysis – Suggests model recommended price increases will increase revenue by ~10% with little to no impact on demand • Set lower bound on price = merchant suggested price – Model only recommends price increases (or no change) • Identified ~1,300 event-subclass combinations where tool recommended price increases for at least one style Page 24

  25. Live Tests lowest highest 1,300 Event-Subclass Combinations price point price point Category A Category B Category C Category D Category E Treatment Treatment Treatment Treatment Treatment (increase price) (increase price) (increase price) (increase price) (increase price) Control Control Control Control Control (no change) (no change) (no change) (no change) (no change) Page 25

  26. Mann-Whitney / Wilcoxon Rank Sum Test • Hypothesis test that assumes no particular distributional form on treatment or control groups – H 0 : raising prices has no effect on sell-through – H A : raising prices decreases sell-through • Idea of test – Combine sell-through data of treatment and control groups – Order data and assign rank to each observation – Sum ranks of all treatment group observations – If sum is too low, reject H 0 Page 26

  27. Mann-Whitney / Wilcoxon Rank Sum Test 1,300 Event-Subclass Combinations Category A Category B Category C Category D Category E Treatment Treatment Treatment Treatment Treatment (increase price) (increase price) (increase price) (increase price) (increase price) Control Control Control Control Control (no change) (no change) (no change) (no change) (no change) Does not Rejects H 0 Does not Does not Does not α = 1% reject H 0 reject H 0 reject H 0 reject H 0 α = 10% α = 20% α = 20% α = 20% Page 27

  28. Visual Comparison Comparison of Sell-Through: Treatment vs. Control Groups 70% Sell-Through (% Inventory Sold) Control 60% 50% Treatment 40% 30% 20% 10% 0% Category A Category B Category C Category D Category E Page 28

  29. Revenue Impact • Treatment group’s increase in revenue, assuming demand is impacted by price increases as shown on previous slide 16% $70,000 14% $60,000 12% $50,000 Sell-Through 10% % Increase $40,000 8% in Revenue $30,000 6% $ Increase in Revenue 4% $20,000 2% $10,000 0% $- Category A Category B Category C Category D Category E -2% $(10,000) -4% -6% $(20,000) Page 29

  30. https://www.youtube.com/watch?v=AzJhAxkpkEU&feature=youtu.be

  31. Conclusion • Created and implemented pricing decision support tool that recommends prices for 1 st exposure styles – Used clustering to estimate lost sales – Built regression trees to predict demand – Developed efficient algorithm to solve multi-product price optimization problem • Implementation of these analytics techniques shows expected increase in revenue of ~10% with little impact on demand Page 31

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend