Forecasting sales for the TOP 5 selling SKUs
Term 5: Forecasting Analytics Presented To: Prof. Galit Shmueli and Prof. Mayukh Dass Presented By: Arka Sarkar Kushal Paliwal Malvika Gaur Shwaitang Singh
Forecasting sales for the TOP 5 selling SKUs Term 5: Forecasting - - PowerPoint PPT Presentation
Forecasting sales for the TOP 5 selling SKUs Term 5: Forecasting Analytics Presented To: Prof. Galit Shmueli and Prof. Mayukh Dass Presented By: Arka Sarkar Kushal Paliwal Malvika Gaur Shwaitang Singh Business Goal Business
Term 5: Forecasting Analytics Presented To: Prof. Galit Shmueli and Prof. Mayukh Dass Presented By: Arka Sarkar Kushal Paliwal Malvika Gaur Shwaitang Singh
INR 0.9 million INR 0.8 million INR 0.78 million INR 0.58 million INR 0.53 million
SKU: 100004925 Total Sales (2011 & 2012): INR 1.4 Million Sells 10 times more than 2 ltr. Jar, and 5 times more than 1 ltr. Pouch
Sales predominantly
Peak and Outliers
Quantity bought per transaction Number of such transactions 3 664 6 11 9 3 15 1 Grand Total 679 Average Quantity bought per day: 7.35 Most occurring purchase size: 3 units Although not in this case, we’ve found a ‘bulk buyer’ who shops sporadically for other SKUs Replaced with most
Missing Values
Replaced with zero in the dataset. Represent no sale on that particular day
Check to see if the data can actually be predicted Tested for all SKUs Results: Slope coefficient of AR(1) models significantly (more than 3 standard deviations away) different from 1 – hence they do not follow a random walk
Used the Naïve Forecast as a performance benchmark RMSE 13.81094 MAPE 150.20%
Does the data exhibit level, trend and seasonality? Can seasonality be captured by dummy variables? Model Choices: Multi layered model Smoothing model Did not consider so far: Neural Network approach
Yes, a weekly seasonality is exhibited as demonstrated by the ACF plot Seasonal Naïve Forecast is an improvement over the naïve forecast RMSE 11.31513 MAPE 89.70%
Created using 6 dummy variables to account for weekly seasonality Training: August 1st, 2011 to July 24th, 2012 Validation (1 week): July 25th, 2012 to July 31st, 2012 Test (1 month): August 1st, 2012 to August 31st, 2012
RMSE 5.49633 MAPE 77.07% RMSE 11.31513 MAPE 89.70%
Seasonal Naïve Forecast Multiple Linear Regression Forecast
There appears to be some signal (lag(4)) in the residuals; we remodel using an AR model
10 20 30 Residual Row Id.
Time Plot of Actual Vs Forecast (Training Data)
Actual Forecast
RMSE 5.262024111 MAPE 71.66%
alpha beta gamma RMSE MAPE 0.2 0.15 0.3 6.666152 87.56% 0.2 0.15 0.5 6.692811 81.70% 0.2 0.15 0.7 6.360498 78.15% 0.2 0.15 0.9 5.827601 80.04%
5 10 15 20 25 30 Quantity Sold Dayindex
Time Plot of Actual Vs Forecast (Training Data)
Actual Forecast
Actual vs. Forecast for Holt Winters Various scenarios tried and the results:
RMSE 13.81094 MAPE 150.20% RMSE 11.31513 MAPE 89.70% RMSE 5.49633 MAPE 77.07%
RMSE 5.2620 MAPE 71.66%
Naïve Forecast Naïve Seasonal Forecast Multiple Linear Regression Multiple Linear Regression with error prediction
alpha beta gamma RMSE MAPE 0.2 0.15 0.3 6.666152 87.56% 0.2 0.15 0.5 6.692811 81.70% 0.2 0.15 0.7 6.360498 78.15% 0.2 0.15 0.9 5.827601 80.04%
Holt Winters
10 20 30 40 366 368 370 372 374 376 Predicted Value Actual Value Residual
10 20 30 40 50 366 367 368 369 370 371 372 373 374 Predicted Value Actual Value Residual
RMSE MAPE 14.4117 52.50% RMSE MAPE 16.0635 45.22%
RMSE MAPE 9.4568 104.19% RMSE MAPE 8.4231 36.93%
10 20 30 40 50 60 366 367 368 369 370 371 372 373 374 Predicted Value Actual Value Residual
10 20 30 40 50 366 367 368 369 370 371 372 373 374 Predicted Value Actual Value Residual
Using the holiday calendar in sync with existing data Use econometric models: incorporate the effects of price changes and discounts, competitive brand pricing Model behavior of customers: predict/forecast repeat purchases, bulk purchases etc.