business backtesting of ml models
play

Business Backtesting of ML Models: A Case Study in Real Estate QCon - PowerPoint PPT Presentation

Business Backtesting of ML Models: A Case Study in Real Estate QCon New York June 2017 Nelson Ray Who has run an A/B test before? Did it go off without a hitch? Unguided A/B Testing Focus of this Talk Observational Analysis Simulation-Based


  1. Business Backtesting of ML Models: A Case Study in Real Estate QCon New York June 2017 Nelson Ray

  2. Who has run an A/B test before?

  3. Did it go off without a hitch?

  4. Unguided A/B Testing

  5. Focus of this Talk Observational Analysis Simulation-Based Inference Confidence Cost Quasi-experiments A/B Test

  6. Talk Structure • Real Estate 101 for Home Buyers and Sellers • The Opendoor Way • Resale Risk • Problems with A/B Testing • How Simulation Helped in Real Estate • A General Recipe for Simulation • Team Info.

  7. Introduction Hi, I’m Nelson! A/B Testing Support Group I’m a recovering A/B testing user from such places as… • Facebook • Metamarkets • Google • Opendoor I’ll cover how to perform a business backtest of your ML models using simulations!

  8. M I S S I O N Empower everyone with the freedom to move $25T of assets 63.5% of Americans are homeowners #1 consumer expenditure ($17,798/yr) $1.4T of annual transaction volume $100B in fees

  9. C U R R E N T P R O C E S S S E L L E R S B U Y E R S 100+ day process with 14% failure rate 100+ day process with friction at each step Seller Realtor Sale Ready List Contract Closing Contract Search Realtor Discovery Buyer Research online Decide to move Improvements MLS, Zillow, Trulia O ff er Final walkthrough O ff er Open houses Research online Finances Decides to move Receive bids Yard work Open houses Counter, Acceptance O ffl ine signatures Inspection Showings Receive bids Location Cleaning Showings Inspection Title records Financing Viewings Interview Timing Interview Photographs Maintenance Financing period Choose Choose DAY 121 DAY 0 DAY 0 $1000’s in 90+ Days 4-5% in price Fears of the Months of Months of upfront costs drops and home condition viewing research and 14% of deals fall- concessions and financing suboptimal gathering through listings data 6-7% in fees 5.5M Americans per year buy and sell through this process

  10. C U R R E N T P R O C E S S S E L L E R S B U Y E R S 100+ day process with 14% failure rate 100+ day process with friction at each step Seller Realtor Sale Ready List Contract Closing Contract Search Realtor Discovery Buyer Research online Decide to move Improvements MLS, Zillow, Trulia O ff er Final walkthrough O ff er Open houses Research online Finances Decides to move Receive bids Yard work Open houses Counter, Acceptance O ffl ine signatures Inspection Showings Receive bids Location Cleaning Showings Inspection Title records Financing Viewings Interview Timing Interview Photographs Maintenance Financing period Choose Choose DAY 121 DAY 0 DAY 0 $1000’s in 90+ Days 4-5% in price Fears of the Months of Months of upfront costs drops and home condition viewing research and 14% of deals fall- concessions and financing suboptimal gathering through listings data 6-7% in fees 5.5M Americans per year buy and sell through this process

  11. S E L L E R S Fill out a short home profile to ensure we can accurately price your home. Simply enter your address to experience an automated, hassle-free sales process. And receive an offer in minutes with a full report of your home’s value. Selling your home is as easy as clicking next

  12. C U R R E N T P R O C E S S S E L L E R S B U Y E R S 100+ day process with 14% failure rate 100+ day process with friction at each step Seller Realtor Sale Ready List Contract Closing Contract Search Realtor Discovery Buyer Research online Decide to move Improvements MLS, Zillow, Trulia O ff er Final walkthrough O ff er Open houses Research online Finances Decides to move Receive bids Yard work Open houses Counter, Acceptance O ffl ine signatures Inspection Showings Receive bids Location Cleaning Showings Inspection Title records Financing Viewings Interview Timing Interview Photographs Maintenance Financing period Choose Choose DAY 121 DAY 0 DAY 0 $1000’s in 90+ Days 4-5% in price Fears of the Months of <50% Months of upfront costs drops and home condition viewing have a research and 14% of deals fall- concessions and financing suboptimal bachelor’s gathering through listings degree data 6-7% in fees 5.5M Americans per year buy and sell through this process

  13. B U Y E R S Thousands of buyers shop with us monthly Searching and showings are self-service, on-demand Our buyers have exclusive access to our inventory All homes come with a money-back guarantee and a 2-year warranty Buying a home is as easy as clicking next

  14. What is our risk in reselling a home?

  15. Home 1 • Listed ~$800k • 6+ months on market

  16. Home 2 • Listed ~$300k • 1 month on market

  17. Our Philosophy • Opendoor bears risk in reselling the house • Costs vary substantially by house • Fair to each seller to charge based on their expected cost

  18. Framing the problem

  19. House Economics Conversion Profit Fee Fee

  20. Formalization • Infinite number of pricing models • Assuming we even had a candidate f’, how do we test this? • A/B testing approach • randomize on offers: f vs f’ • evaluate {# of houses, profit}

  21. Metric Measurement Lag • Time to observe # • days • Time to observe $ • months

  22. Formalization • Infinite number of pricing models • Assuming we even had a candidate f’, how do we test this? • A/B testing approach • randomize on offers: f vs f’ • evaluate {# of houses, profit} • Many months of measurement lag

  23. Simulating O ff ers • Historical transaction data • House lists on the market • Simulate our buying process • Estimate our costs • Observe actual outcome for house

  24. Simulating O ff ers Actual resale cost: $50k Actual resale cost: $10k Expected resale costs Expected resale costs • f under : $10k -> {P accept : .9, $: -40k} • f under : $5k -> {P accept : .9, $: -5k} • f base : $55k -> {P accept : .1, $: 5k} • f base : $8k -> {P accept : .7, $: -2k}

  25. Simulating O ff ers $ f base # f under

  26. Simulating O ff ers $ f 2base f base # f under

  27. Simulating O ff ers $ f 2base f base f 3base # f under

  28. Simulating O ff ers $ f 4base f 2base f base f 3base # f under

  29. Simulating O ff ers $ f 4base f 2base f base f 3base # f under

  30. Simulating O ff ers $ f 4base f 2base f base f 3base # f under

  31. Simulating O ff ers $ $ f 4base f 2base f base f base f 3base # # f under f under

  32. Simulating O ff ers $ $ f 4base f 2base f base f base f 3base # # f under f under

  33. Understanding Current Trade-O ff s • Clarity into trade-offs $ $ • Identify suitable candidates • Backtesting with business metrics • Seconds vs months f 4base • Only cost is computational f 2base f base f base • Though quality dependent f 3base on simulation models # # f under f under

  34. Estimating Future Trade-O ff s $ f 4base f 2base f base f 3base # f under

  35. Estimating Future Trade-O ff s $ C desired f 4base f 2base f base f 3base # f under

  36. Estimating Future Trade-O ff s $ C oracle C desired f 4base f 2base f base f 3base # f under

  37. Oracle Performance Actual resale cost: $50k Actual resale cost: $10k Expected resale costs Expected resale costs • f under : $10k -> {P accept : .9, $: -40k} • f under : $5k -> {P accept : .9, $: -5k} • f base : $55k -> {P accept : .1, $: 5k} • f base : $8k -> {P accept : .7, $: -2k} • f oracle : $50k -> {P accept : .15, $: 0k} • f oracle : $10k -> {P accept : .65, $: 0k}

  38. Estimating Future Trade-O ff s $ C oracle C desired f 4base f 2base f base f 3base # f under

  39. Estimating Future Trade-O ff s $ C oracle C desired C Bayes f 4base f 2base f base f 3base # f under

  40. Estimating Future Trade-O ff s $ • Estimate what is theoretically achievable C oracle C 20% C Bayes • Set ML improvement goal • Translation into business trade-offs • “Easy” part is to hit ML target f 4base f 2base f base f 3base # f under

  41. Simulation Accuracy

  42. Unguided A/B Testing

  43. The Guide

  44. Guided A/B Testing

  45. Pyramid of Causal Inference Observational Analysis Simulation-Based Inference Confidence Cost Quasi-experiments A/B Test

  46. Recipe for guided testing

  47. Simulating O ff ers • Historical transaction data Data generating process • House lists on the market • Simulate our buying process User model • Estimate our costs • Observe actual outcome for house

  48. Recipe: Data Generating Process Simple version: replay historical data • Home buying and selling • Past housing transactions • Ridesharing services • Passenger app sessions • Search engine ad auctions • Stock of potential ads

  49. Recipe: User Model • Home buying and selling • P(sell | cost) • Ridesharing services • P(accept ride | price, ETA) • Search engine ad auctions • P(click | user features, ad features)

  50. A/B test responsibly

  51. Simulate before testing

  52. Opendoor by the numbers • Founded: March 2014 • Transactions / month: 500 • Number of employees: 300 • 50 data scientists and engineers • We’re hiring! • E-mail: nelson@opendoor.com

  53. Title Text Acknowledgements Title Text

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend