HOUSING PRICE PREDICTION An Nguyen Advisors: Chris Fernandes, Nick - - PowerPoint PPT Presentation

housing price prediction
SMART_READER_LITE
LIVE PREVIEW

HOUSING PRICE PREDICTION An Nguyen Advisors: Chris Fernandes, Nick - - PowerPoint PPT Presentation

HOUSING PRICE PREDICTION An Nguyen Advisors: Chris Fernandes, Nick Webb & Harlan Holt 0. BACKGROUND Sold Price: $160,000 Features: Zestimate: Beds $184,777 Baths Size 0. BACKGROUND 26.2% market share 9.6% market share 3.5% market


slide-1
SLIDE 1

HOUSING PRICE PREDICTION

An Nguyen

Advisors: Chris Fernandes, Nick Webb & Harlan Holt

slide-2
SLIDE 2
  • 0. BACKGROUND

Features: Beds Baths Size Sold Price: $160,000 Zestimate: $184,777

slide-3
SLIDE 3
  • 0. BACKGROUND
  • 26.2% market share
  • 110 million houses
  • Zestimate
  • 9.6% market share
  • 3.5% market share
slide-4
SLIDE 4
  • 1. PROBLEMS
  • Zillow correctly estimates ~50% of their houses within 5% range of the

actual sold price

slide-5
SLIDE 5
  • 2. QUESTIONS
  • Can I get close to or beat the Zestimate?
slide-6
SLIDE 6
  • 1. PROBLEMS
  • Zillow correctly estimates ~50% of their houses within 5% range of the
  • Zillow tends to overestimate their properties
slide-7
SLIDE 7
  • 2. QUESTIONS
  • Can I get close to or beat the Zestimate?
  • Can my models get rid of the overestimation problem?
slide-8
SLIDE 8
  • 1. PROBLEMS
  • Zillow correctly estimates ~50% of their houses within 5% range of the
  • Zillow tends to overestimate their properties
  • Do we need a lot of attributes to have a good prediction for house price?
slide-9
SLIDE 9
  • 2. QUESTIONS
  • Can I get close to or beat the Zestimate?
  • Can my models get rid of the overestimation problem?
  • Most important attributes?
slide-10
SLIDE 10
  • 2. QUESTIONS
  • Can I get close to or beat the Zestimate?
  • Can my models get rid of the overestimation problem?
  • Most important attributes?
slide-11
SLIDE 11
  • 2. DATA COLLECTION

Cowlitz, WA Montgomery, IL Cayuga, NY Hunt, TX Upson, GA 29.3% 19.8% 8.7% 16.7% 10.7%

slide-12
SLIDE 12
  • 2. DATA COLLECTION

Cowlitz, WA Montgomery, IL Cayuga, NY Hunt, TX Upson, GA 354 houses 195 houses 209 houses 399 houses 310 houses

slide-13
SLIDE 13
  • 2. DATA COLLECTION
  • Sources: Zillow, Trulia, and Redfin
  • Tools: Python, Selenium, and VBA
  • Attributes:

○ Internal Factors: Beds, Baths, Size, Appliances, Garage, etc. ○ External Factors: Tax Info, School Info, Walkability, Nearby Lifestyle Amenities, Comparable Houses’ Sold Prices

slide-14
SLIDE 14

Redfin Zillow Trulia

$325,000 $325,000 $233,427

slide-15
SLIDE 15
  • 3. MODELS
  • Linear Regression (Baseline model):

○ Frequently used in Economics paper

  • Support Vector Regression (SVR):

○ Good at finding signals and ignoring noises

  • Random Forest (RF):

○ Good for datasets with missing values

slide-16
SLIDE 16
  • 3. RESULTS
slide-17
SLIDE 17
  • 3. RESULTS

UPSON Bed Bath Dishwasher Assessment Lot Date Built Walk Score . . . COWLITZ Bed Bath Asphalt Roof Assessment Elementary- School Score Size . . . HUNT Size Date Built Tax Amount Assessment Hardwood- Floor Walk Score . . . MONTGOMERY Bed Bath Dishwasher Assessment Size Date Built Walk Score . . . CAYUGA Size Tax Amount Dishwasher Assessment Last Remodel-Year Walk Score . . .

slide-18
SLIDE 18
  • 3. RESULTS

SAME ATTRIBUTE: 1. Bed 2. Bath 3. Dishwasher 4. Size 5. Tax Amount 6. Walk Score 7. Price Listed 8. Date Built 9. Assessment 10. Comparables’ Sold Price

slide-19
SLIDE 19
  • 3. RESULTS
slide-20
SLIDE 20
  • 3. RESULTS

Overestimated : Underestimated Ratio = 3:2 Zillow Overestimated : Underestimated Ratio = 1:1 My Predictor

slide-21
SLIDE 21
  • 3. RESULTS
  • Weights of Important Attributes Across 5 Counties:

○ $1 increase in Tax Assessment increases Sold Price by 54 cents ○ $1 increase in Comparables’ Sold Price increases Sold Price by 34 cents ○ $1 increase in Price Listed increases Sold Price by 38 cents ○ 1 more Bathroom increases Sold Price by $15,787

slide-22
SLIDE 22
  • 4. CONCLUSION
  • Can I get close to or beat the Zestimate?

○ Beat Hunt’s accuracy score and come close to Cowlitz’s and Upson’s.

  • Can my models get rid of the overestimation problem?

○ Reduce the overestimated to underestimated ratio from 3:2 to 1:1

  • Most important attributes?

○ Tax Assessment, Comparables’ Sold Price, Price Listed, and Num of Bathrooms.

slide-23
SLIDE 23

THANK YOU FOR LISTENING!