Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( - PowerPoint PPT Presentation

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( UW ) Craig Knoblock ( USC ) Alex Yates ( UW ) Rattapoom Tuchinda ( USC )

Price change over time for American Airlines flight #192:223, LAX-BOS, departing on Jan. 2. Etzioni, UW 2

Consumers ’ Dilemma To Buy or Not to Buy…that is the question.. Data mining à Price drops Etzioni, UW 3

Advisor Model 1. Consumer wants to buy a ticket. 2. Hamlet: ‘ buy ’ (this is a good price). 3. Or: ‘ wait ’ (a better price will emerge). 4. Notify consumer when price drops. Etzioni, UW 4

Arbitrage Model 1. “ going price ” is $900. 2. Hamlet anticipates a price of $400. 3. Hamlet offers a $600 fare. 4. Hamlet buys when the price drops to $400. 5. Consumer saves $300; Hamlet earns $200. (of course, Hamlet could lose money!) Etzioni, UW 5

Will Flights sell out? 1. Watch the number of empty seats. 2. Upgrade to business class. 3. Place on another flight and give a free ticket. In our experiment: upgrades were sufficient. Etzioni, UW 6

Is Airfare Prediction Possible??? � Complex “ yield management ” algorithms. - airlines have tons of historical data. � Exogenous events create randomness. How about the stock market? � True markets are unpredictable. � For Hamlet, prices are set by the airlines! Etzioni, UW 7

Surprising Experimental Result Savings: buy immediately versus Hamlet. Optimal: buy at the best possible time. HAMLET ’ s savings were 61.8% of optimal! Though it be madness, yet there be method in it. Etzioni, UW 8

Data Set � Used Fetch.com ’ s data collection infrastructure. � Collected over 12,000 price observations: – Lowest available fare for a one-week roundtrip. – LAX-BOS and SEA-IAD. – 6 airlines including American, United, etc. – 21 days before each flight, every 3 hours. Etzioni, UW 9

Learning Task Formulation Input: price observation data. Algorithm: label observations (decision point); run learner. Output: Classify each decision point à buy versus wait. Etzioni, UW 10

Formulation Fine Points � Want to learn from the latest data. � Run learner nightly to produce a new model. – Learner is trained on data gathered to date. � Learned policy is a sequence of 21 models. � Test set: 8 * 21 decision points for the last 1/3 of the flights. Etzioni, UW 11

Labeling Training Data O now takeoff 11 days 5 days IF price drops between and now THEN label(O)=wait ELSE label(O) à Pr(price will drop between now and takeoff) We estimate Pr based on behavior of past flights. Etzioni, UW 12

Candidate Approaches � Fixed: “ asap ” , 14 days prior, 7 days,… � By hand: an expert looks at the data. � Time series: P F ( P , P ,... P ). = t t 1 t 2 1 − − – Not effective at price jumps! � Reinforcement learning: Q-learning. – Used in computational finance. � Rule learning: Ripper, … Etzioni, UW 13

Ripper • Features include price, airline, route, hours- before-takeoff, etc. • Learned 20-30 rules… IF hours - before - takeoff 252 AND price 2223 ≥ ≥ AND route LAX - BOS THEN wait . = Etzioni, UW 14

Simple Time Series � Predict price using a fixed window of k price observations weighted by α . � We used a linearly increasing function for α k ( i ) p ∑ α t k i − + p i 1 = + = t 1 k ( i ) ∑ α i 1 = Etzioni, UW 15

Q-learning Natural fit to problem ( ) ( ) ( ( ) ) Q a , s R a , s max Q a , s ʹ″ ʹ″ = + γ ⋅ a ʹ″ Q ( b , s ) price ( ) s = − 300000 if flight sells out after s . − ⎧ ( ) Q w , s = ⎨ ( ( ) ( ) ) max Q b , s , Q w , s otherwise. ʹ″ ʹ″ ⎩ Etzioni, UW 16

Hamlet � Stacking with three base learners: 1. Ripper (e.g., R=wait) 2. Time series 3. Q-learning (e.g., Q=buy) � Ripper used as the meta-level learner. � Output: classifies each decision point as ‘ buy ’ or ‘ wait ’ . Etzioni, UW 17

Experimental Results � Real price data; Simulated passengers. – Uniform distribution over decision points. (sensitivity) Requesting specific flights (also 3hr interval). � Learner run once per day on “ past data ” . � Execution: label each purchase point until buy (or sell out). � Compute savings (or loss). Etzioni, UW 18

Savings by Method • Net savings = cost now – cost at purchase point. • Penalty for sell out = upgrade cost. 0.42% of the time. • Total ticket cost is $4,579,600. Net Savings by Method Legend: $350,000 7.0% $300,000 Time Series $250,000 Q-Learning 4.4% $200,000 By Hand 3.8% 3.8% 3.4% $150,000 Ripper $100,000 Hamlet Optimal $50,000 -9.5% $0 Etzioni, UW 19

Sensitivity Analysis � Passenger requests any nonstop flight in a 3 hour interval: Interval Savings Legend: $350,000 7.1% $300,000 Time Series $250,000 Q-Learning 4.2% $200,000 By Hand 3.8% 3.6% 3.3% $150,000 Ripper $100,000 Hamlet $50,000 Optimal -5.7% $0 Etzioni, UW 20

Upgrade Penalty Method Upgrade Cost % Upgrades Optimal $0 0% By hand $22,472 0.36% Ripper $33,340 0.45% Time Series $693,105 33.00% Q-learning $29,444 0.49% Hamlet $38,743 0.42% Etzioni, UW 21

Discussion � 76% of the time --- no savings possible. � Uniform distribution over 21 days. � 33% of the passengers arrived in the last week. � No passengers arrived >21 days before. Simulation understates possible savings! Etzioni, UW 22

Savings on “ Feasible ” Flights Method Net Savings Optimal 30.6% By hand 21.8% Ripper 20.1% Time Series 25.8% Q-learning 21.8% Hamlet 23.8% Comparison of Net Savings (as a percent of total ticket price) on Feasible Flights Etzioni, UW 23

Related Work � Trading agent competition. – Auction strategies � Temporal data mining. � Time Series. � Computational finance. Etzioni, UW 24

Future Work � More tests: international, multi-leg, hotels, etc. � Cost sensitive learning (tried MetaCost). � Additional base learners � Bagging/boosting � Refined predictions � Commercialization: patent, license. Etzioni, UW 25

Conclusions 1. Dynamic pricing is prevalent. 2. Price mining a-la-Hamlet is feasible. 3. Price drops can be surprisingly predictable. 4. Need additional studies and algorithms. 5. Great potential to help consumers! All ’ s well that ends well. Etzioni, UW 26

Savings by Method • Savings over “ buy now ” . • Penalty for sell out = upgrade cost. • Total ticket cost is $4,579,600. Method Savings Losses Upgrade Cost % Upgrades Net Savings % Savings % of Optimal Optimal $320,572 $0 $0 0% $320,572 7.0% 100.0% By hand $228,318 $35,329 $22,472 0.36% $170,517 3.8% 53.2% Ripper $211,031 $4,689 $33,340 0.45% $173,002 3.8% 54.0% Time Series $269,879 $6,138 $693,105 33.00% -$429,364 -9.5% -134.0% Q-learning $228,663 $46,873 $29,444 0.49% $152,364 3.4% 47.5% Hamlet $244,868 $8,051 $38,743 0.42% $198,074 4.4% 61.8% Etzioni, UW 27

Sensitivity Analysis � Passenger requests any nonstop flight in a 3 hour interval: Method Net Savings % of Optimal % upgrades Optimal $323,802 100.0% 0.0% By hand $163,523 55.5% 0.0% Ripper $173,234 53.5% 0.0% Time Series -$262,749 -81.1% 6.3% Q-Learning $149,587 46.2% 0.2% Hamlet $191,647 59.2% 0.1% Etzioni, UW 28

Another Chart Savings by Method $400,000 $300,000 $200,000 $100,000 $0 Gross Savings learning By hand Ripper Series Hamlet Optimal Time ($100,000) Net Savings Q- ($200,000) ($300,000) ($400,000) ($500,000) Etzioni, UW 29

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( - PowerPoint PPT Presentation

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( UW ) Craig Knoblock ( USC ) Alex Yates ( UW ) Rattapoom Tuchinda ( USC ) Price change over time for American Airlines flight #192:223, LAX-BOS, departing on Jan.

Demystifying Connexxus Travel ABOG (Irvine, CA) April 25, 2016 Whats In It for You? Airfare

The Freedom Ticket A Southeast Queens Proof of Concept December 2015 What is Freedom Ticket ?

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Contingent Purchase Price in Taxable Acquisitions Contingent Purchase Price in Taxable

STATION UPGRADE TICKET HALL STATION UPGRADE TICKET HALL Existing Proposed 2 Improved

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Introduction What is data mining? to Data mining functionalities Data Mining Major

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

online bus ticket 18 January 2013 Online Bus Ticket www.obilet.com EBAN oBilet

Instructions Overview The Ego Id Ticket provides an unprecedented opportunity to track and

Opportunities for Ticket Holders Social Security s Ticket to Work Program Operations Support

Blank NJUNS Ticket Blank NJUNS Ticket Poles and Asset Page NJUNS Legacy and Ongoing User

Nine Elms Community Liaison Group 18 July 2019 Apr 19 Jun 19 Ticket Hall 2 April

HealthScratch Ticket Spin The Wheel 1 NEW TICKET EVERY DAY 3 CHANCES TO SPIN EVERY DAY Weekly

Cyber@UC Meeting 42 CEH Cryptography If Youre New! Join our Slack ucyber.slack.com SIGN

MLIC: A MaxSAT-Based framework for learning interpretable classification rules Dmitry Malioutov 1

Compatibility, coherence and the running intersection property Enrique Miranda Marco Zaffalon

BU CS 332 Theory of Computation Lecture 6: Reading: NFAs > Regular expressions

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 Prof. Dr. Johannes Frnkranz

Lecture 3 - Passwords and Authentication CMPSC 443 - Spring 2012 Introduction Computer and

7/13/18 D IV ERSIT Y & IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW

Reasoning Analytically About Password-Cracking Software Enze Alex Liu, Amanda Nakanishi,

Sambuz

Useful Links

Newsletter

Mail Us

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( - PowerPoint PPT Presentation

Mining Airfare Data to Minimize Ticket Purchase Price Oren Etzioni ( UW ) Craig Knoblock ( USC ) Alex Yates ( UW ) Rattapoom Tuchinda ( USC ) Price change over time for American Airlines flight #192:223, LAX-BOS, departing on Jan.

Demystifying Connexxus Travel ABOG (Irvine, CA) April 25, 2016 Whats In It for You? Airfare

The Freedom Ticket A Southeast Queens Proof of Concept December 2015 What is Freedom Ticket ?

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Contingent Purchase Price in Taxable Acquisitions Contingent Purchase Price in Taxable

STATION UPGRADE TICKET HALL STATION UPGRADE TICKET HALL Existing Proposed 2 Improved

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Introduction What is data mining? to Data mining functionalities Data Mining Major

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

online bus ticket 18 January 2013 Online Bus Ticket www.obilet.com EBAN oBilet

Instructions Overview The Ego Id Ticket provides an unprecedented opportunity to track and

Opportunities for Ticket Holders Social Security s Ticket to Work Program Operations Support

Blank NJUNS Ticket Blank NJUNS Ticket Poles and Asset Page NJUNS Legacy and Ongoing User

Nine Elms Community Liaison Group 18 July 2019 Apr 19 Jun 19 Ticket Hall 2 April

HealthScratch Ticket Spin The Wheel 1 NEW TICKET EVERY DAY 3 CHANCES TO SPIN EVERY DAY Weekly

Cyber@UC Meeting 42 CEH Cryptography If Youre New! Join our Slack ucyber.slack.com SIGN

MLIC: A MaxSAT-Based framework for learning interpretable classification rules Dmitry Malioutov 1

Compatibility, coherence and the running intersection property Enrique Miranda Marco Zaffalon

BU CS 332 Theory of Computation Lecture 6: Reading: NFAs &gt; Regular expressions

Iterative Optimization of Rule Sets Jiawei Du 16. November 2010 Prof. Dr. Johannes Frnkranz

Lecture 3 - Passwords and Authentication CMPSC 443 - Spring 2012 Introduction Computer and

7/13/18 D IV ERSIT Y &amp; IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW

Reasoning Analytically About Password-Cracking Software Enze Alex Liu, Amanda Nakanishi,

Sambuz

Useful Links

Newsletter

Mail Us

BU CS 332 Theory of Computation Lecture 6: Reading: NFAs > Regular expressions

7/13/18 D IV ERSIT Y & IN C LU SIO N Jill Ripper D irector , T alent Acquisition H OW