The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON - PowerPoint PPT Presentation

The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON Román de las Heras Data Scientist, SAP / Agile Solutions

Relay races Effective team leader (anchor): Know the team : strengths and weaknesses De�ne tasks : responsibilities Take part : participation ENSEMBLE METHODS IN PYTHON

Relay race for models Passing the baton <--> Passing predictions ENSEMBLE METHODS IN PYTHON

Stacking architecture ENSEMBLE METHODS IN PYTHON

Combiner model as anchor Effective combiner model (anchor): Know the team : strengths and weaknesses De�ne tasks : responsibilities Take part : participation ENSEMBLE METHODS IN PYTHON

Time to practice! EN S EMBLE METH ODS IN P YTH ON

Build your �rst stacked ensemble EN S EMBLE METH ODS IN P YTH ON Román de las Heras Data Scientist, SAP / Agile Solutions

Stacking models with scikit-learn Some reasons to build from scratch: 1. scikit-learn has no stacking implementation 2. We will build stacking models from scratch 3. scikit-learn estimators can be used as a base ENSEMBLE METHODS IN PYTHON

General Steps General steps for the implementation: 1. Prepare the dataset 2. Build the �rst-layer estimators 3. Append the predictions to the dataset 4. Build the second-layer meta estimator 5. Use the stacked ensemble for predictions ENSEMBLE METHODS IN PYTHON

1. Prepare the dataset # Select input features and target # Apply any required transformations selected_feats = ['feat1', 'feat2', ..., # Example: categorical to 'dummies' 'featN'] features = pd.get_dummies(features) features = dataset.iloc[:,selected_feats] target = dataset['target_feature'] # Split into train (60%) and test(40%) X_train, X_test, y_train, y_test = # Data cleaning train_test_split( # Example: Fill NA values with zero features, target, test_size=0.4, features.fillna(value=0, inplace=True) random_state=42) ENSEMBLE METHODS IN PYTHON

2. Build the �rst-layer estimators Build and �t the �rst-layer estimators # 1. A Gaussian Naive Bayes classifier clf_nb = GaussianNB() clf_nb.fit(X_train, y_train) # 2. A 5-nearest neighbors classifier using the 'Ball-Tree' algorithm clf_knn = KNeighborsClassifier(n_neighbors=5, algorithm='ball_tree') clf_knn.fit(X_train, y_train) ENSEMBLE METHODS IN PYTHON

3. Append the predictions to the dataset Calculate predictions and add them to the training set: # Predict with the first-layer estimators on X_train pred_nb = clf_nb.predict(X_train) pred_knn = clf_knn.predict(X_train) # Create a Pandas DataFrame with the predictions pred_df = pd.DataFrame({ 'pred_nb': pred_nb, 'pred_knn': pred_knn }) # Concatenate X_train with the predictions DataFrame X_train_2nd = pd.concat([X_train, pred_df], axis=1) ENSEMBLE METHODS IN PYTHON

4. Build the second-layer meta estimator # Instantiate the second-layer estimator # Example: a Logistic Regression classifier clf_stack = LogisticRegression() # Train the model using the second training set clf_stack.fit(X_train_2nd, y_train) ENSEMBLE METHODS IN PYTHON

5. Use the stacked ensemble for predictions # Predict with the first-layer estimators on X_train pred_nb = clf_nb.predict(X_test) pred_knn = clf_knn.predict(X_test) pred_df = pd.DataFrame({ 'pred_nb': pred_nb, 'pred_knn': pred_knn }) # Concatenate X_test with the predictions DataFrame X_test_2nd = pd.concat([X_test, pred_df], axis=1) # Obtain the final predictions from the second-layer estimator pred_stack = clf_stack.predict(X_test_2nd) ENSEMBLE METHODS IN PYTHON

It's your turn! EN S EMBLE METH ODS IN P YTH ON

Let’s mlxtend it! EN S EMBLE METH ODS IN P YTH ON Román de las Heras Data Scientist, SAP / Agile Solutions

Mlxtend M achine L earning E xten sions Utilities and tools for Data Science tasks: Feature selection Ensemble methods Visualization Model evaluation Intuitive and friendly API Compatible with scikit-learn estimators 1 Raschka, Sebastian (2018) MLxtend: Providing machine learning and data science utilities and extensions to Python's scienti�c computing stack: http://rasbt.github.io/mlxtend/ ENSEMBLE METHODS IN PYTHON

Stacking implementation from mlxtend Characteristics: Individual estimators are trained on the complete features The meta-estimator is trained using the predictions as the only meta-features The meta-estimator can be trained with labels or probabilities as target ENSEMBLE METHODS IN PYTHON

StackingClassi�er with mlxtend from mlxtend.classifier import StackingCla # Build the Stacking classifier clf_stack = StackingClassifier( classifiers=[clf1, clf2, ... clfN], # Instantiate the 1st-layer classifiers meta_classifier=clf_meta, clf1 = Classifier1(params1) use_probas=False, clf2 = Classifier2(params2) use_features_in_secondary=False) ... clfN = ClassifierN(paramsN) # Use the fit and predict methods # like with scikit-learn estimators # Instantiate the 2nd-layer classifier clf_stack.fit(X_train, y_train) clf_meta = ClassifierMeta(paramsMeta) pred = clf_stack.predict(X_test) ENSEMBLE METHODS IN PYTHON

StackingRegressor with mlxtend from mlxtend.regressor import StackingRegr # Build the Stacking regressor reg_stack = StackingRegressor( regressors=[reg1, reg2, ... regN], # Instantiate the 1st-layer regressors meta_regressor=reg_meta, reg1 = Regressor1(params1) use_features_in_secondary=False) reg2 = Regressor2(params2) ... regN = RegressorN(paramsN) # Use the fit and predict methods # like with scikit-learn estimators reg_stack.fit(X_train, y_train) # Instantiate the 2nd-layer regressor pred = reg_stack.predict(X_test) reg_meta = RegressorMeta(paramsMeta) ENSEMBLE METHODS IN PYTHON

Let’s mlxtend it! EN S EMBLE METH ODS IN P YTH ON

Ensembling it all together EN S EMBLE METH ODS IN P YTH ON Román de las Heras Data Scientist, SAP / Agile Solutions

Chapter 1: Voting and Averaging Voting Good choices when you: Combination: mode (majority) Have built multiple different models Classi�cation Are not sure which is the best Heterogeneous ensemble method Want to improve the overall performance Averaging Combination: mean (average) Classi�cation and Regression Heterogeneous ensemble method ENSEMBLE METHODS IN PYTHON

Chapter 2: Bagging Weak estimator Good choice when you: Performs just better than random guessing Want to reduce variance Light model and fast model Need to avoid over�tting Base for homogeneous ensemble methods Need more stability and robustness Bagging (Bootstrap Aggregating) * Observation: Random subsamples with replacement Bagging is computationally expensive Large amount of "weak" estimators Aggregated by Voting or Averaging Homogeneous ensemble method ENSEMBLE METHODS IN PYTHON

Chapter 3: Boosting Gradual learning Good choice when you: Homogeneous ensemble method type Have complex problems Based on iterative learning Need to apply parallel processing or Sequential model building distributed computing Boosting algorithms Have big datasets or high-dimensional AdaBoost categorical features Gradient Boosting: XGBoost LightGBM CatBoost ENSEMBLE METHODS IN PYTHON

Chapter 4: Stacking Stacking Good choice when you: Combination: meta-estimator (model) Have tried Voting / Averaging but results Classi�cation and Regression are not as expected Heterogeneous ensemble method Have built models which perform well in different cases Implementation From scratch using pandas and sklearn Using the existing MLxtend library ENSEMBLE METHODS IN PYTHON

Thank you and well ensembled! EN S EMBLE METH ODS IN P YTH ON

The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON - PowerPoint PPT Presentation

The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON Romn de las Heras Data Scientist, SAP / Agile Solutions Relay races Effective team leader (anchor): Know the team : strengths and weaknesses Dene tasks : responsibilities

Book Stacking Harmonic Sums table Albert R Meyer, April 6, 2012 Albert R Meyer,

and Science? Ron Spronk intuition connoisseurship science intuition

Stacking for supervised learning Stacking for supervised learning Niall Rooney, NIKEL,

Information Option Stacking (draft-zheng-dhc-relay-agent-stacking-00) Robin Zheng IETF 76 - DHC

CRS stacking: a simplified explanation Motivation CRS stack Jrgen Mann 1 , Jrg Schleicher 2 ,

THE NATURAL HISTORY OF INTUITION Adapted from The Symposium on Intuition Institute of Noetic

Intuition: The Inner Game of Brief Therapy Presented by Bette Freedson, MSW Intuition: The

How to use intuition in your business to cultivate productivity and flow Brand Designer &

Showmanship Gaiting, Stacking, and More Topics What is Showmanship? Ring Producer

Presenting a live 90-minute webinar with interactive Q&A All-Sums-With-Stacking Rule:

Stacking Blocks and Counting Permutations Hana Mizuno Occidental College mizuno@oxy.edu

Lecture 21: Stacking CS109A Introduction to Data Science Pavlos Protopapas and Kevin Rader

Stacking Energies and RNA Structure Prediction Bioinformatics Senior Project Adrian Lawsin

Efficient Training of BERT by Progressively Stacking Linyuan Gong, Di He , Zhuohan Li, Tao Qin,

Progressive Stacking in Chat We invite BIPOC (Black, Indigenous, People of Color) to add an

Parameterization, stacking, and with the CRS Stack method inversion of locally coherent events

Java: Learning to Program with Robots Chapter 01: Programming with Objects Chapter Objectives

Forward Search Temporal Planning with Simultaneous Events Daniel Furelos-Blanco 1 Anders Jonsson 1

AT ATI TEAS READING REVIEW PART 1 UNDERSTANDING TEXT STRUCTURES AND WRITING STYLES Written

The Tor Project Our mission is to be the global resource for technology, advocacy, research and

Decimal Addition Return to Table of Contents Slide 5 / 152 Place Value Chart Slide 6 / 152

Parallel Replication Tutorial Eduardo Ortega (MySQL Database Engineer) eduardo DOT ortega AT

CSE543 - Introduction to Computer and Network Security Module: Research Methods 1 Professor

INTRODUCTION ScWk 240 Research Methods and Design

The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON - PowerPoint PPT Presentation

The intuition behind stacking EN S EMBLE METH ODS IN P YTH ON Romn de las Heras Data Scientist, SAP / Agile Solutions Relay races Effective team leader (anchor): Know the team : strengths and weaknesses Dene tasks : responsibilities

Book Stacking Harmonic Sums table Albert R Meyer, April 6, 2012 Albert R Meyer,

and Science? Ron Spronk intuition connoisseurship science intuition

Stacking for supervised learning Stacking for supervised learning Niall Rooney, NIKEL,

Information Option Stacking (draft-zheng-dhc-relay-agent-stacking-00) Robin Zheng IETF 76 - DHC

CRS stacking: a simplified explanation Motivation CRS stack Jrgen Mann 1 , Jrg Schleicher 2 ,

THE NATURAL HISTORY OF INTUITION Adapted from The Symposium on Intuition Institute of Noetic

Intuition: The Inner Game of Brief Therapy Presented by Bette Freedson, MSW Intuition: The

How to use intuition in your business to cultivate productivity and flow Brand Designer &amp;

Showmanship Gaiting, Stacking, and More Topics What is Showmanship? Ring Producer

Presenting a live 90-minute webinar with interactive Q&amp;A All-Sums-With-Stacking Rule:

Stacking Blocks and Counting Permutations Hana Mizuno Occidental College mizuno@oxy.edu

Lecture 21: Stacking CS109A Introduction to Data Science Pavlos Protopapas and Kevin Rader

Stacking Energies and RNA Structure Prediction Bioinformatics Senior Project Adrian Lawsin

Efficient Training of BERT by Progressively Stacking Linyuan Gong, Di He , Zhuohan Li, Tao Qin,

Progressive Stacking in Chat We invite BIPOC (Black, Indigenous, People of Color) to add an

Parameterization, stacking, and with the CRS Stack method inversion of locally coherent events

Java: Learning to Program with Robots Chapter 01: Programming with Objects Chapter Objectives

Forward Search Temporal Planning with Simultaneous Events Daniel Furelos-Blanco 1 Anders Jonsson 1

AT ATI TEAS READING REVIEW PART 1 UNDERSTANDING TEXT STRUCTURES AND WRITING STYLES Written

The Tor Project Our mission is to be the global resource for technology, advocacy, research and

Decimal Addition Return to Table of Contents Slide 5 / 152 Place Value Chart Slide 6 / 152

Parallel Replication Tutorial Eduardo Ortega (MySQL Database Engineer) eduardo DOT ortega AT

CSE543 - Introduction to Computer and Network Security Module: Research Methods 1 Professor

INTRODUCTION ScWk 240 Research Methods and Design

How to use intuition in your business to cultivate productivity and flow Brand Designer &

Presenting a live 90-minute webinar with interactive Q&A All-Sums-With-Stacking Rule: