SLIDE 1

Datamining – Recursive partitioning trees

Søren Højsgaard
Department of Mathematical Sciences
Aalborg University, Denmark
August 22, 2012

Printed: August 22, 2012 File: datamining-slides.tex

SLIDE 2


Contents

1 Introduction
2 Example - wine data

SLIDE 3


1 Introduction

Data mining is an umbrella term for a wide variety of techniques for exploring data. We illustrate one particular technique: Recursive partitioning trees.

SLIDE 4


2 Example - wine data

The wine data contain measurements of the chemical composition of wine samples from 3 different cultivars (varieties).

data(wine, package="gRbase")
head(wine)
  Cult  Alch Mlca  Ash Aloa Mgns Ttlp Flvn Nnfp Prnt Clri  Hue Oodw Prln
1   v1 14.23 1.71 2.43 15.6  127 2.80 3.06 0.28 2.29 5.64 1.04 3.92 1065
2   v1 13.20 1.78 2.14 11.2  100 2.65 2.76 0.26 1.28 4.38 1.05 3.40 1050
3   v1 13.16 2.36 2.67 18.6  101 2.80 3.24 0.30 2.81 5.68 1.03 3.17 1185
4   v1 14.37 1.95 2.50 16.8  113 3.85 3.49 0.24 2.18 7.80 0.86 3.45 1480
5   v1 13.24 2.59 2.87 21.0  118 2.80 2.69 0.39 1.82 4.32 1.04 2.93  735
6   v1 14.20 1.76 2.45 15.2  112 3.27 3.39 0.34 1.97 6.75 1.05 2.85 1450

table(wine$Cult)

v1 v2 v3
59 71 48

Question: Can we construct a model that will be good at classifying the variety from the chemical measurements?

SLIDE 5


The general picture: We have a categorical response variable y (3 levels for the wine data) and a number of predictor variables x1, ..., xp (13 predictors for the wine data). Idea:

  • Split data into two subgroups according to the values of one of the predictors, say x1.
  • Split the first subgroup according to the values of one of the other predictors, say x2.
  • Split the second subgroup according to the values of one of the other predictors, say x3 (or possibly also x2).
  • and so on... (a single split is illustrated in the sketch after this list)
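To make the idea concrete, here is a minimal sketch (not on the original slides) of a single split carried out by hand on the wine data. The cutpoint Prln >= 755 is the one rpart happens to choose on the following slides; for the purpose of illustration any cutpoint would do.

data(wine, package="gRbase")
## split the data into two subgroups on one predictor
left  <- subset(wine, Prln >= 755)   # first subgroup
right <- subset(wine, Prln <  755)   # second subgroup
## class distribution of the response within each subgroup
table(left$Cult)
table(right$Cult)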
SLIDE 6


To get this to work we need

  • a rule for deciding which variable to split on, and
  • a rule for deciding when to stop splitting.

This is implemented in the rpart() function in the rpart package. A simple usage where we allow one split only:

library(rpart)
f1 <- rpart(Cult ~ ., data=wine, control=rpart.control(maxdepth=1))
plot(f1, uniform=TRUE, margin=0.2)
text(f1, use.n=TRUE)
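The two rules mentioned above correspond to arguments of rpart(). As a hedged sketch (not from the slides): for classification trees the splitting rule is the Gini index by default and can be switched to the entropy criterion via parms, while the stopping behaviour is governed by rpart.control(); the values below are only illustrative.

## splitting rule: Gini index (default) or information/entropy
## stopping rule: bounds on tree growth via rpart.control()
f1b <- rpart(Cult ~ ., data=wine,
             parms   = list(split="information"),
             control = rpart.control(minsplit=20,   # fewest cases in a node before a split is attempted
                                     cp=0.01,       # smallest improvement that justifies a split
                                     maxdepth=30))  # maximum depth of the tree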

SLIDE 7


[Tree plot for f1: root split on Prln >= 755; left leaf labelled v1 with counts 57/4/6, right leaf labelled v2 with counts 2/67/42]

Read this as:

  • Split on whether Prln ≥ 755. "Yes" is to the left, "no" to the right.
  • 57 + 4 + 6 = 67 cases appear on the leaf to the left. These cases are all given the label v1.
  • 57 cases have variety v1, 4 are of variety v2 and 6 are of variety v3.

The same counts can be read off the fitted object directly, as sketched below.
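A small sketch (not part of the slides) of how to recover this information without the plot:

print(f1)                   # text representation of the splits and the node counts
table(f1$where, wine$Cult)  # leaf each observation falls in, versus its true variety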
SLIDE 8


Alternatively, we can leave it to the data to suggest the number of splits:

f2 <- rpart(Cult ~ ., data=wine)
plot(f2, uniform=TRUE, margin=0.2)
text(f2, use.n=TRUE)

[Tree plot for f2: splits on Prln >= 755, Flvn >= 2.165, Oodw >= 2.115 and Hue >= 0.9; leaves labelled v1 (57/2/0), v3 (0/2/6), v2 (2/61/2), v2 (0/5/2) and v3 (0/1/38)]
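How many splits rpart "suggests" is governed by its default stopping rules, in particular the complexity parameter cp. A hedged sketch of how one might inspect this and, if desired, prune the tree (not shown on the slides; the cp value is only an example):

printcp(f2)                # cross-validated error for each value of the complexity parameter
plotcp(f2)                 # the same information as a plot
f2p <- prune(f2, cp=0.05)  # prune back to a smaller tree at a chosen cp value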

SLIDE 9


Having done so, a natural question to ask is how good our classification is:

table(wine$Cult, predict(f1, type="class"))

     v1 v2 v3
  v1 57  2  0
  v2  4 67  0
  v3  6 42  0

table(wine$Cult, predict(f2, type="class"))

     v1 v2 v3
  v1 57  2  0
  v2  2 66  3
  v3  0  4 44
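From these tables, f1 misclassifies 2 + 4 + 6 + 42 = 54 of the 178 wines (about 30%), whereas f2 misclassifies 2 + 2 + 3 + 4 = 11 (about 6%). Note that these are apparent (resubstitution) error rates, computed on the same data the trees were grown from. A small sketch of the computation:

mean(predict(f1, type="class") != wine$Cult)  # apparent error rate for the one-split tree
mean(predict(f2, type="class") != wine$Cult)  # apparent error rate for the larger tree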