Comparison of Bayesian Network and Decision Tree Methods for - - PowerPoint PPT Presentation

comparison of bayesian network and decision tree methods
SMART_READER_LITE
LIVE PREVIEW

Comparison of Bayesian Network and Decision Tree Methods for - - PowerPoint PPT Presentation

Comparison of Bayesian Network and Decision Tree Methods for Predicting Access to the Renal Transplant Waiting List Sahar BAYAT, Marc CUGGIA, Delphine ROSSILLE, Michle KESSLER, Luc FRIMAT INSERM U936, Universit Rennes 1, IFR 140, Rennes,


slide-1
SLIDE 1

Bayat – MIE 2009

Comparison of Bayesian Network and Decision Tree Methods for Predicting Access to the Renal Transplant Waiting List

Sahar BAYAT, Marc CUGGIA, Delphine ROSSILLE, Michèle KESSLER, Luc FRIMAT

INSERM U936, Université Rennes 1, IFR 140, Rennes, France EA 4003, Nancy Université, France

slide-2
SLIDE 2

Bayat – MIE 2009

Introduction

  • Renal replacement therapy (RRT):

Hemodialysis Peritoneal dialysis Kidney transplantation

  • Kidney transplantation :

Longer survival Lower long-term cost Graft shortage

  • Selection criteria diverges from one center to another
slide-3
SLIDE 3

Bayat – MIE 2009

Introduction

  • Ideally, selection based on medical factors :

Women Elderly Distance from transplantation department Private ownership of dialysis facilities

  • NEPHROLOR healthcare network :

French region : Lorraine Access to the renal transplant waiting list : Age Medical factors Conventional statistical methods and Bayesian networks : similar results

slide-4
SLIDE 4

Bayat – MIE 2009

Objectif

  • Compare the performance of Bayesian

networks and decision trees for predicting registration on the renal transplant waiting list in NEPHROLOR network

slide-5
SLIDE 5

Bayat – MIE 2009

Material and method

  • NEPHROLOR healthcare network :

Combines public and private for-profit dialysis facilities Only one transplant centre at university hospital of Nancy

  • Study population:

Adult patients Living in Lorraine Starting RRT in NEPHROLOR network facilities (incident patients) Between July 1, 1997 and June 30, 2003

slide-6
SLIDE 6

Bayat – MIE 2009

Material and method

  • Data collection :

Social and demographic data : age, sex and distance between the patient's residence and the department performing transplantation Clinical and biological data at first RRT: existence of diabetes, cardiovascular disease, respiratory disease, hepatic disease, psychiatric disorder, past history of malignancy, physical impairment of ambulation, Body Mass Index ( <20, 20-24.99, ≥25), hemoglobin in (<11 g/dl, ≥ 11) and serum albumin (<3 g/dl, 3-3.49, ≥ 3.5) Data related to medical follow up in the NEPHROLOR network :

  • wnership of nephrology facility where the first RRT was performed

(public or private), medical follow-up in the department performing transplantation versus 12 other facilities without transplantation

slide-7
SLIDE 7

Bayat – MIE 2009

Material and method

  • Statistical analysis :
  • Data set :

:

  • 1. Training set : 90%
  • 2. Validation set : 10%

Comparison of the two sets : χ²

  • Training set : Modelling registration on the waiting list by Bayesian

network and decision tree

  • Validation set : predictive performances of both models (sensitivity,

specificity and positive predictive values)

  • Difference between the two models : McNemar test
slide-8
SLIDE 8

Bayat – MIE 2009

Material and method

  • Bayesian network :

Conditional dependences between the variables Probabilistic relationships : diseases and symptoms Directed acyclic graph : Nodes : variables Arcs : relationship between variables

not necessarily a cause-effect relationship

slide-9
SLIDE 9

Bayat – MIE 2009

Material and method

  • Decision tree approach:

Tree-structured classifier Built by partitioning data into homogenous classes Roote node split into child nodes : Selecting the variable that best classifies the samples according to a split criterion

  • CART method
slide-10
SLIDE 10

Bayat – MIE 2009

Results

  • Patients’ characteristics :

809 patients included mean age : 62.1 ± 14.2 years 59.6% male 34.5% diabetes 44.2% cardiovascular disease 11.1% respiratory disease 14.1% past history of malignancy 19.5% physical impairment 5.9% psychiatric disorder 212 (26.2%) registered on the transplant waiting list

slide-11
SLIDE 11

Bayat – MIE 2009

Results

  • Training set:

729 patients

  • Validation set:

80 patients

  • No significant difference between the characteristics of

the two sets

slide-12
SLIDE 12

Bayat – MIE 2009

Results – Bayesian network

slide-13
SLIDE 13

Bayat – MIE 2009

Results – Bayesian network

  • Predictive performances on validation set:

Sensitivity: 90.0 % (95%CI: 76.8–100) Specificity: 96.7% (95%CI: 92.2–100) Positive predictive value: 90.0% (95%CI: 76.8–100)

  • Correct predictions:

18 out of 20 registrations 58 out of 60 non registrations

slide-14
SLIDE 14

Bayat – MIE 2009

Results – Decision tree

NR: Non Registered, R: Registered, CVD: CardioVascular Disease Age Albumin Age <45 45-55 55-65 ≥ 65 BMI CVD Diabetes 55-65 ≥ 3.5 3-3.5 < 3 Distance Yes

96% NR 4% R

≥ 65

16% NR 84% R

No

65% NR 35% R 36% NR 64% R

≥ 25 20-25 < 20

90% NR 10% R

Yes Albumin No 81% NR 19% R CVD < 3 3-3.5 ≥ 3.5

75% NR 25% R 37% NR 63% R

50-100 > 100 < 50

39% NR 61% R

73% NR 27% R No Yes

slide-15
SLIDE 15

Bayat – MIE 2009

Results – Decision tree

  • Predictive performances on validation set:

Sensitivity: 90.0 % (95%CI: 76.8–100) Specificity: 96.7% (95%CI: 92.2–100) Positive predictive value: 90.0% (95%CI: 76.8–100)

  • Correct predictions:

18 out of 20 registrations 58 out of 60 non registrations

slide-16
SLIDE 16

Bayat – MIE 2009

Results – Bayesian network and Decision tree

  • High predictive performances on validation set
  • McNemar : No significant difference between the models
  • Predictions discordant for 2 patients
  • Kappa of concordance : 0.93
slide-17
SLIDE 17

Bayat – MIE 2009

Discussion

  • Decision tree and the Bayesian methods showed :

High performances for predicting access to renal transplant waiting list in NEPHROLOR network Models highly concordant Age the most important variable for both models

slide-18
SLIDE 18

Bayat – MIE 2009

Discussion

Bayesian network Decision tree

Cardiovascular disease Diabetes Albumin Cardiovascular disease Diabetes Albumin

slide-19
SLIDE 19

Bayat – MIE 2009

Discussion

Bayesian network Decision tree

Cardiovascular disease Diabetes Albumin Respiratory disease Follow-up in transplantation center Cardiovascular disease Diabetes Albumin BMI Distance from transplantation center

slide-20
SLIDE 20

Bayat – MIE 2009

Discussion

Bayesian network Decision tree

Cardiovascular disease Diabetes Albumin Respiratory disease Follow-up in transplantation center Visualizes other relationships : Cardiovascular disease Diabetes Albumin BMI Distance from transplantation center

slide-21
SLIDE 21

Bayat – MIE 2009

Discussion

Bayesian network Decision tree

Cardiovascular disease Diabetes Albumin Respiratory disease Follow-up in transplantation center Visualizes other relationships : Cardiovascular disease Diabetes Albumin BMI Distance from transplantation center

slide-22
SLIDE 22

Bayat – MIE 2009

Discussion

Bayesian network Decision tree

Cardiovascular disease Diabetes Albumin Respiratory disease Follow-up in transplantation center Visualizes other relationships Links variables: complex, direct and indirect ways interpretation more problematic Cardiovascular disease Diabetes Albumin BMI Distance from transplantation center Decision rules : Easily derived from decision tree Simpler interpretation tool for physicains

slide-23
SLIDE 23

Bayat – MIE 2009

Conclusion

  • Bayesian network and decision tree predict access to

renal transplant waiting list in NEPHROLOR with high accuracy

  • Models are complementary :

Bayesian network : global view of associations Decision tree : more easily interpretable

  • Formalizing and optimizing the health care process