Forecasting Potential Diabetes Complications Yang Yang, Jie Tang, - - PowerPoint PPT Presentation

forecasting potential diabetes complications
SMART_READER_LITE
LIVE PREVIEW

Forecasting Potential Diabetes Complications Yang Yang, Jie Tang, - - PowerPoint PPT Presentation

Forecasting Potential Diabetes Complications Yang Yang, Jie Tang, Juanzi Li Tsinghua University Walter Luyten, Marie-Francine Moens Katholieke Universiteit Leuven Lu Liu Northwestern University 1 Diabetes Complications Life-Threatening


slide-1
SLIDE 1

1

Yang Yang, Jie Tang, Juanzi Li

Tsinghua University

Walter Luyten, Marie-Francine Moens

Katholieke Universiteit Leuven

Lu Liu

Northwestern University

Forecasting Potential Diabetes Complications

slide-2
SLIDE 2

2

Diabetes Complications

  • Life-Threatening

– Over 4.8 million people died in 2012 due to diabetes[1]. – Over 68% of diabetes-related mortality is caused by diabetes complications[2]. – 471 billion USD, while 185 million patients remain undiagnosed[1].

  • Need to be diagnosed in time

[1] http://www.diabetes.org/ [2] http://www.idf.org/diabetesatlas/

coronary heart disease diabetic retinopathy

slide-3
SLIDE 3

3

Forecasting Diabetes Complication

Routine ¡urine ¡ analysis ¡ Bilirubin ¡ example ¡

coronary heart disease diabetic retinopathy

Output: diabetes complications Input: a patient’s lab test results

slide-4
SLIDE 4

4

Data Set

A collection of real clinical records from a hospital in

Beijing, China over one year.

Clinical record

Challenge: feature sparseness

  • Each clinical record only contains

24.43 different lab tests

  • 65.5% of lab tests exist in < 10

clinical records (0.00054%).

Item Statistics Clinical records 181,933 Patient 35,525 Lab tests 1,945

slide-5
SLIDE 5

5

Our Approach

slide-6
SLIDE 6

6

Baseline Model I

Learning task: f (xi) → yi

Limitations:

1. Cannot model correlations between y 2. Cannot handle sparse features

xi

WBC RBC

0.5 0.3 / P

PRO HBV

...

CHD Feature vector 0.5 0.3 / 1

...

Clinical Record Complication

slide-7
SLIDE 7

7

xi

x j

WBC RBC

0.5 0.3 / P

PRO HBV

...

WBC RBC

0.9 0.2 / N

PRO HBV

...

David time t time t+1

Baseline Model II

Objective function:

Still cannot handle sparse features!

slide-8
SLIDE 8

8

Proposed Model

0.5 / / 0.3 / / 0.6 / 0.4 0.2 0.1 0.2 0.1 0.4

Association vector Output Layer Latent Layer Input Layer

dimensional reduction classification Objective function:

slide-9
SLIDE 9

9

Learning Algorithm

Output Layer Latent Layer Input Layer

1 2

slide-10
SLIDE 10

10

Learning Algorithm (cont.)

  • Update the dimensional reduction parameters

– The remaining part of SparseFGM could be regarded as a mixture generative model, with the log- likelihood – Jensen’s inequality tells us that – Derivate with respect to each parameters, set them to zero, and get the update equations.

1 1

slide-11
SLIDE 11

11

Learning Algorithm (cont.)

  • Update the classification

parameters

– New log-likelihood – Adopt a gradient descent method to optimize the new log-likelihood

slide-12
SLIDE 12

12

Theoretical Analysis

1 2 3 1 2 3

、 ¡ 、 ¡

indicate ¡

slide-13
SLIDE 13

13

Experiments

slide-14
SLIDE 14

14

Setting

  • Experiments

Is our model effective? How do different diabetes complications associate with each lab test? Can we forecast all diabetes complications well?

  • Comparison Methods
  • SVM (model I)
  • FGM (model II)
  • FGM+PCA (an alternative method to handle feature

sparseness)

  • SparseFGM (our approach)
slide-15
SLIDE 15

15

Experimental Results

HTN: hypertension, CHD: coronary heart disease, HPL: hyperlipidemia

SVM and FGM suffer from feature sparseness. -59.9% in recall. FGM vs. FGM + PCA (increase +40.3% in recall) PGM+PCA vs. SparseFGM (increase +13.5% in F1)

slide-16
SLIDE 16

16

Association Pattern Illustration

H T N C H D H P L C V D b r

  • .

O P i n s . F L D R d e p r . Vitamin C KET URO BIL Nitrite RBC WBC GLU PRO

WBC in the urine causes frequent voiding -> no good sleep at night Association score: c: complication, e: lab test insomnia

slide-17
SLIDE 17

17

Can We Forecast All Diabetes Complications?

HPL can be forecasted precisely based on lab test results.

slide-18
SLIDE 18

18

Conclusion

  • We study the problem of forecasting diabetes

complications.

  • We propose a graphical model which integrates

dimensional reduction and classification into a uniform framework.

  • We further study the underlying associations between

different diabetes complications and lab test types.

slide-19
SLIDE 19

19

Thanks! Q&A?

@ Yang Yang http://yangy.org/