EVALUATION OF STUDENT PERFORMANCE WITH DATA MINING: AN APPLICATION - - PowerPoint PPT Presentation

evaluation of student performance with data mining an
SMART_READER_LITE
LIVE PREVIEW

EVALUATION OF STUDENT PERFORMANCE WITH DATA MINING: AN APPLICATION - - PowerPoint PPT Presentation

EVALUATION OF STUDENT PERFORMANCE WITH DATA MINING: AN APPLICATION OF ID3 AND CART ALGORITHMS Manawin Songkroh (Ph.D) College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, Thailand Andrea K (Ph.D) Corvinus University of


slide-1
SLIDE 1

EVALUATION OF STUDENT PERFORMANCE WITH DATA MINING: AN APPLICATION OF ID3 AND CART ALGORITHMS

Manawin Songkroh (Ph.D) College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, Thailand Andrea Kő (Ph.D) Corvinus University of Budapest, Hungary

slide-2
SLIDE 2

Outline:

  • Purpose
  • Business

Objective

  • Data Mining

Objective

  • Data Description
  • Data Quality

Assessment

  • Model Selection
  • Model Evaluation
  • Conclusions &

Implications

slide-3
SLIDE 3

Purposes:

  • General Purpose: to classify students

into successful and marginal groups, in order to find better ways to advise them and

  • To assist university admission officials in

identifying students that are likely to be successful in a graduate program

  • Data Mining Purpose: To create

classification model : CART & ID3

slide-4
SLIDE 4

Business Objective

  • Retain & ensure the

graduation in appropriate time frame

slide-5
SLIDE 5

Data Mining Objectives

  • Data Acquisition
  • Data Preparation
  • Build Classification Models
  • Model Evaluation
slide-6
SLIDE 6

Data Preparation

slide-7
SLIDE 7

example of data

slide-8
SLIDE 8

Data Recipe

slide-9
SLIDE 9

Model Selection

  • CART & ID3 assumes

nonparametric, algorithms

selected automatically, categorial/ continuous variables.

slide-10
SLIDE 10

Model Evaluation

  • Cross Validation
  • 80:20 (Training and Evaluation

Test Set)

slide-11
SLIDE 11

Data Description

slide-12
SLIDE 12

Male 46% Female 54%

Number of Students by Gender Male Female

M= 235 F= 272

slide-13
SLIDE 13

61% 12% 9% 7% 4% 3% 1% 1% 2%

By Province

Chiang Mai Lamphun Chiang Rai Lampang Payao Prae Nan Bangkok Other

slide-14
SLIDE 14

English I 28% English II 28% Labor Law 22% Management 11% STAT 11%

Grade F Frequency

English I English II Labor Law Management STAT

slide-15
SLIDE 15

Data Quality Assessment

  • No outliers
  • 10 missing data
  • GPA>2.5= Good,
  • GPA<2.5 --> Bad
slide-16
SLIDE 16

ID3 Model

slide-17
SLIDE 17

CART Model

B,B+ Good 8-0 C,C+ Good 18-2 D,D +,W Bad 2-8

slide-18
SLIDE 18

Model Evaluation

slide-19
SLIDE 19

Evaluator ID3-Cross Validation ID3-Test Set CART- Cross Validation CART- Test Set

Overall Accuracy Accuracy for Good Accuracy for Bad

77.37% 81.05% 75.30% 79.69% 68.42% 84.44% 76.64% 65.26% 83.15% 75% 63.16% 85%

Comparison of Model Evaluation

slide-20
SLIDE 20

0.225 0.450 0.675 0.900

ID3-Cross Validation ID3-Test Set CART-Cross Validation CART-Test Set

Comparison of Model Evaluation

slide-21
SLIDE 21
  • English, Statistics, and Information

and Communication Technology are the key determinator subjects.

  • The results of Classification are

congruent with the frequency data as many students receive F in these classes.

  • English and statistics should be the

subject used to screen students during admission.

Implications:

slide-22
SLIDE 22
  • Info & Comm Technology is the Major

mandatory subject that is required special attention as it will determine the academic performance of the other related subjects.

Implications (2)

slide-23
SLIDE 23

Conclusions

  • This presentation outlined the features
  • f a classification technique to

evaluate student performance in their undergraduate programs

  • Classification technique holds the

promise as an evaluation tool to classify students into successful and marginal categories and supports to identify students that are likely to be successful in a graduate program

slide-24
SLIDE 24

Conclusion (2)

  • The use of a classification model can

support and potentially improve decision making by program directors and dean.

slide-25
SLIDE 25

Q & A