CAMT Data Mining: A CAMT Data Mining: A Case Study Case Study - - PowerPoint PPT Presentation

camt data mining a camt data mining a case study case
SMART_READER_LITE
LIVE PREVIEW

CAMT Data Mining: A CAMT Data Mining: A Case Study Case Study - - PowerPoint PPT Presentation

CAMT Data Mining: A CAMT Data Mining: A Case Study Case Study Manawin Songkroh Manawin Songkroh College of Arts, Media and Technology College of Arts, Media and Technology manawin@live.com manawin@live.com Outline Outline Data Mining


slide-1
SLIDE 1

CAMT Data Mining: A CAMT Data Mining: A Case Study Case Study

Manawin Songkroh Manawin Songkroh College of Arts, Media and Technology College of Arts, Media and Technology manawin@live.com manawin@live.com

slide-2
SLIDE 2

Outline Outline

Data Mining Definition Data Mining Definition CAMT CAMT’ ’s Profile s Profile Literature Review Literature Review Purpose of the study Purpose of the study Data used Data used Proposed Tool: Rapid Miner Proposed Tool: Rapid Miner

slide-3
SLIDE 3

Why Data Mining? Why Data Mining?

is good for is good for information technology information technology era era saves saves time and cost time and cost (Fayyad et al., 1996) (Fayyad et al., 1996) has been accepted by has been accepted by organizations in many fields

  • rganizations in many fields

(NASA, US Treasury Network, Banking Industry, (NASA, US Treasury Network, Banking Industry, Retailer, Medical, Bioinformatics....) Retailer, Medical, Bioinformatics....)

slide-4
SLIDE 4

Data Mining in the real Data Mining in the real world world

Marketing: market Marketing: market-

  • basket analysis

basket analysis Investment: Managing portfolio (LBS Capital Investment: Managing portfolio (LBS Capital Management) Management) http://www.lbs.com/lbs_tech.htm http://www.lbs.com/lbs_tech.htm Fraud Detection: PRISM System for Credit Card Fraud Detection: PRISM System for Credit Card Fraud, FAIS System for detecting money laundering Fraud, FAIS System for detecting money laundering activities. activities.

slide-5
SLIDE 5

DM & KDD DM & KDD

“ “KDD refers to the overall process of discovering KDD refers to the overall process of discovering useful knowledge from data and data mining refers to useful knowledge from data and data mining refers to a particular step in this process. a particular step in this process.” ” (Fayyad et. al., (Fayyad et. al., 1996, p.39) 1996, p.39) The additional steps in KDD process are data The additional steps in KDD process are data preparation, data selection, data cleaning and etc. preparation, data selection, data cleaning and etc.

slide-6
SLIDE 6

Literature Review Literature Review

Hsieh (2004) uses an integrated data mining and Hsieh (2004) uses an integrated data mining and behavioral scoring model to manage existing credit behavioral scoring model to manage existing credit card customer in a bank. card customer in a bank.

slide-7
SLIDE 7

CAMT Profile CAMT Profile

  • ver 1000 students, founded in 2004
  • ver 1000 students, founded in 2004

125 staffs (75 teaching and 50 supporting) 125 staffs (75 teaching and 50 supporting) multidisciplinary college: MMIT, Animation, Software multidisciplinary college: MMIT, Animation, Software Engineering, KM (PHD) Engineering, KM (PHD)

slide-8
SLIDE 8

Current Problems in CRM Current Problems in CRM

Low number of applicants in Software Engineering Low number of applicants in Software Engineering High dropout and expel rate in MMIT High dropout and expel rate in MMIT

slide-9
SLIDE 9

Purpose Purpose

to cluster students for better CRM plan to cluster students for better CRM plan to build the predictive model for tentative drop to build the predictive model for tentative drop-

  • out
  • ut

students students

slide-10
SLIDE 10

Personnel/Students Personnel/Students amount amount Lecturer Lecturer 75 75 Supporting Staff Supporting Staff 25 25 Temporary STaff Temporary STaff 20 20 Undergraduate Undergraduate 700 700 Master Master 60 60 PHD PHD 100 100

Stats Stats

slide-11
SLIDE 11

RapidMiner RapidMiner

http://rapid http://rapid-

  • i.com/content/view/26/84/

i.com/content/view/26/84/ Window, and other systems with Java Window, and other systems with Java RapidMiner 4.6 RapidMiner 4.6 Open Open-

  • Source from German Firm

Source from German Firm

slide-12
SLIDE 12

Data Used Data Used

CAMT Student Records from Registration Office of CAMT Student Records from Registration Office of Chiang Mai University. Chiang Mai University.

slide-13
SLIDE 13

Data File Data File-

  • .dbf form

.dbf form

slide-14
SLIDE 14

Project (Study) Project (Study) Management) Management)

Data Acquisition Data Acquisition Data Preparation & Understanding Data Preparation & Understanding Data Experimentation Data Experimentation Data Validation Data Validation Writing Paper Writing Paper

slide-15
SLIDE 15

Next Presentation Next Presentation

Detailed steps in accomplishing the paper Detailed steps in accomplishing the paper Results from Data Preparation and Understanding & Results from Data Preparation and Understanding & Model Selection Model Selection Q & A Q & A