Integration and Presentation Katsuro Inoue Osaka University - - PowerPoint PPT Presentation

integration and presentation
SMART_READER_LITE
LIVE PREVIEW

Integration and Presentation Katsuro Inoue Osaka University - - PowerPoint PPT Presentation

Session 2: Integration and Presentation Katsuro Inoue Osaka University Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University Papers in this Session 1.


slide-1
SLIDE 1

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Session 2:

Integration and Presentation

Katsuro Inoue Osaka University

slide-2
SLIDE 2

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Papers in this Session

  • 1. GlueTheos: Automating the Retrieval and

Analysis of Data from Publicly Available Software Repositories

  • 2. Using CVS Historical Information to

Understand How Students Develop Software

  • 3. Database Techniques for the Analysis and

Exploration of Software Repositories

  • 4. Empirical Project Monitor: A Tool for Mining

Multiple Project Data

slide-3
SLIDE 3

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

  • 1. GlueTheos: Automating the Retrieval and

Analysis of Data from Publicly Available Software Repositories

  • Objective:

– Analysis of free software systems – Measurement of LOC and its visualization

  • Approach:

– Script-based analyzer for CVS

  • Strength:

– Flexible architecture

  • Weakness:

– Applicability to other data ?

slide-4
SLIDE 4

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

  • 2. Using CVS Historical Information

to Understand How Students Develop Software

  • Objective:

– Analysis of activities in a development team

  • Approach:

– Hierarchical analysis (file-individual-team) of CVS data

  • Strength:

– Fine granularity analysis -> activity observation

  • Weakness:

– Scalability to larger projects ?

slide-5
SLIDE 5

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

  • 3. Database Techniques for the Analysis

and Exploration of Software Repositories

  • Objective:

– Analysis of e-mail archive

  • Approach:

– Putting everything into a single SQL DB and mining the DB

  • Strength:

– Once DB has been created, every operation is performed on it

  • Weakness:

– Performance ? – Data format translation to SQL DB

slide-6
SLIDE 6

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

  • 4. Empirical Project Monitor: A Tool for

Mining Multiple Project Data

  • Objective:

– Real-time process monitor

  • Approach:

– CVS, Mailman, and GNATS data to standardized XML database

  • Strength:

– data format standardization

  • Weakness:

– Applicability to other data sources

slide-7
SLIDE 7

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

General Model behind the Works

Collection Integration Analysis Presentation

Projects

  • Software Companies
  • Open Source Projects

Feedback Session 1 Session 3, 4, 5

slide-8
SLIDE 8

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Integration

1. GlueTheos: Raw CVS data

(+ XML, SQL for external interface)

2. Student Activities: CVS logs 3. Database: SQL DB 4. EPM: XML Standardized data

slide-9
SLIDE 9

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Presentation

  • 1. GlueTheos: Graphical presentation of

project’s LOC

  • 2. Student Activities: Graphical presentation
  • f detail activities
  • 3. Database:

Web-based browser for mail clusters

  • 4. EPM: Graphical presentation of LOC, #
  • f mails and bugs
slide-10
SLIDE 10

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Issues on Integration

  • Standardize data format

– Useful for data exchange? – Translation overhead

  • Database v.s. Raw files (CVS, Mail, ...)

– Easiness of data mining – Translation overhead and performance

slide-11
SLIDE 11

Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University

Issues on Presentation

  • Types of presentation

– Based on the goal of measurement/analysis – Currently, most works provide simple metrics graphs – Other analyses -> different presentations

  • ex. 3. mail analysis -> cluster browser
  • Other types of repository analysis ?

– Reuse, knowledge share, ...