Hadi Jahanshahi Mucahit Cevik Ayşe Başar
May 2020 33rd Canadian Conference on Artificial Intelligence
Predicting the Number of Reported Bugs in a Software Repository
Data Science Laboratory
Reported Bugs in a Software Repository Hadi Jahanshahi Mucahit - - PowerPoint PPT Presentation
Predicting the Number of Reported Bugs in a Software Repository Hadi Jahanshahi Mucahit Cevik Aye Baar May 2020 33 rd Canadian Conference on Artificial Intelligence Data Science Laboratory Outline Data Science Laboratory
Hadi Jahanshahi Mucahit Cevik Ayşe Başar
May 2020 33rd Canadian Conference on Artificial Intelligence
Data Science Laboratory
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
2/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
3/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
4/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
5/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
6/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
7/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
8/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
9/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
10/20
repository*. ____________________________
* Mozilla Bug Tracking System. https://bugzilla.mozilla.org/.
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
11/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
12/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
13/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
14/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
15/20
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
16/20
significantly different. Furthermore, the baseline seems as good as the others, a new finding which was not considered in previous studies.
traditional time series models.
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
17/20
Random Forest with exogenous variables exceeds other methods.
term prediction.
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
18/20
[1] Kenmei, B., Antoniol, G., di Penta, M.: Trend analysis and issue prediction in large-scale open source systems. In: 2008 12th European Conference on Software Maintenance and Reengineering, pp. 73–82, April 2008 [2] Krishna, R., Agrawal, A., Rahman, A., Sobran, A., Menzies, T.: What is the connection between issues, bugs, and enhancements? Lessons learned from 800+ software projects. In: Proceedings of the 40th International Conference on Software Engineering: Software Engineering in Practice, ICSE-SEIP 2018, pp. 306–315. Association for Computing Machinery, New York (2018) [3] Wu, W., Zhang, W., Yang, Y., Wang, Q.: Time series analysis for bug number
Mining, pp. 589–596, June 2010
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
19/20
[4] Yazdi, H.S., Angelis, L., Kehrer, T., Kelter, U.: A framework for capturing, statistically modeling and analyzing the evolution of software models. J. Syst.
[4] Destefanis, G., Ortu, M., Counsell, S., Swift, S., Tonelli, R., Marchesi, M.: On the randomness and seasonality of affective metrics for software development. In: Proceedings of the Symposium on Applied Computing, SAC 2017, pp. 1266–1271. Association for Computing Machinery, New York (2017) [5] Wang, J., Zhang, H.: Predicting defect numbers based on defect state transition
Empirical Software Engineering and Measurement, pp. 191–200, September 2012
Data Science Laboratory
May 20
Predicting the Number of Reported Bugs in a Software Repository Jahanshahi et al.
20/20
[6] Chen, X., Zhang, D., Zhao, Y., Cui, Z., Ni, C.: Software defect number prediction: unsupervised vs supervised methods. Inf. Softw. Technol. 106, 161–181 (2019) [7] Gao, K., Khoshgoftaar, T.M.: A comprehensive empirical study of count models for software fault prediction. IEEE Trans. Reliab. 56(2), 223–236 (2007) [8] Graves, T.L., Karr, A.F., Marron, J.S., Siy, H.: Predicting fault incidence using software change history. IEEE Trans. Softw. Eng. 26(7), 653–661 (2000) [9] Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735