This is an Open Access publication published under CSC-OpenAccess Policy.
Software Defect Trend Forecasting In Open Source Projects using A Univariate ARIMA Model and FBProphet
Michael Thomas Shrove, Emil Jovanov
Pages: 1-15 | Revised: 31-03-2020 | Published: 30-04-2020
Volume: 8 | Issue: 1 | Publication Date: April 2020
KEYWORDS
Software Engineering, Software Defects, Time Series Forecasting, ARIMA, FBProphet.
ABSTRACT
Our objective in this research is to provide a framework that gives project managers, business owners, and developers an effective way to forecast the trend in software defects within a software project in real time. With a mechanism for forecasting defects, these stakeholders can allocate the necessary resources at the right time to remove defects before they accumulate and ultimately lead to software failure. In our research, we show not only general trends in several open-source projects but also trends in daily, monthly, and yearly activity. Our results show that this forecasting method can be used up to 6 months out with an MSE of only 0.019. In this paper, we present our technique and methodology for developing the inputs to the proposed model and the results of testing on seven open-source projects. Further, we discuss the prediction models, their performance, and the implementation using the FBProphet framework and the ARIMA model.
Mr. Michael Thomas Shrove
Millennium Corporation, Huntsville - United States of America
tshrove@gmail.com
Dr. Emil Jovanov
ECE Department, University of Alabama, Huntsville - United States of America