Open Access Open Access  Restricted Access Subscription or Fee Access

Software Defect Prediction Using Gain Information Weighted

Nagy Ramdan, Ahmed Elazab

Abstract


Software defect prediction activity concerns metrics of successful of test phase in Software Development Life Cycle (SDLC), companies work on predict the number of defects in software systems, there are some issues of predicting software defects such as imbalanced dataset which contains noisy attributes or lack information about all attributes, the noisy leads to decrease significantly results, This paper proposed a model designed to reduce the number of features used in the prediction process. The model will apply the method of gain information weighted to solve pervious issues reduce the number of selection features using the Gain information weighted, prediction using most popular algorithm of machine learning, prediction using random forest algorithm reaches 88%.


Keywords


Software Quality, Random Forest, Gain Information Weighted.

Full Text:

PDF

References


Lasota, T., Londzin, B., Telec, Z., & Trawiński, B. (2014, April). Comparison of ensemble approaches: Mixture of experts and adaboost for a regression problem. In Asian Conference on Intelligent Information and Database Systems (pp. 100-109). Springer, Cham.‏

Dragnić, D. (2014). Impact of internal and external factors on the performance of fast-growing small and medium businesses. Management: journal of contemporary management issues, 19(1), 119-159.‏

E.Elsherbeiny, Hadeel & Ahmed, Abd El-Aziz & Ramadan, Nagy. (2017). Decision Support for Requirements Prioritization Using Data Analysis. Egyptian Computer Science Journal (ECS)- ISSN 1110-2586. 41.

A rora, I., Tetarwal, V., & Saha, A. (2015). Open issues in software defect prediction. Procedia Computer Science, 46, 906-912.

Berander, P., Damm, L. O., Eriksson, J., Gorschek, T., Henningsson, K., Jönsson, P., ... & Tomaszewski, P. (2005). Software quality attributes and trade-offs. Blekinge Institute of Technology.‏

Singh, S., & Chana, I. (2015). QRSF: QoS-aware resource scheduling framework in cloud computing. The Journal of Supercomputing, 71(1), 241-292.‏

Hilario, M., & Kalousis, A. (2008). Approaches to dimensionality reduction in proteomic biomarker studies. Briefings in bioinformatics, 9(2), 102-118.‏

Pisica, I., Taylor, G., & Lipan, L. (2013). Feature selection filter for classification of power system operating states. Computers & Mathematics with Applications, 66(10), 1795-1807

ElAzab, A. (2016). Fake Accounts Detection in Twitter based on Minimum Weighted Feature. World Malhotra, R. (2015).

A systematic review of machine learning techniques for software fault prediction. Applied Soft Com A

Mullen, K. C. (2015). The Effects of Professional Status, Leader Inclusiveness, and Psychological Safety on Ethical and OCB Outcomes in the Financial Services Industry (Doctoral dissertation, Anderson University

Arcuri, A., & Briand, L. (2011, May). A practical guide for using statistical tests to assess randomized algorithms in software engineering. In Software Engineering (ICSE), 2011 33rd International Conference on (pp. 1-10). IEEE.‏

Burgess, C. J., & Lefley, M. (2001). Can genetic programming improve software effort estimation? A comparative evaluation. Information and software technology, 43(14), 863-873

Fenton, N., & Bieman, J. (2014). Software metrics: a rigorous and practical approach. CRC press.‏.‏

Garg, V. K., & Bansal, R. K. (2015). Intelligent Computing Techniques for the Detection of Sleep Disorders: A Review. International Journal of Computer Applications, 110(1).‏

Uusitalo, L., Lehikoinen, A., Helle, I., & Myrberg, K. (2015). An overview of methods to evaluate uncertainty of deterministic models in decision support. Environmental Modelling & Software, 63, 24-31.‏

Tomar, D., & Agarwal, S. (2014). A survey on pre-processing and post-processing techniques in data mining. International Journal of Database Theory & Application, 7(4), 99-128.‏

Reddy, C. S., & Raju, K. V. S. V. N. (2009). A concise neural network model for estimating software effort. International Journal of Recent Trends in Engineering, 1(1), 188.‏

Pospieszny, P. (2017, October). Software estimation: towards prescriptive analytics. In Proceedings of the 27th International Workshop on Software Measurement and 12th International Conference on Software Process and Product Measurement (pp. 221-226). ACM.‏

Karim, S., Warnars, H. L. H. S., Gaol, F. L., Abdurachman, E., & Soewito, B. (2017, November). Software metrics for fault prediction using machine learning approaches: A literature review with PROMISE repository dataset. In Cybernetics and Computational Intelligence (CyberneticsCom), 2017 IEEE International Conference on (pp. 19-23). IEEE.‏

López-Martín, C. (2015). Predictive accuracy comparison between neural networks and statistical regression for development effort of software projects. Applied Soft Computing, 27, 434-449.‏

Malhotra, R. (2014). Comparative analysis of statistical and machine learning methods for predicting faulty modules. Applied Soft Computing, 21, 286-297

Abe, S., Mizuno, O., Kikuno, T., Kikuchi, N., & Hirayama, M. (2006, May). Estimation of project success using bayesian classifier. In Proceedings of the 28th international conference on Software engineering (pp. 600-603). ACM.‏


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.