A Comparative Study on Similarity Measure Techniques for Cross-Project Defect Prediction

KIPS Transactions on Software and Data Engineering, Vol. 7, No.6, pp.205-220, June 2018
10.3745/KTSDE.2018.7.6.205, Full Text

Abstract

Software defect prediction is helpful for allocating valuable project resources effectively for software quality assurance activities thanks to focusing on the identified fault-prone modules. If historical data collected within a company is sufficient, a Within-Project Defect Prediction (WPDP) can be utilized for accurate fault-prone module prediction. In case a company does not maintain historical data, it may be helpful to build a classifier towards predicting comprehensible fault prediction based on Cross-Project Defect Prediction (CPDP). Since CPDP employs different project data collected from other organization to build a classifier, the main obstacle to build an accurate classifier is that distributions between source and target projects are not similar. To address the problem, because it is crucial to identify effective similarity measure techniques to obtain high performance for CPDP, In this paper, we aim to identify them. We compare various similarity measure techniques. The effectiveness of similarity weights calculated by those similarity measure techniques are evaluated. The results are verified using the statistical significance test and the effect size test. The results show k-Nearest Neighbor (k-NN), LOcal Correlation Integral (LOCI), and Range methods are the top three performers. The experimental results show that predictive performances using the three methods are comparable to those of WPDP.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from October 15, 2016)

Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


Cite this paper

[KIPS Transactions Style]
D. Ryu and J. Baik, "A Comparative Study on Similarity Measure Techniques for Cross-Project Defect Prediction," KIPS Transactions on Software and Data Engineering, Vol.7, No.6, pp.205-220, 2018, DOI: 10.3745/KTSDE.2018.7.6.205.

[IEEE Style]
Duksan Ryu and Jongmoon Baik, "A Comparative Study on Similarity Measure Techniques for Cross-Project Defect Prediction," KIPS Transactions on Software and Data Engineering, vol. 7, no. 6, pp. 205-220, 2018. DOI: 10.3745/KTSDE.2018.7.6.205.

[ACM Style]
Ryu, D. and Baik, J. 2018. A Comparative Study on Similarity Measure Techniques for Cross-Project Defect Prediction. KIPS Transactions on Software and Data Engineering, 7, 6, (2018), 205-220. DOI: 10.3745/KTSDE.2018.7.6.205.