Publication:
Clustering and dimensionality reduction to determine important software quality metrics

dc.contributor.authorTuran, Metin
dc.contributor.authorÇataltepe, Zehra
dc.contributor.authorID140809tr_TR
dc.contributor.authorID104999tr_TR
dc.date.accessioned2018-07-31T06:49:03Z
dc.date.available2018-07-31T06:49:03Z
dc.date.issued2007
dc.description.abstractDuring the last two decades research on software engineering is concentrated on quality. The best approach to quality evaluation goes through determining well-defined metrics on software properties. One such property is module complexity, which is a view of the software that is related to how easily it can be modified. There has been work on constructing a metrics domain which measures the module complexity. Generally, PCA (Principal Component Analysis) is used for defining principal metrics in the domain. Since there are usually no labels for the software data, an unsupervised dimensionality reduction technique, such as PCA needs to be used for determining the most important metrics. In this study, we use the clustering similarity obtained when a certain subset of metrics and when the whole set of metrics are used, to determine the most important metrics. We measure the relative difference/similarity between clusterings using three different indices, namely Rand, Jaccard and Fowlkes-Mallow. We use both backward feature selection and PCA for dimensionality reduction. On the publicly available NASA data, we find out that instead of the whole set of 42 metrics, using only 15 dimensions, we get almost the same clustering performance. Therefore, instead of the whole set of software metrics, a smaller number of them could be used to evaluate the software quality.tr_TR
dc.identifier.isbn978-1-4244-1363-8
dc.identifier.scopus2-s2.0-48649092019
dc.identifier.urihttps://hdl.handle.net/11413/2441
dc.identifier.wos256394000039
dc.language.isoen
dc.publisherIEEE, 345 E 47th St, New York, Ny 10017 USA
dc.relation2007 22nd International Symposium on Computer and Information Sciencestr_TR
dc.titleClustering and dimensionality reduction to determine important software quality metricstr_TR
dc.typeArticle
dspace.entity.typePublication
local.indexed.atWOS
local.indexed.atScopus

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: