Publication: Automatic Software Categorization Using Ensemble Methods and Bytecode Analysis
dc.contributor.author | Çatal, Çağatay | |
dc.contributor.author | Tugul, Serkan | |
dc.contributor.author | Akpınar, Başar | |
dc.contributor.authorID | 108363 | tr_TR |
dc.date.accessioned | 2018-07-20T13:01:04Z | |
dc.date.available | 2018-07-20T13:01:04Z | |
dc.date.issued | 2017-09 | |
dc.description.abstract | Software repositories consist of thousands of applications and the manual categorization of these applications into domain categories is very expensive and time-consuming. In this study, we investigate the use of an ensemble of classifiers approach to solve the automatic software categorization problem when the source code is not available. Therefore, we used three data sets (package level/class level/method level) that belong to 745 closed-source Java applications from the Sharejar repository. We applied the Vote algorithm, AdaBoost, and Bagging ensemble methods and the base classifiers were Support Vector Machines, Naive Bayes, J48, IBk, and Random Forests. The best performance was achieved when the Vote algorithm was used. The base classifiers of the Vote algorithm were AdaBoost with J48, AdaBoost with Random Forest, and Random Forest algorithms. We showed that the Vote approach with method attributes provides the best performance for automatic software categorization; these results demonstrate that the proposed approach can effectively categorize applications into domain categories in the absence of source code. | tr_TR |
dc.identifier.issn | 0218-1940 | |
dc.identifier.other | 1793-6403 | |
dc.identifier.scopus | 2-s2.0-85029663055 | |
dc.identifier.uri | https://doi.org/10.1142/S0218194017500425 | |
dc.identifier.uri | https://hdl.handle.net/11413/2233 | |
dc.identifier.wos | 411338700006 | |
dc.language.iso | en | |
dc.publisher | World Scientific Publ Co Pte Ltd, 5 Toh Tuck Link, Singapore 596224, Singapore | |
dc.relation | International Journal of Software Engineering and Knowledge Engineering | tr_TR |
dc.subject | Software categorization | tr_TR |
dc.subject | machine learning | tr_TR |
dc.subject | software repository | tr_TR |
dc.subject | bytecode | tr_TR |
dc.title | Automatic Software Categorization Using Ensemble Methods and Bytecode Analysis | tr_TR |
dc.type | Article | |
dspace.entity.type | Publication | |
local.indexed.at | WOS | |
local.indexed.at | Scopus |
Files
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: