Publication:
Deep Learning-Based Classification of Software Bugs Using Code Context and AST Features

dc.contributor.authorGökcen, Alpaslan
dc.contributor.authorARŞIK, ARDA
dc.contributor.authorAKBULUT, AKHAN
dc.contributor.authorÇatal, Çağatay
dc.date.accessioned2025-11-11T11:26:08Z
dc.date.issued2025
dc.description.abstractA software component plagued with bugs is likely to experience both functional and non-functional difficulties, such as usability, performance, and security issues. These bugs can range from improper layouts to system crashes and security vulnerabilities. In this research, we developed a deep learning-based model to classify software bugs based on preceding and corrected code statements. Preprocessing includes extracting features from the Abstract Syntax Tree (AST) by traversing the tree to capture node types and relationships, then encoding the AST into a numerical vector representation for classification. Using AST and padding methods, features are extracted from code statements for use in a Convolutional Neural Network (CNN) model. We proposed a CNN-based model with AST-based feature extraction for large-scale software bug classification and evaluated on 153,652 bug samples from 1,000 GitHub projects. Compared to traditional datasets, this large dataset presents challenges in building accurate prediction models. This model is particularly useful in continuous integration (CI) pipelines, where it can automatically detect problematic code during the build process, helping to identify bugs faster and reduce manual review.en
dc.identifier.citationGokcen, A., Arsik, A., Akbulut, A., & Catal, C. (2025, April). Deep Learning-Based Classification of Software Bugs Using Code Context and AST Features. In 2025 13th International Symposium on Digital Forensics and Security (ISDFS) (pp. 1-5). IEEE.
dc.identifier.isbn979-833150993-4
dc.identifier.scopus2-s2.0-105008499038
dc.identifier.urihttps://doi.org/10.15672/hujms.1386151
dc.identifier.urihttps://hdl.handle.net/11413/9707
dc.language.isoen
dc.publisherIEEE
dc.relation.journalISDFS 2025 - 13th International Symposium on Digital Forensics and Security
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.subjectAbstract Syntax Tree
dc.subjectBug Classification
dc.subjectConvolutional Neural Networks
dc.subjectDefect Prediction
dc.subjectFault Prediction
dc.titleDeep Learning-Based Classification of Software Bugs Using Code Context and AST Features
dc.title.alternativeInternational Symposium on Digital Forensics and Security
dc.typeconferenceObject
dspace.entity.typePublication
local.indexed.atScopus
local.journal.endpage5
local.journal.startpage1
relation.isAuthorOfPublication12591ace-1ce4-4de6-baa9-9d7a19f7a44e
relation.isAuthorOfPublication6ee0b32b-faed-495d-ac4d-8a263d1ff889
relation.isAuthorOfPublication.latestForDiscovery12591ace-1ce4-4de6-baa9-9d7a19f7a44e

Files

Original bundle

Now showing 1 - 1 of 1
Placeholder
Name:
Tam Metin/Full Text
Size:
814.97 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.81 KB
Format:
Item-specific license agreed upon to submission
Description: