Browsing by Author "Diri, Banu"

Now showing 1 - 5 of 5

A fault detection strategy for software projects
(Univ Osijek, Tech Fac, Trg ivane Brlic-Mazuranic 2, Slavonski Brod, Hr-35000, Croatia, 2013-02) Çatal, Çağatay; Diri, Banu; 108363; 25308
Abstract The existing software fault prediction models require metrics and fault data belonging to previous software versions or similar software projects. However, there are cases when previous fault data are not present, such as a software company's transition to a new project domain. In this kind of situations, supervised learning methods using fault labels cannot be applied, leading to the need for new techniques. We proposed a software fault prediction strategy using method-level metrics thresholds to predict the fault-proneness of unlabelled program modules. This technique was experimentally evaluated on NASA datasets, KC2 and JM1. Some existing approaches implement several clustering techniques to cluster modules, process followed by an evaluation phase. This evaluation is performed by a software quality expert, who analyses every representative of each cluster and then labels the modules as fault-prone or not fault-prone. Our approach does not require a human expert during the prediction process. It is a fault prediction strategy, which combines a method-level metrics thresholds as filtering mechanism and an OR operator as a composition mechanism.
Detection of Phishing Websites by Using Machine Learning-Based URL Analysis
(Institute of Electrical and Electronics Engineers Inc., 2020) Korkmaz, Mehmet; ŞAHİNGÖZ, ÖZGÜR KORAY; Diri, Banu
In recent years, with the increasing use of mobile devices, there is a growing trend to move almost all real-world operations to the cyberworld. Although this makes easy our daily lives, it also brings many security breaches due to the anonymous structure of the Internet. Used antivirus programs and firewall systems can prevent most of the attacks. However, experienced attackers target on the weakness of the computer users by trying to phish them with bogus webpages. These pages imitate some popular banking, social media, e-commerce, etc. sites to steal some sensitive information such as, user-ids, passwords, bank account, credit card numbers, etc. Phishing detection is a challenging problem, and many different solutions are proposed in the market as a blacklist, rule-based detection, anomaly-based detection, etc. In the literature, it is seen that current works tend on the use of machine learning-based anomaly detection due to its dynamic structure, especially for catching the 'zero-day' attacks. In this paper, we proposed a machine learning-based phishing detection system by using eight different algorithms to analyze the URLs, and three different datasets to compare the results with other works. The experimental results depict that the proposed models have an outstanding performance with a success rate.
An Evolutionary Approach to Multiple Traveling Salesman Problem for Efficient Distribution of Pharmaceutical Products
(Institute of Electrical and Electronics Engineers Inc., 2020) Koçyiğit, Emre; ŞAHİNGÖZ, ÖZGÜR KORAY; Diri, Banu
Considerable growth of computer science has created novel solutions for variable problem fields and has increased the efficiency of available solutions. Evolutionary algorithms are quite successful in dealing with real-world problems that require optimization. In this article, we implemented a Genetic Algorithm that is well known evolutionary algorithm in order to provide an efficient solution for the Distribution of Pharmaceutical Products, which is a vital optimization problem, especially in situations such as a pandemic. The Multiple Traveling Salesman Problem approach was used to distribute pharmaceutical products as soon as possible. Moreover, we strengthened our proposal algorithm with 2-Opt Algorithm to get optimal results in earlier iterations. Different datasets from a library were applied to measure the quality of solutions and computation time. At the end of the work, we observed that our proposed algorithm generates successful solutions in an acceptable running time. This study will be extended with a new mutation concept as future work.
Feature Selections for the Classification of Webpages to Detect Phishing Attacks: A Survey
(IEEE, 2020) Korkmaz, Mehmet; ŞAHİNGÖZ, ÖZGÜR KORAY; Diri, Banu
In recent years, due to the increased number of Internet-connected devices, almost all the real-world interactions are transferred to the cyberworld. Therefore, most of the commerce (especially in the e-commerce format) are executed over webpages. The anonymous and uncontrollable structure of Internet, enables the malicious use of this cyber environment for a relatively new crime format, named as e-crime, which mainly aims some illegal financial gain by cheating the standard end-users. Phishing attacks are one of the most preferred fraudulent technique which is used for getting some confidential information (like user-id, password, credit card information, etc.) of the end-users. Therefore, security admins of the networks try to decrease the number of victims is their companies. One principal protection mechanism is the use of blacklists to detect the phishing webpages. However, it has a significant deficiency in not protection about new page attacks. Most of the security admins use some learning systems which are trained by a pre-collected a-dataset by extracting some features from the URL and content of the web pages. The performance of the used system directly related with the features used for the classification. In this work, we aimed to analyze the previously used features in the classification of the web pages by making a comparative analysis about the literature. With this study, it is aimed to produce a general survey resource for the researchers, which aim to work on the classification of webpages or the security of the networks.
Machine Learning Based Phishing Detection from URIs
(2017-12) Buber, Ebubekir; Demir, Önder; Diri, Banu; ŞAHİNGÖZ, ÖZGÜR KORAY; 214903
Due to the rapid growth of the Internet, users change their preference from traditional shopping to the electronic commerce. Instead of bank/shop robbery, nowadays, criminals try to find their victims in the cyberspace with some specific tricks. By using the anonymous structure of the Internet, attackers set out new techniques, such as phishing, to deceive victims with the use of false websites to collect their sensitive information such as account IDs, usernames, passwords, etc. Understanding whether a web page is legitimate or phishing is a very challenging problem, due to its semantics-based attack struc ture, which mainly exploits the computer users’ vulnerabilities. Although software companies launch new anti-phishing products, which use blacklists, heuristics, visual and machine learning-based approaches, these products cannot prevent all of the phishing attacks. In this paper, a real-time anti-phishing system, which uses seven different classification algorithms and natural language processing (NLP) based features, is proposed. The system has the following distinguishing properties from other studies in the literature: language independence, use of a huge size of phishing and legitimate data, real-time execution, detection of new websites, independence from third-party services and use of feature-rich classifiers. For mea suring the performance of the system, a new dataset is constructed, and the experimental results are tested on it. According to the experimental and comparative results from the implemented classification algorithms, Random Forest algorithm with only NLP based features gives the best performance with the 97.98% accuracy rate for detection of phishing URLs.