Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms

AYRANCI, AHMET AYTUĞ; Atay, Sergen; Yıldırım, Tülay

dc.contributor.author	AYRANCI, AHMET AYTUĞ
dc.contributor.author	Atay, Sergen
dc.contributor.author	Yıldırım, Tülay
dc.date.accessioned	2023-01-03T11:20:19Z
dc.date.available	2023-01-03T11:20:19Z
dc.date.issued	2021
dc.description.abstract	Speech and speaker recognition systems aim to analyze the parametric information contained in the human voice and to recognize it with the highest possible accuracy. One of the most important features of the audio signal for successful speaker recognition is the speaker's accent. Speaker accent recognition systems are based on the analysis of patterns such as the speaker's manner of speaking and word choice. In this study, data obtained with the MFCC feature extraction technique from the voice signals of 367 speakers with 7 different accents were used. The data of 330 speakers were taken from the "Speaker Accent Recognition" data set in the UC Irvine Machine Learning (ML) open data source. The data of the other 37 speakers were obtained by applying the MFCC feature extraction technique to the voice recordings in the "Speaker Accent Archive" data set created by George Mason University. Nine ML classification algorithms were used in the designed speaker accent recognition system. In addition, the k-fold cross-validation technique was used so that the models were tested on data independent of the training data. In this way, the performance of the ML algorithms is shown when the data set is divided into k parts. Information about the classification algorithms used in the designed system and the hyperparameter optimization applied to them is also given. The performance of the classification algorithms is reported using performance metrics.	en
dc.identifier	33
dc.identifier.citation	AYRANCI A, ATAY S, YILDIRIM T (2021). Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International journal of advances in engineering and pure sciences (Online), 33(0), 17 - 27. 10.7240/jeps.896427
dc.identifier.eissn	2636-8277
dc.identifier.uri	https://doi.org/10.7240/jeps.896427
dc.identifier.uri	https://hdl.handle.net/11413/8160
dc.language.iso	en
dc.publisher	Marmara Üniversitesi, Fen Bilimleri Enstitüsü
dc.relation.journal	International journal of advances in engineering and pure sciences (Online)
dc.rights	info:eu-repo/semantics/openAccess
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Mel-frequency Cepstral Coefficients
dc.subject	Machine Learning
dc.subject	Speaker Accent Recognition
dc.subject	Feature Extraction
dc.title	Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms	en
dc.title.alternative	MFCC Öznitelik Çıkarım Tekniği ve Makine Öğrenmesi Algoritmaları Kullanılarak Konuşmacı Aksanı Tanıma	tr
dc.type	Article
dspace.entity.type	Publication
local.indexed.at	TrDizin
local.journal.endpage	27
local.journal.startpage	17
relation.isAuthorOfPublication	7c09e543-ec43-49e3-abab-f07ef9883a06
relation.isAuthorOfPublication.latestForDiscovery	7c09e543-ec43-49e3-abab-f07ef9883a06

Files

Original bundle

Name: Tam Metin/Full Text
Size: 3.59 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.82 KB
Format: Item-specific license agreed upon to submission
Collections

TRDizin İndeksli Yayınlar / TRDizin Indexed Publications
Elektrik-Elektronik Mühendisliği Bölümü / Department of Electrical and Electronics Engineering
