Publication:
Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms

dc.contributor.authorAYRANCI, AHMET AYTUĞ
dc.contributor.authorAtay, Sergen
dc.contributor.authorYıldırım, Tülay
dc.date.accessioned2023-01-03T11:20:19Z
dc.date.available2023-01-03T11:20:19Z
dc.date.issued2021
dc.description.abstractSpeech and speaker recognition systems aim to analyze parametric information contained in the human voice and recognize it at the highest possible rate. One of the most important features in the audio signal for the speaker to be recognized successfully by the system is the speaker's accent. Speaker accent recognition systems are based on the analysis of patterns such as the way the speaker speaks and the word choice he uses while speaking. In this study, the data obtained by the MFCC feature extraction technique from voice signals of 367 speakers with 7 different accents were used. The data of 330 speakers in the data set were taken from the "Speaker Accent Recognition" data set in the UC Irvine Machine Learning (ML) open data source. The data of the other 37 speakers were obtained by converting the voice recordings in the "Speaker Accent Archive" data set created by George Mason University into data using the MFCC feature extraction technique. 9 ML classification algorithms were used for the designed speaker accent recognition system. Also, the k-fold cross-validation technique was used to test the data set independently. In this way, the performance of ML algorithms is shown when the data set is divided into a k number of parts. Information about the classification algorithms used in the designed system and the hyperparameter optimizations made in these algorithms are also given. The success performances of the classification algorithms are shown with performance metrics.en
dc.identifier33
dc.identifier.citationAYRANCI A, ATAY S, YILDIRIM T (2021). Speaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithms. International journal of advances in engineering and pure sciences (Online), 33(0), 17 - 27. 10.7240/jeps.896427
dc.identifier.eissn2636-8277
dc.identifier.urihttps://doi.org/10.7240/jeps.896427
dc.identifier.urihttps://hdl.handle.net/11413/8160
dc.language.isoen
dc.publisherMarmara Üniversitesi, Fen Bilimleri Enstitüsü
dc.relation.journalInternational journal of advances in engineering and pure sciences (Online)
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectMel-frequency Cepstral Coefficients
dc.subjectMachine Learning
dc.subjectSpeaker Accent Recognition
dc.subjectFeature Extraction
dc.titleSpeaker Accent Recognition Using MFCC Feature Extraction and Machine Learning Algorithmsen
dc.title.alternativeMFCC Öznitelik Çıkarım Tekniği ve Makine Öğrenmesi Algoritmaları Kullanılarak Konuşmacı Aksanı Tanımatr
dc.typeArticle
dspace.entity.typePublication
local.indexed.attrdizin
local.journal.endpage27
local.journal.startpage17
relation.isAuthorOfPublication7c09e543-ec43-49e3-abab-f07ef9883a06
relation.isAuthorOfPublication.latestForDiscovery7c09e543-ec43-49e3-abab-f07ef9883a06

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tam Metin/Full Text
Size:
3.59 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.82 KB
Format:
Item-specific license agreed upon to submission
Description: