Hyppää sisältöön
    • FI
    • ENG
  • FI
  • /
  • EN
OuluREPO – Oulun yliopiston julkaisuarkisto / University of Oulu repository
Näytä viite 
  •   OuluREPO etusivu
  • Oulun yliopisto
  • Avoin saatavuus
  • Näytä viite
  •   OuluREPO etusivu
  • Oulun yliopisto
  • Avoin saatavuus
  • Näytä viite
JavaScript is disabled for your browser. Some features of this site may not work without it.

Gender identification from arabic speech using machine learning

Hamdi, Skander; Moussaoui, Abdelouahab; Oussalah, Mourad; Saidi, Mohamed (2020-09-06)

 
Avaa tiedosto
nbnfi-fe2022030422007.pdf (389.7Kt)
nbnfi-fe2022030422007_meta.xml (34.07Kt)
nbnfi-fe2022030422007_solr.xml (33.59Kt)
Lataukset: 

URL:
https://doi.org/10.1007/978-3-030-58861-8_11

Hamdi, Skander
Moussaoui, Abdelouahab
Oussalah, Mourad
Saidi, Mohamed
Springer Nature
06.09.2020

Hamdi S., Moussaoui A., Oussalah M., Saidi M. (2021) Gender Identification from Arabic Speech Using Machine Learning. In: Chikhi S., Amine A., Chaoui A., Saidouni D., Kholladi M. (eds) Modelling and Implementation of Complex Systems. MISC 2020. Lecture Notes in Networks and Systems, vol 156. Springer, Cham. https://doi.org/10.1007/978-3-030-58861-8_11

https://rightsstatements.org/vocab/InC/1.0/
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2021. This is a post-peer-review, pre-copyedit version of an article published in Modelling and Implementation of Complex Systems. MISC 2020. Lecture Notes in Networks and Systems, vol 156. The final authenticated version is available online at https://doi.org/10.1007/978-3-030-58861-8_11
https://rightsstatements.org/vocab/InC/1.0/
doi:https://doi.org/10.1007/978-3-030-58861-8_11
Näytä kaikki kuvailutiedot
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022030422007
Tiivistelmä

Abstract

Speech recognition is becoming increasingly used in real-world applications. One of the interesting applications is automatic gender recognition which aims to recognize male and female voices from short speech samples. This can be useful in applications such as automatic dialogue systems, system verification, prediction of demographic attributes (e.g., age, location) and estimating person’s emotional state. This paper focuses on gender identification from the publicly available dataset Arabic Natural Audio Dataset (ANAD) using an ensemble-classifier based approach. More specifically, initially we extended the original ANAD to include a gender label information through a manual annotation task. Next, in order to optimize the feature engineering process, a three stage machine learning approach is devised. In the first phase, re restricted to features to the two widely used ones; namely, MFCC and fundamental frequency coefficients. In the second phase, six distinct acoustic features were employed. Finally, in the third phase, the features were selected according to their associated weights in Random Forest Classifier, and the best features are thereby selected. The latter approach enabled us to achieve a classification rate of 96.02% on the test set generated with linear SVM classifier.

Kokoelmat
  • Avoin saatavuus [38404]
oulurepo@oulu.fiOulun yliopiston kirjastoOuluCRISLaturiMuuntaja
SaavutettavuusselosteTietosuojailmoitusYlläpidon kirjautuminen
 

Selaa kokoelmaa

NimekkeetTekijätJulkaisuajatAsiasanatUusimmatSivukartta

Omat tiedot

Kirjaudu sisäänRekisteröidy
oulurepo@oulu.fiOulun yliopiston kirjastoOuluCRISLaturiMuuntaja
SaavutettavuusselosteTietosuojailmoitusYlläpidon kirjautuminen