Uncovering Linguistic Patterns: Exploring Ensemble Learning and Low-Level Features for Identifying Spoken Arabic, English, Spanish, and German
Hamdi, Skander; Moussaoui, Abdelouahab; Chabane, Mafaza; Laouarem, Ayoub; Berrimi, Mohamed; Oussalah, Mourad (2023-11-22)
Hamdi, Skander
Moussaoui, Abdelouahab
Chabane, Mafaza
Laouarem, Ayoub
Berrimi, Mohamed
Oussalah, Mourad
IEEE
22.11.2023
S. Hamdi, A. Moussaoui, M. Chabane, A. Laouarem, M. Berrimi and M. Oussalah, "Uncovering Linguistic Patterns: Exploring Ensemble Learning and Low-Level Features for Identifying Spoken Arabic, English, Spanish, and German," 2023 5th International Conference on Pattern Analysis and Intelligent Systems (PAIS), Sétif, Algeria, 2023, pp. 1-8, doi: 10.1109/PAIS60821.2023.10322032.
https://rightsstatements.org/vocab/InC/1.0/
© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
https://rightsstatements.org/vocab/InC/1.0/
© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
https://rightsstatements.org/vocab/InC/1.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202501031045
https://urn.fi/URN:NBN:fi:oulu-202501031045
Tiivistelmä
Abstract
This paper presents a novel approach to spoken language identification in Arabic, English, Spanish, and German languages using ensemble learning techniques. The study compares the performance of two well-known ensemble learning algorithms (Random Forest and XGBoost). Next, a Stacking Ensemble method with Logistic Regression is used as a meta-model, which combines the predictions of two configurations of Random Forest and two other configurations of XGBoost. We explore the utilization of the audio Low-Level Descriptors features, which have previously received limited attention in spoken language identification. Experimental results demonstrate the effectiveness of the ensemble learning algorithms, achieving high accuracy in accurately classifying spoken languages. Notably, the Stacking Ensemble method showcases its ability to reduce misclassification rate, emphasizing its potential for performance improvement. The stacking Ensemble method achieved the highest recorded classification rate of 97.22%, outperforming individual methods.
This paper presents a novel approach to spoken language identification in Arabic, English, Spanish, and German languages using ensemble learning techniques. The study compares the performance of two well-known ensemble learning algorithms (Random Forest and XGBoost). Next, a Stacking Ensemble method with Logistic Regression is used as a meta-model, which combines the predictions of two configurations of Random Forest and two other configurations of XGBoost. We explore the utilization of the audio Low-Level Descriptors features, which have previously received limited attention in spoken language identification. Experimental results demonstrate the effectiveness of the ensemble learning algorithms, achieving high accuracy in accurately classifying spoken languages. Notably, the Stacking Ensemble method showcases its ability to reduce misclassification rate, emphasizing its potential for performance improvement. The stacking Ensemble method achieved the highest recorded classification rate of 97.22%, outperforming individual methods.
Kokoelmat
- Avoin saatavuus [38824]