Serviceeinschränkungen vom 12.-22.02.2026 - weitere Infos auf der UB-Homepage

Treffer: A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Title:
A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.
Authors:
Hu Q; Faculty of Humanities and Arts, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, 999078, China., Peng Y; Faculty of Innovation Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, 999078, China. 3230002514@student.must.edu.mo., Zheng Z; Faculty of Humanities and Arts, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, 999078, China.
Source:
Scientific reports [Sci Rep] 2025 Aug 05; Vol. 15 (1), pp. 28569. Date of Electronic Publication: 2025 Aug 05.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Nature Publishing Group Country of Publication: England NLM ID: 101563288 Publication Model: Electronic Cited Medium: Internet ISSN: 2045-2322 (Electronic) Linking ISSN: 20452322 NLM ISO Abbreviation: Sci Rep Subsets: MEDLINE
Imprint Name(s):
Original Publication: London : Nature Publishing Group, copyright 2011-
References:
Accid Anal Prev. 2020 Apr;138:105468. (PMID: 32065912)
Clin Gastroenterol Hepatol. 2023 Jan;21(1):8-10. (PMID: 36113552)
J Electrocardiol. 2020 Mar - Apr;59:151-157. (PMID: 32146201)
Acad Radiol. 2020 Jan;27(1):13-25. (PMID: 31818379)
Inform Med Unlocked. 2020;18:. (PMID: 32864421)
Indian J Ophthalmol. 2020 Feb;68(2):398-405. (PMID: 31957737)
Lancet Digit Health. 2020 Mar;2(3):e138-e148. (PMID: 33334578)
J Am Coll Cardiol. 2022 Aug 9;80(6):613-626. (PMID: 35926935)
Gastroenterology. 2020 Jan;158(1):76-94.e2. (PMID: 31593701)
Neural Netw. 2017 Aug;92:60-68. (PMID: 28396068)
Contributed Indexing:
Keywords: Artificial intelligence; Cloning algorithm; Neural network; Robotic intelligence; Systematic emotions
Entry Date(s):
Date Created: 20250805 Date Completed: 20250805 Latest Revision: 20250808
Update Code:
20250808
PubMed Central ID:
PMC12325928
DOI:
10.1038/s41598-025-14016-w
PMID:
40764384
Database:
MEDLINE

Weitere Informationen

Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human language naturally. Speech processing, as a subset of artificial intelligence, is rapidly expanding due to its applications in emotion recognition, human-computer interaction, and sentiment analysis. This study introduces a novel algorithm for emotion recognition from speech using deep learning techniques. The proposed model achieves up to a 15% improvement compared to state-of-the-art deep learning methods in speech emotion recognition. It employs advanced supervised learning algorithms and deep neural network architectures, including Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. These models are trained on labeled datasets to accurately classify emotions such as happiness, sadness, anger, fear, surprise, and neutrality. The research highlights the system's real-time application potential, such as analyzing audience emotional responses during live television broadcasts. By leveraging advancements in deep learning, the model achieves high accuracy in understanding and predicting emotional states, offering valuable insights into user behavior. This approach contributes to diverse domains, including media analysis, customer feedback systems, and human-machine interaction, showcasing the transformative potential of combining speech processing with neural networks.
(© 2025. The Author(s).)

Declarations. Competing interests: The authors declare no competing interests.