Decoding Marathi Emotions: Enhanced Speech Emotion Recognition via Deep Belief Network-SVM Integration
DOI:
https://doi.org/10.6977/IJoSI.202508_9(4).0006Abstract
SER in Marathi presents considerable hurdles due to the language's distinct grammatical and emotional characteristics. This paper presents a robust methodology for classifying emotions in Marathi speech utilizing advanced signal processing, feature extraction, and machine learning techniques. The method entails collecting a diverse collection of Marathi speech samples and using pre-processing steps such as Pre-Emphasis and VAD to improve signal quality. Speech signals are segmented using the Hamming window to reduce discontinuities, and features such as MFCCs, pitch, intensity, and spectral properties are retrieved. For classification, an attentive DBN is paired with an SVM, which uses attention techniques and batch normalization to improve performance and reduce overfitting. The suggested approach surpasses existing models, with 98% accuracy, 98% F1-Score, 99% specificity, 99% sensitivity, 98% precision, and 98% recall.
Downloads
Published
Issue
Section
License
Copyright in a work is a bundle of rights. IJoSI's, copyright covers what may be done with the work in terms of making copies, making derivative works, abstracting parts of it for citation or quotation elsewhere and so on. IJoSI requires authors to sign over rights when their article is ready for publication so that the publisher from then on owns the work. Until that point, all rights belong to the creator(s) of the work. The format of IJoSI copy right form can be found at the IJoSI web site.The issues of International Journal of Systematic Innovation (IJoSI) are published in electronic format and in print. Our website, journal papers, and manuscripts etc. are stored on one server. Readers can have free online access to our journal papers. Authors transfer copyright to the publisher as part of a journal publishing agreement, but have the right to:
1. Share their article for personal use, internal institutional use and scholarly sharing purposes, with a DOI link to the version of record on our server.
2. Retain patent, trademark and other intellectual property rights (including research data).
3. Proper attribution and credit for the published work.