Abstract
Emotion identification from textual data is crucial for understanding human emotions in applications such as user behaviour analysis, targeted content delivery, and mental health monitoring. Even though English has advanced significantly, emotion detection in Hindi, a widely spoken language lacking enough resources, is still tricky because of data, grammatical complexity, and NLP tools. This study presents a deep neural network (DNN)-based framework for emotion classification in Hindi sentences. Our approach includes Hindi-specific preprocessing using the iNLTK library, comparative evaluation of multiple encoding techniques (Bag of Words, TF-IDF, Word2Vec), and training a robust DNN model to classify text into five emotion classes: joy, sadness, anger, suspense, and neutral. Experimental results on the BHAV dataset demonstrate that our DNN model achieves a balanced accuracy of 94.91%, outperforming traditional classifiers such as Naive Bayes, SVC, Logistic Regression, Decision Trees, and Boosted Trees. The confusion matrix and training-validation curves confirm the model's generalization capabilities and minimal overfitting. Our outcomes underscore the significance of deep learning in low-resource language settings and set the groundwork for future improvements in multimodal emotion detection, code-mixed data handling, and deployment in real-time Hindi NLP applications.
References
- Seshakagari, Haranadha Reddy Busireddy, et al. "Dynamic financial sentiment analysis and market forecasting through large language models." International Journal of Human Computations & Intelligence 4.1 (2025): 397-410.
- M. Zhou, J. Ju, W. Yuan, L. Liu, and Y. Feng, “Exploring the roles of informational and emotional language in online government interactions to promote citizens’ continuous participation,” Public Performance & Management Review, vol. 48, no. 2, pp. 468–496, 2025.
- Reddy, Busireddy Seshakagari Haranadha, R. Venkatramana, and L. Jayasree. "Enhancing apple fruit quality detection with augmented YOLOv3 deep learning algorithm." International Journal of Human Computations & Intelligence 4.1 (2025): 386-396..
- Z. Liu, L. Qian, Q. Xie, J. Huang, K. Yang, and S. Ananiadou, “Mmaffben: A multilingual and multimodal affective analysis benchmark for evaluating llms and vlms,” arXiv preprint arXiv:2505.24423, 2025.
- P. Pakray, A. Gelbukh, and S. Bandyopadhyay, “Natural language processing applications for low-resource languages,” Natural Language Processing, vol. 31, no. 2, pp. 183–197, 2025.
- R. Mahajan, A. S. More, and U. Shah, “Navigating emotion in code-mixed languages: Performance of ml and dl models on hindi-english text,” Procedia Computer Science, vol. 258, pp. 4029–4037, 2025.
- M. Balipa, K. Anwaya, M. Murugappan et al., “A rule-basedmachine translation framework for low-resource language pairs,” in 2025 4th International Conference on Sentiment Analysis and Deep Learning (ICSADL). IEEE, 2025, pp. 969–974.
- R. Raja and A. Vats, “Parallel corpora for machine translation in low-resource indic languages: A comprehensive review,” arXiv preprint arXiv:2503.04797, 2025.
- T. Manshu and W. Bing, “Adding prior knowledge in hierarchical attention neural network for cross domain sentiment classification,” IEEE Access, vol. 7, pp. 32 578–32 588, 2019.
- Y.-H. Huang, S.-R. Lee, M.-Y. Ma, Y.-H. Chen, Y.-W. Yu, and Y.-S. Chen, “Emotionx-idea: Emotion bert–an affectional model for conversation,” arXiv preprint arXiv:1908.06264, 2019.
- S.-Y. Chen, C.-C. Hsu, C.-C. Kuo, L.-W. Ku et al., “Emotionlines: An emotion corpus of multi-party conversations,” arXiv preprint arXiv:1802.08379, 2018.
- M. Polignano, P. Basile, M. de Gemmis, and G. Semeraro, “A comparison of word-embeddings in emotion detection from text using bilstm, cnn and self-attention,” in Adjunct publication of the 27th conference on user modeling, adaptation and personalization, 2019, pp. 63–68.
- S. Al-Azani and E.-S. M. El-Alfy, “Enhanced video analytics for sentiment analysis based on fusing textual, auditory and visual information,” IEEE Access, vol. 8, pp. 136 843–136 857, 2020.
- J. Wang, L.-C. Yu, K. R. Lai, and X. Zhang, “Tree-structured regional cnn-lstm model for dimensional sentiment analysis,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 581–591, 2019.
- B. Zhang, X. Li, X. Xu, K.-C. Leung, Z. Chen, and Y. Ye, “Knowledge guided capsule attention network for aspect-based sentiment analysis,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 2538–2551, 2020.
- Z. Lu, L. Cao, Y. Zhang, C.-C. Chiu, and J. Fan, “Speech sentiment analysis via pre-trained features from end-to-end asr models,” in ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020, pp. 7149–7153.
- F. Yin, Y. Wang, J. Liu, and L. Lin, “The construction of sentiment lexicon based on context-dependent part-of-speech chunks for semantic disambiguation,” IEEE Access, vol. 8, pp. 63 359–63 367, 2020.
- Z. Ahmad, R. Jindal, A. Ekbal, and P. Bhattachharyya, “Borrow from rich cousin: transfer learning for emotion detection using cross lingual embedding,” Expert Systems with Applications, vol. 139, p. 112851, 2020.
- M. Suhasini and B. Srinivasu, “Emotion detection framework for twitter data using supervised classifiers,” in Data Engineering and Communication Technology: Proceedings of 3rd ICDECT-2K19. Springer, 2020, pp. 565–576.
- D. Seal, U. K. Roy, and R. Basak, “Sentence-level emotion detection from text based on semantic rules,” in Information and Communication Technology for Sustainable Development: Proceedings of ICT4SD 2018. Springer, 2020, pp. 423–430.
- S. Seo, S. Na, and J. Kim, “Hmtl: Heterogeneous modality transfer learning for audio-visual sentiment analysis,” IEEE Access, vol. 8, pp. 140 426–140 437, 2020.
- T. Kumar, M. Mahrishi, and G. Sharma, “Emotion recognition in hindi text using multilingual bert transformer,” Multimedia Tools and Applications, vol. 82, no. 27, pp. 42 373–42 394, 2023.
- N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “Smote: synthetic minority over-sampling technique,” Journal of artificial intelligence research, vol. 16, pp. 321–357, 2002.