Sentiment Analysis of Balochi Text Using Deep Learning

Authors

DOI:

https://doi.org/10.21015/vtcs.v13i1.2081

Abstract

Balochi is a low-resource language with limited available data for computational modelling. This study aims to perform sentiment analysis on Balochi text using machine learning techniques. To address the scarcity of linguistic data, we contribute a large, newly constructed dataset of Balochi text. Our proposed model incorporates feature extraction and data augmentation within deep learning algorithms to classify sentiments as positive, negative, or neutral. We evaluate both traditional machine learning methods—such as Random Forest and Support Vector Machine (SVM)—and advanced deep learning models, including Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU).The experimental evidence proves that LSTM and GRU are more effective than traditional methods, and the accuracy rates of sentiment classification with their help are 83.57% and 81.23%, respectively. It has been experimentally verified that, when it comes to the Balochi sentiment analysis, deep learning methods can be more effective than the traditional ones.

References

A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. J. Passonneau, “Sentiment analysis of Twitter data,” Proc. Workshop on Language in Social Media (LSM 2011), pp. 30–38, 2011.

X. Li, L. Bing, W. Zhang, and W. Lam, “Exploiting BERT for end-to-end aspect-based sentiment analysis,” arXiv preprint arXiv:1910.00883, 2019.

C. Jahani, A Grammar of Modern Standard Balochi, Acta Universitatis Upsaliensis, 2019.

K. Melcher, “A friendly introduction to [deep] neural networks,” 2021.

S. Minaee, N. Kalchbrenner, E. Cambria, N. Nikzad, M. Chenaghlu, and J. Gao, “Deep learning–based text classification: A comprehensive review,” ACM Comput. Surv., vol. 54, no. 3, pp. 1–40, 2021.

R. Joshi, P. Goel, and R. Joshi, “Deep learning for Hindi text classification: A comparison,” in Proc. Intelligent Human Computer Interaction (IHCI), 2019, pp. 94–101, Springer, 2020.

X. Hu and R. Zhang, “Text classification based on machine learning,” in 2022 IEEE Int. Conf. Artif. Intell. Comput. Appl. (ICAICA), pp. 911–916, IEEE, 2022.

J. F. Ruma, S. Akter, J. Laboni, and R. M. Rahman, “A deep learning classification model for Persian Hafez poetry based on the poet’s era,” Decis. Anal. J., vol. 4, 2022.

A. Hawalah, “Semantic ontology-based approach to enhance Arabic text classification,” Big Data Cogn. Comput., 2019.

M. A. Dootio and A. I. Wagan, “Development of Sindhi text corpus,” J. King Saud Univ. Comput. Inf. Sci., vol. 33, no. 4, pp. 468–475, 2021.

M. Heikal, M. Torki, and N. El-Makky, “Sentiment analysis of Arabic tweets using deep learning,” Procedia Comput. Sci., vol. 142, pp. 114–122, 2018.

A. M. Alayba, V. Palade, M. England, and R. Iqbal, “A combined CNN and LSTM model for Arabic sentiment analysis,” in Proc. CD-MAKE 2018, pp. 179–191, Springer, 2018.

A. Elnahas, N. Elfishawy, M. Nour, and M. Tolba, “Machine learning and feature selection approaches for categorizing Arabic text: Analysis, comparison, and proposal,” Egypt. J. Lang. Eng., vol. 7, no. 2, pp. 1–19, 2020.

Z. B. Nezhad and M. A. Deihimi, “A combined deep learning model for Persian sentiment analysis,” IIUM Eng. J., vol. 20, no. 1, pp. 129–139, 2019.

H. Lu, L. Ehwerhemuepha, and C. Rakovski, “A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance,” BMC Med. Res. Methodol., vol. 22, no. 1, p. 181, 2022.

I. U. Khan et al., “A review of Urdu sentiment analysis with multilingual perspective: A case of Urdu and Roman Urdu language,” Computers, vol. 11, no. 1, p. 3, 2022.

W. Ali et al., “Creating and evaluating resources for sentiment analysis in the low-resource language: Sindhi,” in Proc. 11th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 188–194, 2021.

H. Elfaik and E. H. Nfaoui, “e,” J. Intell. Syst., vol. 30, no. 1, pp. 395–412, 2020.

S. Iqbal et al., “Sentiment analysis of social media content in Pashto language using deep learning algorithms,” J. Internet Technol., vol. 23, no. 7, pp. 1669–1677, 2022.

F. Barakzai, S. Bhatti, and S. Saddar, “Sentiment analysis of Sindhi news articles using deep learning,” in 2022 IEEE 17th Int. Conf. Comput. Sci. Inf. Technol. (CSIT), pp. 26–31, IEEE, 2022.

K. I. Malik, “Urdu news content classification using machine learning algorithms,” Lahore Garrison Univ. Res. J. Comput. Sci. Inf. Technol., vol. 6, no. 1, pp. 22–31, 2022.

U. Naqvi, A. Majid, and S. A. Abbas, “UTSA: Urdu text sentiment analysis using deep learning methods,” IEEE Access, vol. 9, pp. 114085–114094, 2021.

L. Khan et al., “Urdu sentiment analysis with deep learning methods,” IEEE Access, vol. 9, pp. 97803–97812, 2021.

X. Luo, “Efficient English text classification using selected machine learning techniques,” Alex. Eng. J., vol. 60, no. 3, pp. 3401–3409, 2021.

A. Setyanto et al., “Arabic language opinion mining based on long short-term memory (LSTM),” Appl. Sci., vol. 12, no. 9, p. 4140, 2022.

L. Khan et al., “Deep sentiment analysis using CNN-LSTM architecture of English and Roman Urdu text shared in social media,” Appl. Sci., vol. 12, no. 5, p. 2694, 2022.

A. C. M. V. Srinivas et al., “Sentiment analysis using neural network and LSTM,” in IOP Conf. Ser.: Mater. Sci. Eng., vol. 1074, p. 012007, IOP Publishing, 2021.

F. Aftab et al., “A comprehensive survey on sentiment analysis techniques,” Int. J. Technol., vol. 14, no. 6, pp. 1288–1298, 2023.

S. Fakhar et al., “Smart classroom monitoring using novel real-time facial expression recognition system,” Appl. Sci., vol. 12, no. 23, p. 12134, 2022, doi: 10.3390/app122312134.

A. Topbaş et al., “Sentiment analysis for COVID-19 tweets using recurrent neural network (RNN) and bidirectional encoder representations (BERT) models,” in 2021 Int. Conf. Comput., Electron. Electr. Eng. (ICECube), Quetta, Pakistan, pp. 1–6, doi: 10.1109/ICECube53880.2021.9628315.

I. Nasir et al., “Urdu sentiment analysis using deep learning,” in 2024 18th Int. Conf. Open Source Syst. Technol. (ICOSST), Dec. 2024, pp. 1–7.

J. Singh and P. Tripathi, “Sentiment analysis of Twitter data by making use of SVM, random forest and decision tree algorithm,” in 2021 10th IEEE Int. Conf. Commun. Syst. Netw. Technol. (CSNT), pp. 193–198.

Z. Zaland et al., “Vote bank forecast based on social media sentiment analysis,” in 2023 17th Int. Conf. Open Source Syst. Technol. (ICOSST), Dec. 2023, pp. 1–8.

Downloads

Published

2025-05-09

How to Cite

Hussain, S., Bazaib, S. U., Qadir, S., Marjan, S., ghafoor, M. I., & Pervaiz, P. (2025). Sentiment Analysis of Balochi Text Using Deep Learning. VAWKUM Transactions on Computer Sciences, 13(1), 190–200. https://doi.org/10.21015/vtcs.v13i1.2081