INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND MATHEMATICAL THEORY (IJCSMT)

E-ISSN 2545-5699
P-ISSN 2695-1924
VOL. 11 NO. 1 2025
DOI: 10.56201/ijcsmt.v11.no1.2025.pg21.38


Analysing WhatsApp Group Chat Using Advanced Natural Language Processing (NLP) Techniques

Digha, Azibalua Franklin; Obasi, Chinonye Mary Emmanuella; Ajao, Wasiu Bamidele


Abstract


This research investigates WhatsApp group chat analysis, focusing on sentiment detection and topic modeling with machine learning and natural language processing (NLP) algorithms. As instant messaging platforms are increasingly used for social and professional communication, understanding group dynamics becomes critical. The study uses Random Forest and XGBoost classifiers to classify sentiment as Positive, Negative, or Neutral, and the BERTopic module for topic modeling to uncover prevalent themes in conversations. XGBoost achieved a higher classification accuracy (92.8%) than the Random Forest classifier (88.6%) and handled the class imbalance in the dataset effectively. The sentiment distribution showed that Neutral was the largest category (45%), followed by Positive (35%) and Negative (20%). Topic modeling identified key themes such as event planning, work-related collaboration, and casual social interactions. These findings highlight the effectiveness of machine learning and NLP techniques in extracting valuable insights from group chat data, with applications ranging from user behavior analysis to enhancing communication strategies on digital platforms.
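The abstract does not state how the class imbalance was handled. Purely as an illustration, one common heuristic is inverse-frequency class weighting, where each class receives the weight n_samples / (n_classes * class_count). The sketch below applies it to a hypothetical label list built from the reported 45/35/20 split; the function name and the sample data are assumptions for illustration, not the authors' method.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return inverse-frequency weights: n_samples / (n_classes * class_count).

    Hypothetical helper; rarer classes receive larger weights.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


# Hypothetical labels mirroring the reported distribution (assumed, not real data):
# 45% Neutral, 35% Positive, 20% Negative.
labels = ["Neutral"] * 45 + ["Positive"] * 35 + ["Negative"] * 20

weights = balanced_class_weights(labels)
for sentiment, weight in sorted(weights.items()):
    print(f"{sentiment}: {weight:.3f}")
```

Weights of this kind could be passed as per-sample weights when fitting a classifier, so that errors on the rarer Negative class count for more during training.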


Keywords:

NLP, WhatsApp, BERTopic, RF, XGBoost


