Identifying spam messages for Kazakh Language using Hybrid Machine Learning Model
Loading...
Date
2023
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The rapid growth of digital communication has led to an increase in unwanted spam messages, which pose a significant challenge for users. While numerous machine learning techniques have been employed to combat spam in widely spoken languages, there is a lack of research on spam detection in less common languages, such as Kazakh. In this study, we propose a hybrid machine learning model for identifying spam messages in the Kazakh language. Our approach combines the strengths of different machine learning techniques to enhance the accuracy and efficiency of spam detection. To evaluate the performance of our hybrid model, we manually collected a dataset of Kazakh language messages, which were labeled as spam or non-spam. The results demonstrate that our hybrid approach surpasses traditional machine learning methods in terms of accuracy, precision, recall, and F1-score. This model not only improves the efficiency of spam filtering but also enhances the user experience for Kazakh-speaking individuals and promotes the development of languagespecific spam detection techniques for other underrepresented languages.
Description
Keywords
Internet, spam messages, machine learning, algorithms