FAST AND RELATIVELY ACCURATE SENTIMENT ANALYSIS FOR THE KAZAKH LANGUAGE
dc.contributor.author | N. Manteyeva | |
dc.date.accessioned | 2024-01-16T04:41:05Z | |
dc.date.available | 2024-01-16T04:41:05Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Abstract This paper constructs a fast and accurate sentiment analysis model for the Kazakh language. The main method for text classification is based on TF-IDF-based tokens trained with Logistic Regression. The processing and modeling stages are fully implemented in the PySpark framework. The proposed method has shown an accuracy level of 82% on an evenly distributed test dataset. As a byproduct of the work, we have collected a list of words in the Kazakh language that could signal the negativity/positivity of the given review. | |
dc.identifier.citation | N. Manteyeva / FAST AND RELATIVELY ACCURATE SENTIMENT ANALYSIS FOR THE KAZAKH LANGUAGE / 2022 International Young Scholars' Conference | |
dc.identifier.uri | https://repository.sdu.edu.kz/handle/123456789/1125 | |
dc.language.iso | en | |
dc.publisher | 2022 International Young Scholars' Conference | |
dc.subject | sentiment analysis | |
dc.subject | natural language processing | |
dc.subject | Kazakh language | |
dc.subject | 2022 International Young Scholars' Conference | |
dc.subject | №11 | |
dc.title | FAST AND RELATIVELY ACCURATE SENTIMENT ANALYSIS FOR THE KAZAKH LANGUAGE | |
dc.type | Other |