KAZAKH NAMES GENERATOR USING DEEP LEARNING

Date

2020

Publisher

ВЕСТНИК КАЗАХСТАНСКО-БРИТАНСКОГО ТЕХНИЧЕСКОГО УНИВЕРСИТЕТА, №4

Abstract

In recent years, sentiment analysis of e-mail messages and social media posts has become very popular. It helps people determine whether the text they are reading is positive or negative. At the same time, there are services on the Internet that help customers find or create a new name. As part of the creation process, they check the name against other popular languages so that it does not mean something inappropriate in any of them. They charge 25 thousand US dollars for this. The existence of such services indicates that there is demand. In this study, sentiment analysis of e-mails was implemented using the StanfordNLP [1] lemmatizer and classic machine learning algorithms as the classifier. It was applied to real e-mails from a Russian-speaking mailbox, which contains both English and Russian messages; language identification was therefore added as a preprocessing step. Only binary sentiment analysis was performed in this study, but it could be extended to detect several emotions. A second model then generates Kazakh names using neural networks, trained on Kazakh name data collected from various websites. The sentiment analysis model achieves 81% accuracy, and the joint use of the two models allows us to generate new Kazakh names that are checked against the Russian language for inappropriate meanings. The result could be improved by checking against other languages as well.
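The generate-then-check idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. A character-level Markov chain stands in for the neural name generator, a substring blocklist stands in for the Russian-language check, and language identification is reduced to counting Cyrillic versus Latin letters. The sample names, the blocklist entry, and all function names are hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict

# Illustrative sample only; NOT the dataset collected in the study.
SAMPLE_NAMES = ["Aibek", "Aidar", "Aigerim", "Ainur", "Daniyar", "Dauren",
                "Nurlan", "Nursultan", "Serik", "Samat", "Marat", "Madina"]
# Hypothetical placeholder for fragments judged inappropriate in Russian.
BLOCKLIST = {"dur"}


def detect_language(text):
    """Crude script-based guess: 'ru' if Cyrillic letters outnumber Latin ones.

    Assumption: only Russian and English occur, as in the mailbox described
    above. This cannot tell Russian apart from other Cyrillic-script languages.
    """
    cyrillic = sum(1 for ch in text if "\u0400" <= ch <= "\u04FF")
    latin = sum(1 for ch in text if ch.isascii() and ch.isalpha())
    return "ru" if cyrillic > latin else "en"


def train_char_model(names, order=2):
    """Record which character follows each `order`-character context."""
    model = defaultdict(list)
    for name in names:
        padded = "^" * order + name.lower() + "$"  # ^ = start, $ = end
        for i in range(len(padded) - order):
            model[padded[i:i + order]].append(padded[i + order])
    return model


def generate_name(model, rng, order=2, max_len=12):
    """Sample one name character by character until the end marker."""
    context = "^" * order
    chars = []
    while len(chars) < max_len:
        nxt = rng.choice(model[context])
        if nxt == "$":
            break
        chars.append(nxt)
        context = context[1:] + nxt
    return "".join(chars).capitalize()


def is_acceptable(name, blocklist):
    """Stand-in for the inappropriate-meaning check: reject blocked fragments."""
    lowered = name.lower()
    return not any(fragment in lowered for fragment in blocklist)


def generate_filtered(names, blocklist, count, seed=0, max_tries=1000):
    """Generate up to `count` new, unique names that pass the check."""
    rng = random.Random(seed)
    model = train_char_model(names)
    known = {n.lower() for n in names}
    results = []
    for _ in range(max_tries):
        if len(results) == count:
            break
        candidate = generate_name(model, rng)
        if (candidate and candidate.lower() not in known
                and candidate not in results
                and is_acceptable(candidate, blocklist)):
            results.append(candidate)
    return results


new_names = generate_filtered(SAMPLE_NAMES, BLOCKLIST, count=5)
```

With such a small sample the generator may return fewer than five names. In the study itself the check is performed by the trained sentiment model rather than by a fixed list.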

Keywords

Natural language processing, sentiment analysis, Deep Learning

Citation

Nurmambetov D, Dauylov S, Bogdanchikov A / KAZAKH NAMES GENERATOR USING DEEP LEARNING / ВЕСТНИК КАЗАХСТАНСКО-БРИТАНСКОГО ТЕХНИЧЕСКОГО УНИВЕРСИТЕТА, №4 / 2020