Study of the transformation of Kazakh language speech into text data
dc.contributor.author | Kursabayeva A. | |
dc.date.accessioned | 2025-04-01T07:02:15Z | |
dc.date.available | 2025-04-01T07:02:15Z | |
dc.date.issued | 2024 | |
dc.description.abstract | The transformation of speech into text data is a key component in the development of modern language technologies and artificial intelligence. Despite significant advances in this field, support for languages with unique grammatical and phonetic characteristics, such as Kazakh, remains a challenge. The purpose of this study is to analyze the existing method of converting speech in the Kazakh language into text and evaluate their effectiveness. The research methodology includes the analysis of the VOSK model for speech transformation in the Kazakh language. An experimental study is being conducted based on the KazakhTTS dataset using machine learning and natural language processing methods. The results of the experiment, presented as an indicator of the error rate in the word (WER), showed that VOSK big and VOSK small have almost the same indicators (51% and 53% respectively). It was also noted that there are limitations in recognizing word endings and that some errors occur during speech recognition. The discussion of the results highlights the potential of the model and points to the need for further improvement and training in working with more diverse data. In conclusion, the key conclusions are outlined, as well as potential directions for further research in the field of Kazakh speech recognition. | |
dc.identifier.citation | Kursabayeva A / Study of the transformation of Kazakh language speech into text data / 2024 / Computer Science - 7M06012 | |
dc.identifier.uri | https://repository.sdu.edu.kz/handle/123456789/1677 | |
dc.language.iso | en | |
dc.publisher | Faculty of Engineering and Natural Science | |
dc.title | Study of the transformation of Kazakh language speech into text data | |
dc.type | Other |