Study of the transformation of Kazakh language speech into text data

dc.contributor.authorKursabayeva A.
dc.date.accessioned2025-04-01T07:02:15Z
dc.date.available2025-04-01T07:02:15Z
dc.date.issued2024
dc.description.abstractThe transformation of speech into text data is a key component in the development of modern language technologies and artificial intelligence. Despite significant advances in this field, support for languages with unique grammatical and phonetic characteristics, such as Kazakh, remains a challenge. The purpose of this study is to analyze the existing method of converting speech in the Kazakh language into text and evaluate their effectiveness. The research methodology includes the analysis of the VOSK model for speech transformation in the Kazakh language. An experimental study is being conducted based on the KazakhTTS dataset using machine learning and natural language processing methods. The results of the experiment, presented as an indicator of the error rate in the word (WER), showed that VOSK big and VOSK small have almost the same indicators (51% and 53% respectively). It was also noted that there are limitations in recognizing word endings and that some errors occur during speech recognition. The discussion of the results highlights the potential of the model and points to the need for further improvement and training in working with more diverse data. In conclusion, the key conclusions are outlined, as well as potential directions for further research in the field of Kazakh speech recognition.
dc.identifier.citationKursabayeva A / Study of the transformation of Kazakh language speech into text data / 2024 / Computer Science - 7M06012
dc.identifier.urihttps://repository.sdu.edu.kz/handle/123456789/1677
dc.language.isoen
dc.publisherFaculty of Engineering and Natural Science
dc.titleStudy of the transformation of Kazakh language speech into text data
dc.typeOther

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ainur Kursabayeva.pdf
Size:
454.17 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
12.6 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections