SPEECH RECOGNITION BASED ON CONVENTIONAL NEURAL NETWORKS

dc.contributor.authorAlmukhametova D.
dc.contributor.authorKuanyshbay D.
dc.contributor.authorAskar N.
dc.date.accessioned2023-12-14T08:51:35Z
dc.date.available2023-12-14T08:51:35Z
dc.date.issued2021
dc.description.abstractAbstract. In this research work, the problem of speech recognition is considered in the form of an analysis of the numbers from 1 to 10 recorded by the speaker on the dictaphone. The paper uses the method of recognizing the spectrogram of an audio signal using convolutional neural networks. Also written and implemented an algorithm for processing input data, and an algorithm for recognizing spoken words. In this work, the quality of recognition was assessed for a different number of convolutional layers. A comparison of the recognition quality is made in cases when the input data for the network are the spectrogram of the audio signal or the first two formants extracted from it. The recognition algorithm was tested using examples of male and female voices with different pronunciation lengths.
dc.identifier.citationD. Almukhametova , D. Kuanyshbay , N. Askar / SPEECH RECOGNITION BASED ON CONVENTIONAL NEURAL NETWORKS / СДУ хабаршысы - 2021
dc.identifier.issn2709-2631
dc.identifier.urihttps://repository.sdu.edu.kz/handle/123456789/990
dc.language.isoen
dc.publisherСДУ хабаршысы - 2021
dc.subjectspectrogram
dc.subjectformant
dc.subjectalgorithm for learningneural networks
dc.subjectСДУ хабаршысы - 2021
dc.subject№2
dc.titleSPEECH RECOGNITION BASED ON CONVENTIONAL NEURAL NETWORKS
dc.typeArticle

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2021.2 жаратылыстану-53-59.pdf
Size:
2.91 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
13.85 KB
Format:
Item-specific license agreed to upon submission
Description: