<Repository logo
  • English
  • Қазақ
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
  • English
  • Қазақ
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
Repository logo
  • Communities & Collections
  • All of SDU repository
  • GuideRegulations
  1. Home
  2. Browse by Author

Browsing by Author "Turapbekov B."

Now showing 1 - 1 of 1
Results Per Page
Sort Options
  • Loading...
    Thumbnail Image
    ItemOpen Access
    BUILDING KAZAKH LANGUAGE OPEN SOURCE CORPORA USING WIKIPEDIA RESOURCES
    (СДУ хабаршысы - 2018, 2018) Chapaev D. ; Turapbekov B.
    Abstract. The lack of free public accessible Kazakh language corpus is one of the difficulties that Kazakh linguistics researchers face. Corpuses are used as a data source in statistical linguistics for the detection of unigrams, bigrams and n-grams. These data help analyze the structure of the language and find the most used words, etc. The aim of this paper is a step towards supporting Kazakh linguistics with the open source corpus built on Wikipedia dumps and one of its applications a Kazakh spell checker. Now, corpus contains over 21 million words. It is also open source and waiting for any contributors and suggestions.

Find us

  • SDU Scientific Library Office B203,
  • Abylaikhana St. 1/1 Kaskelen, Kazakhstan

Call us

Phone: +7 (727) 307 9565 (Int. 183)

Mail us

E-mail: repository@sdu.edu.kz
logo

Useful Links

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback

Follow us

Springshare
ROAR
OpenDOAR

Copyright © 2023, All Right Reserved SDU University