<Repository logo
  • English
  • Қазақ
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
  • English
  • Қазақ
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
Repository logo
  • Communities & Collections
  • All of SDU repository
  • GuideRegulations
  1. Home
  2. Browse by Author

Browsing by Author "Kudabayev T."

Now showing 1 - 1 of 1
Results Per Page
Sort Options
  • Loading...
    Thumbnail Image
    ItemOpen Access
    Randomized and approximized algorithms
    (2022) Kudabayev T.
    In modern times, as the number of websites expands, so does the volume of data. Automated apps that can scan and extract the essential information, process it, and save it in a user-friendly format are in high demand among users. These programs are called web scrapers. The immense popularity of these applications compels online service owners to take additional steps to decrease the number of bots in their Internet service traffic. If Web Services Protection identifies the user as a bot, the server will block the user entirely. The primary objective of this master’s thesis is to create a Node.js parser using the Selenium tool. In addition, in order to replicate human activities, our parser will utilize randomized algorithms. In this thesis, we will examine the work of server protection in greater detail; we will examine four common indicators by which the server identifies that the user is a parser, as well as the server’s methods for preventing bots. We will analyze how using the Selenium tool and the introduction of randomized algorithms will help us bypass the blocking from the server. To obtain the results, we will parse five subcategories of the chosen website and evaluate the stability of our software based on four parameters: the number of pages parsed, the amount of data processed, the total number of errors and blocks, and the rate of data parsing per unit of time. In order to do a qualitative analysis, we will compare these indicators using the same parser but without the application of randomized algorithms.

Find us

  • SDU Scientific Library Office B203,
  • Abylaikhana St. 1/1 Kaskelen, Kazakhstan

Call us

Phone: +7 (727) 307 9565 (Int. 183)

Mail us

E-mail: repository@sdu.edu.kz
logo

Useful Links

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback

Follow us

Springshare
ROAR
OpenDOAR

Copyright © 2023, All Right Reserved SDU University