7 open source projects by NikolaiT

Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.

✭ 220

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

✭ 2,363

Update of uncaptcha2 from 2019

✭ 92

Javascript scraping module based on puppeteer for many different search engines...

✭ 425

Passive TCP/IP Fingerprinting Tool. Run this on your server and find out what Operating Systems your clients are *really* using.

✭ 83

Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.

✭ 49

Cloud crawler functions for scrapeulous

✭ 38

1-7 of 7 user projects