bots-zooNo description or website provided.
Stars: ✭ 59 (-92.52%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-93.41%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-44.23%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+513.05%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-78.33%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+1868.95%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-74.9%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (-19.26%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-98.1%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+57.92%)
ZeiverA Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-98.23%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-64.89%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-98.1%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+1974.65%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-73.26%)
ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+5266.67%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+416.73%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (-63.75%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (-42.21%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-87.33%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+1363.24%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-93.28%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-93.54%)
NewcrawlerFree Web Scraping Tool with Java
Stars: ✭ 589 (-25.35%)
patreon-scraperWIP Patreon attachment download written in TypeScript
Stars: ✭ 25 (-96.83%)
scrapy facebookerCollection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-97.21%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (-92.27%)
Scraper-Projects🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-96.83%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-93.92%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-97.85%)
TorScrapperA Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-96.96%)
TumblTwoTumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.
Stars: ✭ 57 (-92.78%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-91.38%)
dijnet-botAz összes számlád még egy helyen :)
Stars: ✭ 17 (-97.85%)
facebook-discussion-tkA collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (-95.82%)
fiction-dlA content downloader, capable of retrieving works of (fan)fiction from the web and saving them in a few common file formats.
Stars: ✭ 22 (-97.21%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (-88.72%)
SpidyThe simple, easy to use command line web crawler.
Stars: ✭ 257 (-67.43%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (-32.07%)
Captcha-ToolsAll-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-97.08%)
scraperNodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (-95.31%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (-1.01%)
Imagescraper✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (-20.15%)
LinkedinLinkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (-60.84%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (-65.27%)
Hquery.phpAn extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (-62.61%)
SpidermonScrapy Extension for monitoring spiders execution.
Stars: ✭ 309 (-60.84%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (-56.4%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-55.89%)
Apify JsApify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+299.75%)
Webstera reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-53.87%)
KatanaA Python Tool For google Hacking
Stars: ✭ 355 (-55.01%)