All Projects → lorien → Awesome Web Scraping

lorien / Awesome Web Scraping

Licence: other
List of libraries, tools and APIs for web scraping and data processing.

Programming Languages

Makefile
30231 projects

Projects that are alternatives of or similar to Awesome Web Scraping

2captcha-php
PHP package for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, hcaptcha, funcaptcha, geetest and solve any other captchas.
Stars: ✭ 25 (-99.45%)
Mutual labels:  captcha-solving, captcha-breaking, captcha-recognition
socks5 list
Auto-updated SOCKS5 proxy list + proxies for Telegram
Stars: ✭ 210 (-95.34%)
Mutual labels:  proxy-server, proxy-list, proxylist
Proxy List
A list of free, public, forward proxy servers. UPDATED DAILY!
Stars: ✭ 1,125 (-75.06%)
Mutual labels:  proxy, proxy-server, proxy-list
Delete
(迫于压力,本项目停止维护,请尽快fork代码。1月1日之后删除项目)[免翻墙工具]A free and open-source youtube video proxy script [Written in PHP]
Stars: ✭ 1,316 (-70.82%)
Mutual labels:  proxy, proxy-server, proxy-list
Proxybroker
Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭
Stars: ✭ 2,767 (-38.65%)
Mutual labels:  proxy, proxy-server, proxy-list
TikTokBot
Bot save videos from instagram and then post them to Tik-Tok
Stars: ✭ 21 (-99.53%)
Mutual labels:  captcha-solving, captcha-breaking, captcha-solver
2captcha-python
Python 3 package for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, hcaptcha, funcaptcha, geetest and solve any other captchas.
Stars: ✭ 140 (-96.9%)
Mutual labels:  captcha-solving, captcha-breaking, captcha-recognition
Freeproxy
免费、高速的 V2Ray 代理和订阅。
Stars: ✭ 104 (-97.69%)
Mutual labels:  proxy, proxy-server, proxy-list
Free Proxy List
🔥Free proxy servers list / Updated hourly!
Stars: ✭ 326 (-92.77%)
Mutual labels:  proxy, proxy-server, proxy-list
Mubeng
An incredibly fast proxy checker & IP rotator with ease.
Stars: ✭ 234 (-94.81%)
Mutual labels:  proxy, proxy-server, proxy-list
Proxy requests
a class that uses scraped proxies to make http GET/POST requests (Python requests)
Stars: ✭ 357 (-92.08%)
Mutual labels:  proxy, proxy-server, proxy-list
ProxyChecker
proxy checker to check the status of the ip-port proxy list
Stars: ✭ 24 (-99.47%)
Mutual labels:  proxy-server, proxy-list
proxy fetcher
💪 Ruby / JRuby / TrufflleRuby gem & CLI for dealing with proxy lists from various sources
Stars: ✭ 119 (-97.36%)
Mutual labels:  proxy-list, proxylist
captcha-solver
Library and CLI for automating captcha verification across multiple providers.
Stars: ✭ 101 (-97.76%)
Mutual labels:  captcha-solving, anti-captcha
LiveProxies
Asynchronous proxy checker
Stars: ✭ 17 (-99.62%)
Mutual labels:  proxy-server, proxy-list
torchestrator
Spin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (-99.29%)
Mutual labels:  proxy-server, proxy-list
2captcha-go
Golang Module for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, hcaptcha, funcaptcha, geetest and solve any other captchas.
Stars: ✭ 31 (-99.31%)
Mutual labels:  captcha-solving, captcha-breaking
Free-Proxy
Hi there will be a lot of proxies here.
Stars: ✭ 135 (-97.01%)
Mutual labels:  proxy-server, proxy-list
StegoProxy
Steganography proxy implemented in java
Stars: ✭ 19 (-99.58%)
Mutual labels:  proxy-server, proxyserver
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (-35.63%)
Mutual labels:  web-scraping, proxy

Awesome Web Scraping

The list of tools, programming libraries and web services used in web scraping and data processing.

Web scraping chats: @grablab (English) and @grablab_ru (Russian)

Programming Libraries

Other Things

Captcha Solving Services

These two links point to same captcha service, it is just a different language versions

Contributing

See Contributing document.

Credits

The list is based initially on some data from these sources awesome-python, awesome-php, awesome-ruby, ruby-nlp, awesome-javascript

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].