Awesome PuppeteerA curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+1663.27%)
Apify JsApify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+3118.37%)
Intrec PackIntelligence and Reconnaissance Package/Bundle installer.
Stars: ✭ 177 (+80.61%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-46.94%)
scrapy-distributedA series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-61.22%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+15752.04%)
Bhban rpa6개월 치 업무를 하루 만에 끝내는 업무 자동화(생능출판사, 2020)의 예제 코드입니다. 파이썬을 한 번도 배워본 적 없는 분들을 위한 예제이며, 엑셀부터 디자인, 매크로, 크롤링까지 업무 자동화와 관련된 다양한 분야 예제가 제공됩니다.
Stars: ✭ 124 (+26.53%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-45.92%)
scrapy-fieldstatsA Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-82.65%)
Comic DlComic-dl is a command line tool to download manga and comics from various comic and manga sites. Supported sites : readcomiconline.to, mangafox.me, comic naver and many more.
Stars: ✭ 365 (+272.45%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+348.98%)
Secret AgentThe web browser that's built for scraping.
Stars: ✭ 151 (+54.08%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+74.49%)
Chatterinternet monitoring osint telegram bot for windows
Stars: ✭ 123 (+25.51%)
MemoriousDistributed crawling framework for documents and structured data.
Stars: ✭ 248 (+153.06%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+25.51%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-47.96%)
Assh💻 make your ssh client smarter
Stars: ✭ 2,340 (+2287.76%)
Php Curl ClassPHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+2862.24%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+182.65%)
KatanaA Python Tool For google Hacking
Stars: ✭ 355 (+262.24%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (+191.84%)
OjTools for various online judges. Downloading sample cases, generating additional test cases, testing your code, and submitting it.
Stars: ✭ 517 (+427.55%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+705.1%)
ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+43107.14%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (+2.04%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+102.04%)
WebsocatCommand-line client for WebSockets, like netcat (or curl) for ws:// with advanced socat-like functions
Stars: ✭ 3,477 (+3447.96%)
D4n155OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (+7.14%)
Instagram BotAn Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (+40.82%)
Php WhoisPHP WHOIS provides parsed and raw whois lookup of domains and ASN routes. PHP 5.4+ and 7+ compatible
Stars: ✭ 179 (+82.65%)
Cdp4jcdp4j - Chrome DevTools Protocol for Java
Stars: ✭ 232 (+136.73%)
MosintAn automated e-mail OSINT tool
Stars: ✭ 184 (+87.76%)
socials👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-62.24%)
Anime DlAnime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (+93.88%)
zcrawlAn open source web crawling platform
Stars: ✭ 21 (-78.57%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-77.55%)
go-scrapyWeb crawling and scraping framework for Golang
Stars: ✭ 17 (-82.65%)
Nginx LeNginx with automatic let's encrypt (docker image)
Stars: ✭ 475 (+384.69%)
DorknetSelenium powered Python script to automate searching for vulnerable web apps.
Stars: ✭ 256 (+161.22%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-30.61%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-39.8%)
Api StoreContains all the public APIs listed in Phantombuster's API store. Pull requests welcome!
Stars: ✭ 69 (-29.59%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+4060.2%)
SpidermonScrapy Extension for monitoring spiders execution.
Stars: ✭ 309 (+215.31%)
Undetected ChromedriverCustom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)
Stars: ✭ 365 (+272.45%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (-37.76%)
NickjsWeb scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)
Stars: ✭ 494 (+404.08%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+4835.71%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+365.31%)
PastepwnPython framework to scrape Pastebin pastes and analyze them
Stars: ✭ 87 (-11.22%)
DahliaAn opinionated React Framework. [Rename to pea.js]
Stars: ✭ 92 (-6.12%)
CocsharpClash of Clans library, proxy and server written in .NET [Unmaintained]
Stars: ✭ 94 (-4.08%)