Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-53.02%)
SquidwarcSquidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-65.66%)
Awesome PuppeteerA curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+374.73%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-23.9%)
throughout🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-95.6%)
puppet-masterPuppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-93.13%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+4167.86%)
Skycaiji蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+315.93%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-83.79%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+20.88%)
CrawlergoA powerful dynamic crawler for web vulnerability scanners
Stars: ✭ 1,088 (+198.9%)
PhantomasHeadless Chromium-based web performance metrics collector and monitoring tool
Stars: ✭ 2,191 (+501.92%)
Apify JsApify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+766.48%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-81.32%)
Ppspiderweb spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (-34.89%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-86.81%)
simplechromeWebrecorders DevTools Protocol Automation Library
Stars: ✭ 16 (-95.6%)
after-work.js[DEPRECATED] CLI for automated tests in web projects.
Stars: ✭ 56 (-84.62%)
Zhihu Login知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (-6.59%)
nest-puppeteerPuppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-81.32%)
Mochify.js☕️ TDD with Browserify, Mocha, Headless Chrome and WebDriver
Stars: ✭ 338 (-7.14%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-66.21%)
ZSpider基于Electron爬虫程序
Stars: ✭ 37 (-89.84%)
codepen-puppeteerUse Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (-93.96%)
spiderA web spider framework
Stars: ✭ 25 (-93.13%)
apify-cliApify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Stars: ✭ 37 (-89.84%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-85.71%)
RecorderA browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨
Stars: ✭ 277 (-23.9%)
phantom-lordHandy API for Headless Chromium
Stars: ✭ 24 (-93.41%)
scrapy-distributedA series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-89.56%)
FlareSolverrSharpFlareSolverr .Net / Proxy server to bypass Cloudflare protection
Stars: ✭ 62 (-82.97%)
puppeteer-emailEmail automation driven by headless chrome.
Stars: ✭ 135 (-62.91%)
ToapiEvery web site provides APIs.
Stars: ✭ 3,209 (+781.59%)
talospidertalospider - A simple,lightweight scraping micro-framework
Stars: ✭ 57 (-84.34%)
crawlerA simple and flexible web crawler framework for java.
Stars: ✭ 20 (-94.51%)
kitesTemplate-based Web Application Framework
Stars: ✭ 51 (-85.99%)
node-headless-chrome⚠️ 🚧 Install precompiled versions of the Chromium/Chrome headless shell using npm or yarn
Stars: ✭ 20 (-94.51%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-95.88%)
puppeteer-githubGitHub automation driven by headless chrome.
Stars: ✭ 15 (-95.88%)
WebCrawler一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。
Stars: ✭ 39 (-89.29%)
thal译文:Puppeteer 与 Chrome Headless —— 从入门到爬虫
Stars: ✭ 651 (+78.85%)
hc-pdf-serverConvert HTML to PDF Server by headless chrome with TypeScript. The new version of hcep-pdf-server.
Stars: ✭ 24 (-93.41%)
slime🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-92.58%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (-5.49%)
galerA fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (-62.09%)
mitm-playMan in the middle using Playwright
Stars: ✭ 13 (-96.43%)
SpidyThe simple, easy to use command line web crawler.
Stars: ✭ 257 (-29.4%)
Weixin Spider微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (-21.15%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-95.33%)