All Projects → Webster → Similar Projects or Alternatives

1306 Open source projects that are alternatives of or similar to Webster

Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+1309.07%)
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-53.02%)
Mutual labels:  crawler, spider, crawling, puppeteer
Chromium for spider
dynamic crawler for web vulnerability scanner
Stars: ✭ 220 (-39.56%)
Mutual labels:  crawler, spider, puppeteer, chromium
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-65.66%)
Mutual labels:  crawler, crawling, puppeteer, headless-chrome
Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+374.73%)
Mutual labels:  crawling, puppeteer, headless-chrome
Puppeteer Walker
a puppeteer walker 🕷 🕸
Stars: ✭ 78 (-78.57%)
Mutual labels:  crawler, spider, puppeteer
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-23.9%)
Mutual labels:  crawler, spider, crawling
throughout
🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-95.6%)
Mutual labels:  chromium, headless-chrome, puppeteer
puppet-master
Puppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-93.13%)
Mutual labels:  crawling, headless-chrome, puppeteer
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+4167.86%)
Mutual labels:  crawler, spider, crawling
Skycaiji
蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+315.93%)
Mutual labels:  crawler, spider, crawling
bots-zoo
No description or website provided.
Stars: ✭ 59 (-83.79%)
Mutual labels:  crawler, crawling, puppeteer
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+20.88%)
Mutual labels:  crawler, spider, crawling
Crawlergo
A powerful dynamic crawler for web vulnerability scanners
Stars: ✭ 1,088 (+198.9%)
Mutual labels:  crawler, chromium, headless-chrome
Phantomas
Headless Chromium-based web performance metrics collector and monitoring tool
Stars: ✭ 2,191 (+501.92%)
Mutual labels:  puppeteer, chromium, headless-chrome
Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+766.48%)
Mutual labels:  crawling, puppeteer, headless-chrome
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-81.32%)
Mutual labels:  crawler, spider, crawling
Ppspider
web spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (-34.89%)
Mutual labels:  crawler, spider, puppeteer
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-86.81%)
Mutual labels:  crawler, spider, crawling
simplechrome
Webrecorders DevTools Protocol Automation Library
Stars: ✭ 16 (-95.6%)
Mutual labels:  chromium, puppeteer
after-work.js
[DEPRECATED] CLI for automated tests in web projects.
Stars: ✭ 56 (-84.62%)
Mutual labels:  headless-chrome, puppeteer
LInkedIn-Reverese-Lookup
🔎Search LinkedIn profile by email address📧
Stars: ✭ 20 (-94.51%)
Mutual labels:  chromium, puppeteer
Zhihu Login
知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (-6.59%)
Mutual labels:  crawler, spider
headless-chrome-alpine
A Docker container running headless Chrome
Stars: ✭ 26 (-92.86%)
Mutual labels:  chromium, headless-chrome
nest-puppeteer
Puppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-81.32%)
Mutual labels:  headless-chrome, puppeteer
Mochify.js
☕️ TDD with Browserify, Mocha, Headless Chrome and WebDriver
Stars: ✭ 338 (-7.14%)
Mutual labels:  puppeteer, headless-chrome
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-66.21%)
Mutual labels:  crawling, puppeteer
ZSpider
基于Electron爬虫程序
Stars: ✭ 37 (-89.84%)
Mutual labels:  spider, puppeteer
puppeteer-autoscroll-down
Handle infinite scroll on websites by puppeteer
Stars: ✭ 40 (-89.01%)
Mutual labels:  headless-chrome, puppeteer
codepen-puppeteer
Use Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (-93.96%)
Mutual labels:  headless-chrome, puppeteer
puppeteer-instagram
Instagram automation driven by headless chrome.
Stars: ✭ 87 (-76.1%)
Mutual labels:  headless-chrome, puppeteer
nanobox-express
Quickstart for Express on Nanobox
Stars: ✭ 13 (-96.43%)
spider
A web spider framework
Stars: ✭ 25 (-93.13%)
Mutual labels:  spider, puppeteer
apify-cli
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Stars: ✭ 37 (-89.84%)
Mutual labels:  headless-chrome, puppeteer
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-85.71%)
Mutual labels:  spider, crawling
Recorder
A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨
Stars: ✭ 277 (-23.9%)
Mutual labels:  chromium, puppeteer
phantom-lord
Handy API for Headless Chromium
Stars: ✭ 24 (-93.41%)
Mutual labels:  headless-chrome, puppeteer
scrapy-distributed
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-89.56%)
Mutual labels:  spider, crawling
FlareSolverrSharp
FlareSolverr .Net / Proxy server to bypass Cloudflare protection
Stars: ✭ 62 (-82.97%)
Mutual labels:  chromium, puppeteer
puppeteer-email
Email automation driven by headless chrome.
Stars: ✭ 135 (-62.91%)
Mutual labels:  headless-chrome, puppeteer
Toapi
Every web site provides APIs.
Stars: ✭ 3,209 (+781.59%)
Mutual labels:  crawler, spider
talospider
talospider - A simple,lightweight scraping micro-framework
Stars: ✭ 57 (-84.34%)
Mutual labels:  spider, crawling
SlackWebhooksGithubCrawler
Search for Slack Webhooks token publicly exposed on Github
Stars: ✭ 21 (-94.23%)
Mutual labels:  crawling, puppeteer
crawler
A simple and flexible web crawler framework for java.
Stars: ✭ 20 (-94.51%)
Mutual labels:  crawler, spider
kites
Template-based Web Application Framework
Stars: ✭ 51 (-85.99%)
node-headless-chrome
⚠️ 🚧 Install precompiled versions of the Chromium/Chrome headless shell using npm or yarn
Stars: ✭ 20 (-94.51%)
Mutual labels:  chromium, headless-chrome
purescript-toppokki
A binding to puppeteer to drive headless Chrome.
Stars: ✭ 48 (-86.81%)
Mutual labels:  headless-chrome, puppeteer
img-cli
An interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-95.88%)
Mutual labels:  crawler, crawling
puppeteer-github
GitHub automation driven by headless chrome.
Stars: ✭ 15 (-95.88%)
Mutual labels:  headless-chrome, puppeteer
WebCrawler
一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。
Stars: ✭ 39 (-89.29%)
Mutual labels:  crawler, spider
thal
译文:Puppeteer 与 Chrome Headless —— 从入门到爬虫
Stars: ✭ 651 (+78.85%)
Mutual labels:  headless-chrome, puppeteer
hc-pdf-server
Convert HTML to PDF Server by headless chrome with TypeScript. The new version of hcep-pdf-server.
Stars: ✭ 24 (-93.41%)
Mutual labels:  headless-chrome, puppeteer
slime
🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-92.58%)
Mutual labels:  crawler, spider
ZhengFang System Spider
🐛一只登录正方教务管理系统,爬取数据的小爬虫
Stars: ✭ 21 (-94.23%)
Mutual labels:  crawler, spider
Xcrawler
快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (-5.49%)
Mutual labels:  crawler, spider
galer
A fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (-62.09%)
Mutual labels:  crawler, spider
mitm-play
Man in the middle using Playwright
Stars: ✭ 13 (-96.43%)
Mutual labels:  chromium, puppeteer
Spidy
The simple, easy to use command line web crawler.
Stars: ✭ 257 (-29.4%)
Mutual labels:  crawler, crawling
Weixin Spider
微信公众号爬虫,公众号历史文章,文章评论,文章阅读及在看数据,可视化web页面,可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现,高效微信爬虫,微信公众号爬虫,历史文章,文章评论,数据更新。
Stars: ✭ 287 (-21.15%)
Mutual labels:  crawler, spider
arachnod
High performance crawler for Nodejs
Stars: ✭ 17 (-95.33%)
Mutual labels:  crawler, spider
1-60 of 1306 similar projects