ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+33774.4%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+9136%)
RendertronA Headless Chrome rendering solution
Stars: ✭ 5,593 (+4374.4%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+12328%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+58.4%)
Html Pdf ChromeHTML to PDF converter via Chrome/Chromium
Stars: ✭ 629 (+403.2%)
BrowserlessA browser driver on top of puppeteer, ready for production scenarios.
Stars: ✭ 664 (+431.2%)
CupriteHeadless Chrome/Chromium driver for Capybara
Stars: ✭ 743 (+494.4%)
Ppspiderweb spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (+89.6%)
Skycaiji蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+1111.2%)
Chromeless🖥 Chrome automation made simple. Runs locally or headless on AWS Lambda.
Stars: ✭ 13,254 (+10503.2%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-1.6%)
codepen-puppeteerUse Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (-82.4%)
Viewfinder📷 BrowserBox - Remote isolated browser API for security, automation visibility and interactivity. Run on our cloud, or bring your own. Full scope double reverse web proxy with multi-tab, mobile-ready browser UI frontend. Plus co-browsing, advanced adaptive streaming, secure document viewing and more! But only in the Pro version. Get BB today! Se…
Stars: ✭ 1,741 (+1292.8%)
phantom-lordHandy API for Headless Chromium
Stars: ✭ 24 (-80.8%)
nest-puppeteerPuppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-45.6%)
Puppeteer WebperfAutomating Web Performance testing with Puppeteer 🎪
Stars: ✭ 1,392 (+1013.6%)
apify-cliApify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Stars: ✭ 37 (-70.4%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-88%)
Docker Headless ShellMinimal container for Chrome's headless shell, useful for automating / driving the web
Stars: ✭ 272 (+117.6%)
NavaliaA bullet-proof, fast, and reliable headless browser API
Stars: ✭ 950 (+660%)
Mochify.js☕️ TDD with Browserify, Mocha, Headless Chrome and WebDriver
Stars: ✭ 338 (+170.4%)
Whatspup🔳 WhatsApp chat from commandline/console/cli using GoogleChrome puppeteer
Stars: ✭ 310 (+148%)
LightcrawlerCrawl a website and run it through Google lighthouse
Stars: ✭ 1,339 (+971.2%)
FerrumHeadless Chrome Ruby API
Stars: ✭ 1,009 (+707.2%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+252%)
Pptraas.comPuppeteer as a service
Stars: ✭ 433 (+246.4%)
SinglefileWeb Extension for Firefox/Chrome/MS Edge and CLI tool to save a faithful copy of an entire web page in a single HTML file
Stars: ✭ 4,417 (+3433.6%)
DifferencifyDifferencify is a library for visual regression testing
Stars: ✭ 572 (+357.6%)
CrawlergoA powerful dynamic crawler for web vulnerability scanners
Stars: ✭ 1,088 (+770.4%)
ChromyChromy is a library for operating headless chrome. 🍺🍺🍺
Stars: ✭ 593 (+374.4%)
PuphpeteerA Puppeteer bridge for PHP, supporting the entire API.
Stars: ✭ 1,014 (+711.2%)
Mocha Chrome☕️ Run Mocha tests using headless Google Chrome
Stars: ✭ 66 (-47.2%)
Try PuppeteerRun Puppeteer code in the cloud
Stars: ✭ 642 (+413.6%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+409.6%)
Jest PuppeteerRun your tests using Jest & Puppeteer 🎪✨
Stars: ✭ 3,267 (+2513.6%)
Viewfinderjs📷 ViewFinder - NodeJS product to make the browser into a web app. WTF RBI. CBII. Remote browser isolation, embeddable browserview, secure chrome saas. Licenses, managed, self-hosted. Like S2, WebGap, Bromium, Authentic8, Menlo Security and Broadcom, but open source with free live demos available now! Also, integrated RBI/CDR with CDR from https://github.com/dosyago/p2%2e
Stars: ✭ 1,175 (+840%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-45.6%)
Chart To AwsMicroservice to generate screenshot from a webpage and upload it to a AWS S3 Bucket.
Stars: ✭ 43 (-65.6%)
Page2image📷 page2image is a npm package for taking screenshots which also provides CLI command
Stars: ✭ 66 (-47.2%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+531.2%)
ChroxyHeadless Chrome as a Service
Stars: ✭ 172 (+37.6%)
Headless RecorderChrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
Stars: ✭ 13,786 (+10928.8%)
Webshot FactoryWeb Screenshots at scale based on headless chrome
Stars: ✭ 288 (+130.4%)
Alpine ChromeChrome Headless docker images built upon alpine official image
Stars: ✭ 754 (+503.2%)
Roam Research Private ApiPrivate API to enable API access for Roam Research. Now you can connect Roam to your other projects.
Stars: ✭ 88 (-29.6%)
Puppeteer DartA Dart library to automate the Chrome browser over the DevTools Protocol. This is a port of the Puppeteer API
Stars: ✭ 92 (-26.4%)