Awesome Puppeteer: A curated list of awesome Puppeteer resources.
Stars: ✭ 1,728 (-45.21%)
browser-pool: A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-97.75%)
apify-cli: The Apify command-line interface helps you create, develop, build, and run Apify actors, and manage the Apify cloud platform.
Stars: ✭ 37 (-98.83%)
Phantomas: Headless Chromium-based web performance metrics collector and monitoring tool
Stars: ✭ 2,191 (-30.53%)
puppet-master: Puppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-99.21%)
codepen-puppeteer: Use Puppeteer to download pens from Codepen.io as single HTML pages
Stars: ✭ 22 (-99.3%)
ioweb: Web Scraping Framework
Stars: ✭ 31 (-99.02%)
Puppeteer Extra: 💯 Teach Puppeteer new tricks through plugins.
Stars: ✭ 3,397 (+7.7%)
double-agent: A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-96.1%)
bots-zoo: No description or website provided.
Stars: ✭ 59 (-98.13%)
Grawler: Grawler is a PHP tool with a web interface that automates running Google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (-96.89%)
Api Store: Contains all the public APIs listed in Phantombuster's API store. Pull requests welcome!
Stars: ✭ 69 (-97.81%)
Page2image: 📷 page2image is an npm package for taking screenshots that also provides a CLI command
Stars: ✭ 66 (-97.91%)
Squidwarc: Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-96.04%)
Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-91.22%)
Ayakashi: ⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-96.29%)
Autoscraper: A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+29.26%)
Webster: A reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-88.46%)
Puphpeteer: A Puppeteer bridge for PHP, supporting the entire API.
Stars: ✭ 1,014 (-67.85%)
Nickjs: Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)
Stars: ✭ 494 (-84.34%)
Linkedin Profile Scraper: 🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-94.58%)
Deno Puppeteer: A port of Puppeteer running on Deno
Stars: ✭ 128 (-95.94%)
zcrawl: An open source web crawling platform
Stars: ✭ 21 (-99.33%)
Whatsapp-Net: Generate a network graph of connections from your WhatsApp group data
Stars: ✭ 75 (-97.62%)
pdf-crawler: SimFin's open source PDF crawler
Stars: ✭ 100 (-96.83%)
puppeteer-lambda: Module for running headless Chrome with Puppeteer on AWS Lambda.
Stars: ✭ 117 (-96.29%)
Puppeteer: Headless Chrome Node.js API
Stars: ✭ 75,197 (+2284.18%)
Taiko: A Node.js library for testing modern web applications
Stars: ✭ 2,964 (-6.02%)
CrawlerSamples: Sample crawler console apps using Puppeteer and AngleSharp, written in C# 7.1 and built with .NET Core.
Stars: ✭ 36 (-98.86%)
Cdp4j: cdp4j - Chrome DevTools Protocol for Java
Stars: ✭ 232 (-92.64%)
Automagica: AI-powered Smart Robotic Process Automation 🤖
Stars: ✭ 2,610 (-17.25%)
PythonScrapyBasicSetup: Basic setup with random user agents and IP addresses for the Python Scrapy framework.
Stars: ✭ 57 (-98.19%)
core: The complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (-64.81%)
actor-content-checker: You can use this act to monitor any page's content and get a notification when content changes.
Stars: ✭ 16 (-99.49%)
socials: 👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-98.83%)
diffbot-php-client: [Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-98.32%)
trafilatura: Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (-77.46%)
scrapy-fieldstats: A Scrapy extension to log item field coverage when the spider shuts down
Stars: ✭ 17 (-99.46%)
hc-pdf-server: HTML-to-PDF conversion server using headless Chrome, written in TypeScript. The new version of hcep-pdf-server.
Stars: ✭ 24 (-99.24%)
crawling-framework: Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-99.3%)
selectorlib: A library to read a YAML file with XPath or CSS selectors and extract data from HTML pages using them
Stars: ✭ 53 (-98.32%)
ARGUS: ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and can crawl a broad range of websites, performing tasks such as scraping text or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-97.84%)
after-work.js: [DEPRECATED] CLI for automated tests in web projects.
Stars: ✭ 56 (-98.22%)
puppeteer-botcheck: 🕵️‍♂️ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (-98.67%)
proxycrawl-python: ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-98.38%)
nest-puppeteer: Puppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-97.84%)
throughout: 🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-99.49%)
phantom-lord: Handy API for Headless Chromium
Stars: ✭ 24 (-99.24%)
go-scrapy: Web crawling and scraping framework for Golang
Stars: ✭ 17 (-99.46%)
OLX Scraper: 📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for a requested product and dumps them into MongoDB (NoSQL).
Stars: ✭ 15 (-99.52%)
thal: [Translation] Puppeteer and Chrome Headless: from getting started to web scraping
Stars: ✭ 651 (-79.36%)
browser-automation-api: Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content, capture a screenshot, generate a PDF, extract content, or execute custom Puppeteer or Playwright functions.
Stars: ✭ 24 (-99.24%)
wget-lua: Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-98.35%)