crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-99.57%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-99.53%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (-97.99%)
browser-poolA Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-98.62%)
phantom-lordHandy API for Headless Chromium
Stars: ✭ 24 (-99.53%)
nest-puppeteerPuppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-98.67%)
node-headless-chrome⚠️ 🚧 Install precompiled versions of the Chromium/Chrome headless shell using npm or yarn
Stars: ✭ 20 (-99.61%)
RecorderA browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨
Stars: ✭ 277 (-94.6%)
site-audit-seoWeb service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (-98.23%)
rubiumRubium is a lightweight alternative to Selenium/Capybara/Watir if you need to perform some operations (like web scraping) using Headless Chromium and Ruby
Stars: ✭ 65 (-98.73%)
go-scrapyWeb crawling and scraping framework for Golang
Stars: ✭ 17 (-99.67%)
after-work.js[DEPRECATED] CLI for automated tests in web projects.
Stars: ✭ 56 (-98.91%)
copycatA PHP Scraping Class
Stars: ✭ 70 (-98.64%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-99.73%)
FlareSolverrSharpFlareSolverr .Net / Proxy server to bypass Cloudflare protection
Stars: ✭ 62 (-98.79%)
scrapy-distributedA series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-99.26%)
opensea-scraperScrapes nft floor prices and additional information from opensea. Used for https://nftfloorprice.info
Stars: ✭ 129 (-97.48%)
puppeteer-emailEmail automation driven by headless chrome.
Stars: ✭ 135 (-97.37%)
Scraper-Projects🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-99.51%)
Captcha-ToolsAll-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-99.55%)
naos📉 Uptime and error monitoring CLI
Stars: ✭ 30 (-99.42%)
PychromeA Python Package for the Google Chrome Dev Protocol [threading base]
Stars: ✭ 469 (-90.86%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (-98.81%)
Mimo-CrawlerA web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-99.57%)
scrapy facebookerCollection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-99.57%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-99.71%)
TorScrapperA Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-99.53%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-99.67%)
Sms Boom利用chrome的headless模式,模拟用户注册进行短信轰炸机
Stars: ✭ 507 (-90.12%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-90.95%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-99.06%)
ZeiverA Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-99.73%)
dijnet-botAz összes számlád még egy helyen :)
Stars: ✭ 17 (-99.67%)
thal译文:Puppeteer 与 Chrome Headless —— 从入门到爬虫
Stars: ✭ 651 (-87.31%)
Playwright Sharp.NET version of the Playwright testing and automation library.
Stars: ✭ 459 (-91.05%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (-89.55%)
docker-selenium-lambdaThe simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (-96.65%)
crawlkitA crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.
Stars: ✭ 23 (-99.55%)
puppeteer-githubGitHub automation driven by headless chrome.
Stars: ✭ 15 (-99.71%)
scraperNodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (-99.28%)
Hstspreload.org🔒 Chromium's HSTS preload list submission website.
Stars: ✭ 548 (-89.32%)
QzoneexportQQ空间导出助手,用于备份QQ空间的说说、日志、私密日记、相册、视频、留言板、QQ好友、收藏夹、分享、最近访客为文件,便于迁移与保存
Stars: ✭ 456 (-91.11%)
pccomponentes-buy-botA script made to buy any out-of-stock product off spanish stores
Stars: ✭ 34 (-99.34%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (-98.26%)
facebook-discussion-tkA collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (-99.36%)
mitm-playMan in the middle using Playwright
Stars: ✭ 13 (-99.75%)
SpidyThe simple, easy to use command line web crawler.
Stars: ✭ 257 (-94.99%)
WrpWeb Rendering Proxy: Use vintage, historical, legacy browsers on modern web
Stars: ✭ 503 (-90.19%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (-91.17%)
hc-pdf-serverConvert HTML to PDF Server by headless chrome with TypeScript. The new version of hcep-pdf-server.
Stars: ✭ 24 (-99.53%)