All Projects → Apify Js → Similar Projects or Alternatives

2136 Open source projects that are alternatives of or similar to Apify Js

Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (-45.21%)
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+62.62%)
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-97.75%)
Mutual labels:  scraping, web-scraping, rpa, puppeteer
apify-cli
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Stars: ✭ 37 (-98.83%)
Mutual labels:  headless-chrome, apify, puppeteer
Phantomas
Headless Chromium-based web performance metrics collector and monitoring tool
Stars: ✭ 2,191 (-30.53%)
Mutual labels:  automation, puppeteer, headless-chrome
puppet-master
Puppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-99.21%)
Mutual labels:  crawling, headless-chrome, puppeteer
codepen-puppeteer
Use Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (-99.3%)
ioweb
Web Scraping Framework
Stars: ✭ 31 (-99.02%)
Mutual labels:  scraping, web-scraping, web-crawling
Puppeteer Extra
💯 Teach puppeteer new tricks through plugins.
Stars: ✭ 3,397 (+7.7%)
Mutual labels:  automation, puppeteer, headless-chrome
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-96.1%)
Mutual labels:  scraping, crawling, puppeteer
bots-zoo
No description or website provided.
Stars: ✭ 59 (-98.13%)
Mutual labels:  scraping, crawling, puppeteer
Grawler
Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (-96.89%)
Mutual labels:  automation, scraping, crawling
Api Store
Contains all the public APIs listed in Phantombuster's API store. Pull requests welcome!
Stars: ✭ 69 (-97.81%)
Mutual labels:  automation, scraping, headless-chrome
Page2image
📷 page2image is a npm package for taking screenshots which also provides CLI command
Stars: ✭ 66 (-97.91%)
Mutual labels:  npm, puppeteer, headless-chrome
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-96.04%)
Mutual labels:  crawling, puppeteer, headless-chrome
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-91.22%)
Mutual labels:  scraping, web-scraping, crawling
Ayakashi
⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-96.29%)
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+29.26%)
Mutual labels:  automation, scraping, web-scraping
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-88.46%)
Mutual labels:  crawling, puppeteer, headless-chrome
Puphpeteer
A Puppeteer bridge for PHP, supporting the entire API.
Stars: ✭ 1,014 (-67.85%)
Mutual labels:  automation, puppeteer, headless-chrome
Nickjs
Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)
Stars: ✭ 494 (-84.34%)
Mutual labels:  automation, scraping, headless-chrome
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-94.58%)
Mutual labels:  scraping, crawling, puppeteer
Deno Puppeteer
A port of puppeteer running on Deno
Stars: ✭ 128 (-95.94%)
Mutual labels:  automation, puppeteer, headless-chrome
zcrawl
An open source web crawling platform
Stars: ✭ 21 (-99.33%)
Mutual labels:  scraping, crawling, web-crawling
Whatsapp-Net
Generate a network graph of connections from your WhatsApp groups data
Stars: ✭ 75 (-97.62%)
Mutual labels:  scraping, puppeteer
pdf-crawler
SimFin's open source PDF crawler
Stars: ✭ 100 (-96.83%)
Mutual labels:  crawling, puppeteer
puppeteer-lambda
Module for using Headless-Chrome by Puppeteer on AWS Lambda.
Stars: ✭ 117 (-96.29%)
Mutual labels:  headless-chrome, puppeteer
Puppeteer
Headless Chrome Node.js API
Stars: ✭ 75,197 (+2284.18%)
Mutual labels:  automation, headless-chrome
Taiko
A node.js library for testing modern web applications
Stars: ✭ 2,964 (-6.02%)
Mutual labels:  automation, headless-chrome
CrawlerSamples
This is a Puppeteer+AngleSharp crawler console app samples, used C# 7.1 coding and dotnet core build.
Stars: ✭ 36 (-98.86%)
Mutual labels:  headless-chrome, puppeteer
Cdp4j
cdp4j - Chrome DevTools Protocol for Java
Stars: ✭ 232 (-92.64%)
Mutual labels:  automation, crawling
pythonista-chromeless
Serverless selenium which dynamically execute any given code.
Stars: ✭ 31 (-99.02%)
Mutual labels:  scraping, headless-chrome
Automagica
AI-powered Smart Robotic Process Automation 🤖
Stars: ✭ 2,610 (-17.25%)
Mutual labels:  automation, rpa
PythonScrapyBasicSetup
Basic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (-98.19%)
Mutual labels:  scraping, web-scraping
scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
Stars: ✭ 42 (-98.67%)
Mutual labels:  scraping, crawling
core
The complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (-64.81%)
Mutual labels:  crawling, web-scraping
actor-content-checker
You can use this act to monitor any page's content and get a notification when content changes.
Stars: ✭ 16 (-99.49%)
Mutual labels:  web-scraping, apify
socials
👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-98.83%)
Mutual labels:  scraping, crawling
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-98.32%)
Mutual labels:  scraping, crawling
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (-77.46%)
Mutual labels:  scraping, web-scraping
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-99.46%)
Mutual labels:  scraping, crawling
hc-pdf-server
Convert HTML to PDF Server by headless chrome with TypeScript. The new version of hcep-pdf-server.
Stars: ✭ 24 (-99.24%)
Mutual labels:  headless-chrome, puppeteer
crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-99.3%)
Mutual labels:  scraping, crawling
puppeteer-autoscroll-down
Handle infinite scroll on websites by puppeteer
Stars: ✭ 40 (-98.73%)
Mutual labels:  headless-chrome, puppeteer
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (-98.32%)
Mutual labels:  scraping, web-scraping
ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (-97.84%)
Mutual labels:  scraping, crawling
after-work.js
[DEPRECATED] CLI for automated tests in web projects.
Stars: ✭ 56 (-98.22%)
Mutual labels:  headless-chrome, puppeteer
LInkedIn-Reverese-Lookup
🔎Search LinkedIn profile by email address📧
Stars: ✭ 20 (-99.37%)
Mutual labels:  scraping, puppeteer
puppeteer-botcheck
🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (-98.67%)
Mutual labels:  scraping, puppeteer
puppeteer-instagram
Instagram automation driven by headless chrome.
Stars: ✭ 87 (-97.24%)
Mutual labels:  headless-chrome, puppeteer
proxycrawl-python
ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-98.38%)
Mutual labels:  scraping, crawling
nest-puppeteer
Puppeteer (Headless Chrome) provider for Nest.js
Stars: ✭ 68 (-97.84%)
Mutual labels:  headless-chrome, puppeteer
throughout
🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-99.49%)
Mutual labels:  headless-chrome, puppeteer
phantom-lord
Handy API for Headless Chromium
Stars: ✭ 24 (-99.24%)
Mutual labels:  headless-chrome, puppeteer
go-scrapy
Web crawling and scraping framework for Golang
Stars: ✭ 17 (-99.46%)
Mutual labels:  scraping, crawling
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-99.52%)
Mutual labels:  web-scraping, web-crawling
thal
译文:Puppeteer 与 Chrome Headless —— 从入门到爬虫
Stars: ✭ 651 (-79.36%)
Mutual labels:  headless-chrome, puppeteer
feedsearch-crawler
Crawl sites for RSS, Atom, and JSON feeds.
Stars: ✭ 23 (-99.27%)
Mutual labels:  scraping, crawling
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-99.24%)
Mutual labels:  scraping, puppeteer
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-98.35%)
Mutual labels:  scraping, crawling
1-60 of 2136 similar projects