All Projects → Apify Js → Similar Projects or Alternatives

2136 Open source projects that are alternatives of or similar to Apify Js

Awesome Puppeteer

A curated list of awesome puppeteer resources.

Stars: ✭ 1,728 (-45.21%)

Mutual labels: automation, scraping, crawling, puppeteer, headless-chrome

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+62.62%)

Mutual labels: scraping, crawling, puppeteer, headless-chrome

browser-pool

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

Stars: ✭ 71 (-97.75%)

Mutual labels: scraping, web-scraping, rpa, puppeteer

apify-cli

Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

Stars: ✭ 37 (-98.83%)

Mutual labels: headless-chrome, apify, puppeteer

Phantomas

Headless Chromium-based web performance metrics collector and monitoring tool

Stars: ✭ 2,191 (-30.53%)

Mutual labels: automation, puppeteer, headless-chrome

puppet-master

Puppeteer as a service hosted on Saasify.

Stars: ✭ 25 (-99.21%)

Mutual labels: crawling, headless-chrome, puppeteer

codepen-puppeteer

Use Puppeteer to download pens from Codepen.io as single html pages

Stars: ✭ 22 (-99.3%)

Mutual labels: web-scraping, headless-chrome, puppeteer

ioweb

Web Scraping Framework

Stars: ✭ 31 (-99.02%)

Mutual labels: scraping, web-scraping, web-crawling

Puppeteer Extra

💯 Teach puppeteer new tricks through plugins.

Stars: ✭ 3,397 (+7.7%)

Mutual labels: automation, puppeteer, headless-chrome

double-agent

A test suite of common scraper detection techniques. See how detectable your scraper stack is.

Stars: ✭ 123 (-96.1%)

Mutual labels: scraping, crawling, puppeteer

bots-zoo

No description or website provided.

Stars: ✭ 59 (-98.13%)

Mutual labels: scraping, crawling, puppeteer

Grawler

Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.

Stars: ✭ 98 (-96.89%)

Mutual labels: automation, scraping, crawling

Api Store

Contains all the public APIs listed in Phantombuster's API store. Pull requests welcome!

Stars: ✭ 69 (-97.81%)

Mutual labels: automation, scraping, headless-chrome

Page2image

📷 page2image is a npm package for taking screenshots which also provides CLI command

Stars: ✭ 66 (-97.91%)

Mutual labels: npm, puppeteer, headless-chrome

Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

Stars: ✭ 125 (-96.04%)

Mutual labels: crawling, puppeteer, headless-chrome

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-91.22%)

Mutual labels: scraping, web-scraping, crawling

Ayakashi

⚡️ Ayakashi.io - The next generation web scraping framework

Stars: ✭ 117 (-96.29%)

Mutual labels: automation, web-scraping, headless-chrome

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+29.26%)

Mutual labels: automation, scraping, web-scraping

Webster

a reliable high-level web crawling & scraping framework for Node.js.

Stars: ✭ 364 (-88.46%)

Mutual labels: crawling, puppeteer, headless-chrome

Puphpeteer

A Puppeteer bridge for PHP, supporting the entire API.

Stars: ✭ 1,014 (-67.85%)

Mutual labels: automation, puppeteer, headless-chrome

Nickjs

Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)

Stars: ✭ 494 (-84.34%)

Mutual labels: automation, scraping, headless-chrome

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-94.58%)

Mutual labels: scraping, crawling, puppeteer

Deno Puppeteer

A port of puppeteer running on Deno

Stars: ✭ 128 (-95.94%)

Mutual labels: automation, puppeteer, headless-chrome

zcrawl

An open source web crawling platform

Stars: ✭ 21 (-99.33%)

Mutual labels: scraping, crawling, web-crawling

Whatsapp-Net

Generate a network graph of connections from your WhatsApp groups data

Stars: ✭ 75 (-97.62%)

Mutual labels: scraping, puppeteer

pdf-crawler

SimFin's open source PDF crawler

Stars: ✭ 100 (-96.83%)

Mutual labels: crawling, puppeteer

puppeteer-lambda

Module for using Headless-Chrome by Puppeteer on AWS Lambda.

Stars: ✭ 117 (-96.29%)

Mutual labels: headless-chrome, puppeteer

Puppeteer

Headless Chrome Node.js API

Stars: ✭ 75,197 (+2284.18%)

Mutual labels: automation, headless-chrome

Taiko

A node.js library for testing modern web applications

Stars: ✭ 2,964 (-6.02%)

Mutual labels: automation, headless-chrome

CrawlerSamples

This is a Puppeteer+AngleSharp crawler console app samples, used C# 7.1 coding and dotnet core build.

Stars: ✭ 36 (-98.86%)

Mutual labels: headless-chrome, puppeteer

Cdp4j

cdp4j - Chrome DevTools Protocol for Java

Stars: ✭ 232 (-92.64%)

Mutual labels: automation, crawling

pythonista-chromeless

Serverless selenium which dynamically execute any given code.

Stars: ✭ 31 (-99.02%)

Mutual labels: scraping, headless-chrome

Automagica

AI-powered Smart Robotic Process Automation 🤖

Stars: ✭ 2,610 (-17.25%)

Mutual labels: automation, rpa

PythonScrapyBasicSetup

Basic setup with random user agents and IP addresses for Python Scrapy Framework.

Stars: ✭ 57 (-98.19%)

Mutual labels: scraping, web-scraping

scrape-github-trending

Tutorial for web scraping / crawling with Node.js.

Stars: ✭ 42 (-98.67%)

Mutual labels: scraping, crawling

core

The complete web scraping toolkit for PHP.

Stars: ✭ 1,110 (-64.81%)

Mutual labels: crawling, web-scraping

actor-content-checker

You can use this act to monitor any page's content and get a notification when content changes.

Stars: ✭ 16 (-99.49%)

Mutual labels: web-scraping, apify

socials

👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.

Stars: ✭ 37 (-98.83%)

Mutual labels: scraping, crawling

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (-98.32%)

Mutual labels: scraping, crawling

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (-77.46%)

Mutual labels: scraping, web-scraping

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-99.46%)

Mutual labels: scraping, crawling

hc-pdf-server

Convert HTML to PDF Server by headless chrome with TypeScript. The new version of hcep-pdf-server.

Stars: ✭ 24 (-99.24%)

Mutual labels: headless-chrome, puppeteer

crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

Stars: ✭ 22 (-99.3%)

Mutual labels: scraping, crawling

puppeteer-autoscroll-down

Handle infinite scroll on websites by puppeteer

Stars: ✭ 40 (-98.73%)

Mutual labels: headless-chrome, puppeteer

selectorlib

A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them

Stars: ✭ 53 (-98.32%)

Mutual labels: scraping, web-scraping

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (-97.84%)

Mutual labels: scraping, crawling

after-work.js

[DEPRECATED] CLI for automated tests in web projects.

Stars: ✭ 56 (-98.22%)

Mutual labels: headless-chrome, puppeteer

LInkedIn-Reverese-Lookup

🔎Search LinkedIn profile by email address📧

Stars: ✭ 20 (-99.37%)

Mutual labels: scraping, puppeteer

puppeteer-botcheck

🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!

Stars: ✭ 42 (-98.67%)

Mutual labels: scraping, puppeteer

puppeteer-instagram

Instagram automation driven by headless chrome.

Stars: ✭ 87 (-97.24%)

Mutual labels: headless-chrome, puppeteer

proxycrawl-python

ProxyCrawl Python library for scraping and crawling

Stars: ✭ 51 (-98.38%)

Mutual labels: scraping, crawling

nest-puppeteer

Puppeteer (Headless Chrome) provider for Nest.js

Stars: ✭ 68 (-97.84%)

Mutual labels: headless-chrome, puppeteer

throughout

🎪 End-to-end testing made simple (using Jest and Puppeteer)

Stars: ✭ 16 (-99.49%)

Mutual labels: headless-chrome, puppeteer

phantom-lord

Handy API for Headless Chromium

Stars: ✭ 24 (-99.24%)

Mutual labels: headless-chrome, puppeteer

go-scrapy

Web crawling and scraping framework for Golang

Stars: ✭ 17 (-99.46%)

Mutual labels: scraping, crawling

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-99.52%)

Mutual labels: web-scraping, web-crawling

thal

译文：Puppeteer 与 Chrome Headless —— 从入门到爬虫

Stars: ✭ 651 (-79.36%)

Mutual labels: headless-chrome, puppeteer

feedsearch-crawler

Crawl sites for RSS, Atom, and JSON feeds.

Stars: ✭ 23 (-99.27%)

Mutual labels: scraping, crawling

browser-automation-api

Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.

Stars: ✭ 24 (-99.24%)

Mutual labels: scraping, puppeteer

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-98.35%)

Mutual labels: scraping, crawling

1-60 of 2136 similar projects

›

next*5