2702 Open source projects that are alternatives to or similar to Autoscraper

Papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-99.63%)

Mutual labels: crawler, scraper, scraping, web-scraping

Phpscraper

PHP Scraper - a highly opinionated web-interface for PHP

Stars: ✭ 148 (-96.37%)

Mutual labels: scraper, scraping, web-scraping

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (-83.91%)

Mutual labels: crawler, scraper, web-scraping

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (-89.21%)

Mutual labels: crawler, scraper, scraping

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (-93.28%)

Mutual labels: crawler, scraper, webscraping

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (-96.47%)

Mutual labels: crawler, scraper, webscraping

Anime Dl

Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.

Stars: ✭ 190 (-95.34%)

Mutual labels: automation, scraper, scraping

BookingScraper

🌎 🏨 Scrape Booking.com 🏨 🌎

Stars: ✭ 68 (-98.33%)

Mutual labels: scraper, web-scraping, webscraping

Polite

Be nice on the web

Stars: ✭ 253 (-93.79%)

Mutual labels: crawler, scraper, webscraping

bots-zoo

No description or website provided.

Stars: ✭ 59 (-98.55%)

Mutual labels: crawler, scraper, scraping

Goapy

Goal-Oriented Action Planning implementation in Python

Stars: ✭ 33 (-99.19%)

Mutual labels: automation, artificial-intelligence, ai

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+25.8%)

Mutual labels: crawler, scraper, scraping

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-95.81%)

Mutual labels: crawler, scraper, scraping

Scrape Linkedin Selenium

`scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles & company pages, turning the data into structured JSON.

Stars: ✭ 239 (-94.14%)

Mutual labels: scraper, scraping, web-scraping

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (-98.7%)

Mutual labels: scraper, scraping, scrape

Huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

Stars: ✭ 33,694 (+726.44%)

Mutual labels: automation, scraper, webscraping

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+18.64%)

Mutual labels: crawler, scraper, scraping

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (-74.88%)

Mutual labels: scraper, scraping, webscraping

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+281.04%)

Mutual labels: crawler, scraper, scraping

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (-69.44%)

Mutual labels: crawler, scraper, scraping

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output based on dotnet core. This library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-97.55%)

Mutual labels: crawler, scraping, webscraping

Goose Parser

Universal scraping tool, which allows you to extract data using multiple environments

Stars: ✭ 211 (-94.82%)

Mutual labels: crawler, scraper, scraping

Instagram Scraper

Scrapes media, likes, followers, tags and all metadata. Inspired by instagram-php-scraper, bot

Stars: ✭ 2,209 (-45.82%)

Mutual labels: crawler, scraper, scrape

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-93.21%)

Mutual labels: crawler, scraping, web-scraping

scrapers

scrapers for building your own image databases

Stars: ✭ 46 (-98.87%)

Mutual labels: scraper, scraping, scrape

Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

Stars: ✭ 789 (-80.65%)

Mutual labels: crawler, scraper, scraping

ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

Stars: ✭ 103 (-97.47%)

Mutual labels: scraper, scraping, scrape

Rod

A Devtools driver for web automation and scraping

Stars: ✭ 1,392 (-65.86%)

Mutual labels: automation, scraper, web-scraping

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (-88.62%)

Mutual labels: crawler, scraping, web-scraping

Sillynium

Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements

Stars: ✭ 100 (-97.55%)

Mutual labels: automation, scraper, web-scraping

ioweb

Web Scraping Framework

Stars: ✭ 31 (-99.24%)

Mutual labels: scraping, web-scraping, webscraping

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Stars: ✭ 3,154 (-22.64%)

Mutual labels: automation, scraping, web-scraping

browser-pool

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

Stars: ✭ 71 (-98.26%)

Mutual labels: scraping, web-scraping

anime-scraper

[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3

Stars: ✭ 21 (-99.48%)

Mutual labels: scraping, webscraping

angel.co-companies-list-scraping

No description or website provided.

Stars: ✭ 54 (-98.68%)

Mutual labels: scraper, scraping

Lightnet

🌓 Bringing pjreddie's DarkNet out of the shadows #yolo

Stars: ✭ 322 (-92.1%)

Mutual labels: artificial-intelligence, ai

extractnet

A Dragnet that also extracts author, headline, date, keywords from context

Stars: ✭ 52 (-98.72%)

Mutual labels: web-scraping, webscraping

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-98.72%)

Mutual labels: scraper, scraping

InstagramLocationScraper

No description or website provided.

Stars: ✭ 13 (-99.68%)

Mutual labels: scraper, scrape

document-dl

Command line program to download documents from web portals.

Stars: ✭ 14 (-99.66%)

Mutual labels: scraper, scraping

chesf

CHeSF is the Chrome Headless Scraping Framework, very early alpha code for scraping JavaScript-intensive web pages

Stars: ✭ 18 (-99.56%)

Mutual labels: scraping, webscraping

Android-Web-Scraper

Android Web Scraper is a simple library for Android web automation. You can perform web tasks in the background to fetch website data programmatically.

Stars: ✭ 38 (-99.07%)

Mutual labels: webscraping, webautomation

Clai

Command Line Artificial Intelligence or CLAI is an open-source project from IBM Research that aims to bring the power of AI to the command line interface.

Stars: ✭ 320 (-92.15%)

Mutual labels: artificial-intelligence, ai

copycat

A PHP Scraping Class

Stars: ✭ 70 (-98.28%)

Mutual labels: scraper, scraping

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It scrapes recent ads posted for the requested product and dumps them to MongoDB (NoSQL).

Stars: ✭ 15 (-99.63%)

Mutual labels: scraper, web-scraping

Mimo-Crawler

A web crawler that uses Firefox and JavaScript injection to interact with webpages and crawl their content, written in Node.js.