1,578 open source projects that are alternatives to or similar to Arachnid

Spidr

A versatile Ruby web spidering library that can spider a single site, multiple domains, certain links, or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+864.71%)

Mutual labels: crawler, spider, web-scraping, web-scraper

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+307.35%)

Mutual labels: crawler, spider, web-scraping, crawling

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (+582.35%)

Mutual labels: crawler, web-scraping, web-scraper

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+151.47%)

Mutual labels: crawler, spider, crawling

Instagram Bot

An Instagram bot developed using the Selenium Framework

Stars: ✭ 138 (+102.94%)

Mutual labels: bot, crawler, crawling

Skycaiji

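Skycaiji (蓝天采集器) is a free crawler for collecting and publishing data, built with PHP and MySQL. It can be deployed on cloud servers, collects almost any type of web page, integrates seamlessly with a range of CMS site builders, publishes data in real time without logging in, and runs fully automatically with no manual intervention. Among web big-data collection tools, it is a fully cross-platform cloud crawler system.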

Stars: ✭ 1,514 (+2126.47%)

Mutual labels: crawler, spider, crawling

Webster

A reliable high-level web crawling & scraping framework for Node.js.

Stars: ✭ 364 (+435.29%)

Mutual labels: crawler, spider, crawling

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+22745.59%)

Mutual labels: crawler, spider, crawling

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+547.06%)

Mutual labels: crawler, spider, crawling

Awesome Crawler

A collection of awesome web crawlers and spiders in different languages

Stars: ✭ 4,793 (+6948.53%)

Mutual labels: crawler, spider, web-scraper

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (+233.82%)

Mutual labels: bot, crawler, spider

flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons

Stars: ✭ 48 (-29.41%)

Mutual labels: crawler, spider, crawling

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-63.24%)

Mutual labels: bot, crawler, spider

Xcrawler

A fast, concise, and powerful PHP crawler framework

Stars: ✭ 344 (+405.88%)

Mutual labels: crawler, spider

Lightnovel Crawler

Download and generate e-books from online sources.

Stars: ✭ 344 (+405.88%)

Mutual labels: bot, web-scraper

Freshonions Torscraper

Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion

Stars: ✭ 348 (+411.76%)

Mutual labels: crawler, spider

Social Media Profile Scrapers

Fetch user's data across social media

Stars: ✭ 60 (-11.76%)

Mutual labels: web-scraping, web-scraper

Fictiondown

Stars: ✭ 362 (+432.35%)

Mutual labels: crawler, spider

Signature algorithm

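Request-signing and encryption algorithms for various apps, mini programs, and websites. Currently covers Ziroom (自如), Xiaohongshu (小红书), Danke Apartment (蛋壳公寓), Luckin Coffee (瑞幸咖啡), and Bangkok Airways (bangkokair).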

Stars: ✭ 380 (+458.82%)

Mutual labels: crawler, spider

Gosint

OSINT Swiss Army Knife

Stars: ✭ 401 (+489.71%)

Mutual labels: crawler, spider

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+7013.24%)

Mutual labels: crawler, crawling

Haipproxy

💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis

Stars: ✭ 4,993 (+7242.65%)

Mutual labels: crawler, spider

Go jobs

A look at the job market for Golang

Stars: ✭ 526 (+673.53%)

Mutual labels: crawler, spider

Xxl Crawler

A distributed web crawler framework (XXL-CRAWLER).

Stars: ✭ 561 (+725%)

Mutual labels: crawler, spider

Html2article

Extracts the main body text from HTML web pages

Stars: ✭ 441 (+548.53%)

Mutual labels: crawler, spider

Xsrfprobe

The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.

Stars: ✭ 532 (+682.35%)

Mutual labels: crawler, spider

Netdiscovery

NetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.

Stars: ✭ 573 (+742.65%)

Mutual labels: crawler, spider

Cascadia

Command-line CSS selector built on the Go cascadia package

Stars: ✭ 67 (-1.47%)

Mutual labels: web-scraping, web-scraper

Ttbot

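A Toutiao (今日头条) bot that supports user login, follow, unfollow, fetching followings and followers, publishing articles, posting to Wukong Q&A, liking, commenting, and collecting news of all kinds. Implemented with the Toutiao web API.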

Stars: ✭ 338 (+397.06%)

Mutual labels: crawler, spider

91porn Api

🌭💦 Unrestricted online API for a 91porn crawler (permanently valid, access token updated daily), plus an online web preview

Stars: ✭ 341 (+401.47%)

Mutual labels: crawler, spider

Scavenger

Crawler (Bot) searching for credential leaks on different paste sites.

Stars: ✭ 347 (+410.29%)

Mutual labels: bot, crawler

Zhihu Login

Simulated Zhihu login, with support for extracting CAPTCHAs and saving cookies

Stars: ✭ 340 (+400%)

Mutual labels: crawler, spider

Spider Flow

A new-generation crawler platform that defines crawl workflows graphically, so you can build a crawler without writing code.

Stars: ✭ 365 (+436.76%)

Mutual labels: crawler, spider

Bilili

🍻 bilibili video (including bangumi) and danmaku downloader

Stars: ✭ 379 (+457.35%)

Mutual labels: crawler, spider

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+5895.59%)

Mutual labels: crawler, web-scraping

Learnpython

Basic Python practice code and assorted crawler code

Stars: ✭ 451 (+563.24%)

Mutual labels: crawler, spider

Faster Than Requests

Faster requests on Python 3

Stars: ✭ 639 (+839.71%)

Mutual labels: web-scraping, web-scraper

Beanbun

Beanbun is a multi-process web crawler framework written in PHP, open and highly extensible, based on Workerman.

Stars: ✭ 1,096 (+1511.76%)

Mutual labels: crawler, spider

Grab Site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Stars: ✭ 680 (+900%)

Mutual labels: crawler, spider

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+7442.65%)

Mutual labels: crawler, crawling

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+688.24%)

Mutual labels: crawler, spider

Douyin

API of DouYin for Humans, used to crawl popular videos and music

Stars: ✭ 580 (+752.94%)

Mutual labels: crawler, spider

Toapi

Every web site provides APIs.

Stars: ✭ 3,209 (+4619.12%)

Mutual labels: crawler, spider

Scrapyrt

HTTP API for Scrapy spiders

Stars: ✭ 637 (+836.76%)

Mutual labels: crawler, crawling

Icrawler

A multi-threaded crawler framework with many built-in image crawlers provided.

Stars: ✭ 629 (+825%)

Mutual labels: crawler, spider

Baiduimagespider

A super lightweight Baidu image crawler

Stars: ✭ 591 (+769.12%)

Mutual labels: crawler, spider

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (+1048.53%)

Mutual labels: crawler, spider

Gospider

Gospider - Fast web spider written in Go

Stars: ✭ 785 (+1054.41%)

Mutual labels: crawler, spider

Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

Stars: ✭ 789 (+1060.29%)

Mutual labels: crawler, crawling

Zhihu Crawler

zhihu-crawler is a high-performance, distributed crawler project written in Java that supports a free HTTP proxy pool and horizontal scaling.

Stars: ✭ 890 (+1208.82%)

Mutual labels: crawler, spider

Newcrawler

Free Web Scraping Tool with Java

Stars: ✭ 589 (+766.18%)

Mutual labels: crawler, spider

Creeper

🐾 Creeper - The Next Generation Crawler Framework (Go)

Stars: ✭ 762 (+1020.59%)

Mutual labels: crawler, spider

Torbot

Dark Web OSINT Tool

Stars: ✭ 821 (+1107.35%)

Mutual labels: crawler, spider

Car Prices

A Golang crawler that scrapes the used-car catalog of Autohome (汽车之家)

Stars: ✭ 57 (-16.18%)

Mutual labels: crawler, spider

Awesome Python Primer

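An index of high-quality Chinese-language resources for self-taught Python beginners, covering books, docs, and videos, suited to crawling, web, data analysis, and machine learning.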

Stars: ✭ 57 (-16.18%)

Mutual labels: crawler, spider

Maman

Rust Web Crawler saving pages on Redis

Stars: ✭ 39 (-42.65%)

Mutual labels: crawler, spider

Lizard

💐 Full Amazon Automatic Download

Stars: ✭ 41 (-39.71%)

Mutual labels: crawler, spider

Crawlab

Distributed web crawler admin platform for managing spiders regardless of language or framework.

Stars: ✭ 8,392 (+12241.18%)

Mutual labels: crawler, spider

Nodespider

[DEPRECATED] Simple, flexible, delightful web crawler/spider package

Stars: ✭ 33 (-51.47%)

Mutual labels: crawler, spider

Vulnx

vulnx 🕷️ is an intelligent bot and automatic shell injector that detects vulnerabilities in multiple types of CMS (WordPress, Joomla, Drupal, PrestaShop, ..)

Stars: ✭ 1,009 (+1383.82%)

Mutual labels: bot, crawler

1-60 of 1578 similar projects
