All Projects → Supercrawler → Similar Projects or Alternatives

756 Open source projects that are alternatives of or similar to Supercrawler

flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-84.31%)
Mutual labels:  crawler, web-crawler
Infinitycrawler
A simple but powerful web crawler library for .NET
Stars: ✭ 97 (-68.3%)
Mutual labels:  crawler, web-crawler
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+540.85%)
Mutual labels:  crawler, web-crawler
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+114.38%)
Mutual labels:  crawler, web-crawler
Pspider
简单易用的Python爬虫框架,QQ交流群:597510560
Stars: ✭ 1,611 (+426.47%)
Mutual labels:  crawler, web-crawler
Strong Web Crawler
基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。
Stars: ✭ 238 (-22.22%)
Mutual labels:  crawler, web-crawler
Awesome Crawler
A collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+1466.34%)
Mutual labels:  crawler, web-crawler
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-35.29%)
Mutual labels:  crawler, web-crawler
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+19.28%)
Mutual labels:  crawler, web-crawler
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-79.41%)
Mutual labels:  crawler, web-crawler
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+2642.48%)
Mutual labels:  crawler, web-crawler
Spidy
The simple, easy to use command line web crawler.
Stars: ✭ 257 (-16.01%)
Mutual labels:  crawler, web-crawler
Sitemap Generator Cli
Creates an XML-Sitemap by crawling a given site.
Stars: ✭ 214 (-30.07%)
Mutual labels:  sitemap, crawler
Sitemap Generator Crawler
Script that generates a sitemap by crawling a given URL
Stars: ✭ 169 (-44.77%)
Mutual labels:  sitemap, crawler
Zhihu Crawler People
A simple distributed crawler for zhihu && data analysis
Stars: ✭ 182 (-40.52%)
Mutual labels:  crawler, web-crawler
Sitemap Generator
Easily create XML sitemaps for your website.
Stars: ✭ 273 (-10.78%)
Mutual labels:  sitemap, crawler
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-9.48%)
Mutual labels:  crawler, web-crawler
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-87.25%)
Mutual labels:  crawler, web-crawler
Crawlab Lite
Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台
Stars: ✭ 122 (-60.13%)
Mutual labels:  crawler, web-crawler
siteshooter
📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (-79.41%)
Mutual labels:  sitemap, web-crawler
CrawlBox
Easy way to brute-force web directory.
Stars: ✭ 118 (-61.44%)
Mutual labels:  crawler, web-crawler
ComicBookMaker
Script to fetch webcomics and use them to create ebooks.
Stars: ✭ 27 (-91.18%)
Mutual labels:  web-crawler
galer
A fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (-54.9%)
Mutual labels:  crawler
octopus
Recursive and multi-threaded broken link checker
Stars: ✭ 19 (-93.79%)
Mutual labels:  crawler
CLF reactive planning system
This package provides a CLF-based reactive planning system, described in paper: Efficient Anytime CLF Reactive Planning System for a Bipedal Robot on Undulating Terrain. The reactive planning system consists of a 5-Hz planning thread to guide a robot to a distant goal and a 300-Hz Control-Lyapunov-Function-based (CLF-based) reactive thread to co…
Stars: ✭ 21 (-93.14%)
Mutual labels:  robot
Sitemap Php
Library for generating Google sitemap XML files
Stars: ✭ 289 (-5.56%)
Mutual labels:  sitemap
Rcrawler
An R web crawler and scraper
Stars: ✭ 274 (-10.46%)
Mutual labels:  crawler
notspot sim py
This repository contains all the code and files needed to simulate the notspot quadrupedal robot using Gazebo and ROS.
Stars: ✭ 41 (-86.6%)
Mutual labels:  robot
UnChain
A tool to find redirection chains in multiple URLs
Stars: ✭ 77 (-74.84%)
Mutual labels:  web-crawler
Pa11y Ci
Pa11y CI is a CI-centric accessibility test runner, built using Pa11y
Stars: ✭ 291 (-4.9%)
Mutual labels:  sitemap
PY-Login
模拟登录各类网站,操作 API 完成各种不可描述的事情
Stars: ✭ 26 (-91.5%)
Mutual labels:  crawler
Ottodiyesp
build you own internet of robots!
Stars: ✭ 273 (-10.78%)
Mutual labels:  robot
strapi-plugin-sitemap
🔌 Generate a highly customizable sitemap XML in Strapi CMS
Stars: ✭ 136 (-55.56%)
Mutual labels:  sitemap
Hquery.php
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (-3.59%)
Mutual labels:  crawler
Prestasitemapbundle
A symfony bundle that provides tools to build a rich application sitemap. The main goals are : simple, no databases, various namespace (eg. google image), respect constraints etc.
Stars: ✭ 272 (-11.11%)
Mutual labels:  sitemap
MyCrawler
我的爬虫合集
Stars: ✭ 55 (-82.03%)
Mutual labels:  crawler
StuyLib
Award-Winning FRC Library by StuyPulse Team 694
Stars: ✭ 17 (-94.44%)
Mutual labels:  robot
weibo-scraper
Simple Weibo Scraper
Stars: ✭ 50 (-83.66%)
Mutual labels:  crawler
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (-6.54%)
Mutual labels:  crawler
Dingtalk Plugin
Dingtalk for jenkins
Stars: ✭ 272 (-11.11%)
Mutual labels:  robot
eastmoney
python requests + Django+ nodejs koa+ mysql to crawl eastmoney fund and stock data,for data analysis and visualiaztion .
Stars: ✭ 56 (-81.7%)
Mutual labels:  crawler
SitemapTools
A sitemap (sitemap.xml) querying and parsing library for .NET
Stars: ✭ 19 (-93.79%)
Mutual labels:  sitemap
Arachni
Web Application Security Scanner Framework
Stars: ✭ 2,942 (+861.44%)
Mutual labels:  crawler
tg crawler
Just a crawler based on tg-cli for Telegram. Deprecated by now, please use telegram-export.
Stars: ✭ 71 (-76.8%)
Mutual labels:  crawler
rankr
🇰🇷 Realtime integrated information analysis service
Stars: ✭ 21 (-93.14%)
Mutual labels:  crawler
Go Dork
The fastest dork scanner written in Go.
Stars: ✭ 274 (-10.46%)
Mutual labels:  crawler
Ghcrawler
Crawl GitHub APIs and store the discovered orgs, repos, commits, ...
Stars: ✭ 293 (-4.25%)
Mutual labels:  crawler
Gospider
golang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (-6.86%)
Mutual labels:  crawler
Dynamixelsdk
ROBOTIS Dynamixel SDK (Protocol1.0/2.0)
Stars: ✭ 266 (-13.07%)
Mutual labels:  robot
jlsitemap
JL Sitemap - Component sitemap for Joomla
Stars: ✭ 20 (-93.46%)
Mutual labels:  sitemap
awesome-webots
Awesome Webots
Stars: ✭ 46 (-84.97%)
Mutual labels:  robot
Line Bot Tutorial
line-bot-tutorial use python flask
Stars: ✭ 267 (-12.75%)
Mutual labels:  crawler
kinpy
Simple kinematics calculation toolkit for robotics
Stars: ✭ 48 (-84.31%)
Mutual labels:  robot
erdos
Dataflow system for building self-driving car and robotics applications.
Stars: ✭ 135 (-55.88%)
Mutual labels:  robot
Crawlertutorial
爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (-7.84%)
Mutual labels:  crawler
Free gait
An Architecture for the Versatile Control of Legged Robots
Stars: ✭ 263 (-14.05%)
Mutual labels:  robot
codes-scratch-crawler
读书笔记《自己动手写网络爬虫》,自己敲的代码。主要记录了网络爬虫的基本实现,网页去重的算法,网页指纹算法,文本信息挖掘
Stars: ✭ 44 (-85.62%)
Mutual labels:  crawler
Go-Mirai-Client
基于MiraiGo的客户端,使用反向 websocket 收发私聊、群聊消息,消息格式类似onebot。支持多账号,很稳定
Stars: ✭ 90 (-70.59%)
Mutual labels:  robot
Bt Btt
磁力網站U3C3介紹以及域名更新
Stars: ✭ 261 (-14.71%)
Mutual labels:  crawler
indieweb-search
Source code for the IndieWeb search engine.
Stars: ✭ 16 (-94.77%)
Mutual labels:  crawler
1-60 of 756 similar projects