487 open source projects that are alternatives to or similar to Spidy

Ncov2019 data crawler

Epidemic data crawler: a 2019 novel coronavirus data repository with trajectory data, shared-travel data, and news reports

Stars: ✭ 175 (-31.91%)

Mutual labels: crawler

arachnod

High-performance crawler for Node.js

Stars: ✭ 17 (-93.39%)

Mutual labels: crawler

Pylinkvalidator

pylinkvalidator is a standalone and pure python link validator and crawler that traverses a web site and reports errors (e.g., 500 and 404 errors) encountered.

Stars: ✭ 109 (-57.59%)

Mutual labels: crawler

Vulnx

vulnx 🕷️ is an intelligent bot and automatic shell injector that detects vulnerabilities in multiple types of CMS { `wordpress , joomla , drupal , prestashop .. `}

Stars: ✭ 1,009 (+292.61%)

Mutual labels: crawler

Spoon

🥄 A package for building site-specific proxy pools.

Stars: ✭ 173 (-32.68%)

Mutual labels: crawler

Webvideobot

Web crawler.

Stars: ✭ 214 (-16.73%)

Mutual labels: crawler

Linkcrawler

Cross-platform persistent and distributed web crawler 🔗

Stars: ✭ 109 (-57.59%)

Mutual labels: crawler

Dbworld Search

🔍 A simple search engine built with the Django framework

Stars: ✭ 39 (-84.82%)

Mutual labels: crawler

Scrapedin Linkedin Crawler

Crawler for LinkedIn full profiles 2019

Stars: ✭ 170 (-33.85%)

Mutual labels: crawler

ComicBookMaker

Script to fetch webcomics and use them to create ebooks.

Stars: ✭ 27 (-89.49%)

Mutual labels: web-crawler

Lumberjack

An automated website accessibility scanner and CLI

Stars: ✭ 109 (-57.59%)

Mutual labels: crawler

Gorecon

Gorecon is an all-in-one reconnaissance tool, a.k.a. a Swiss Army knife for reconnaissance: a tool that every pentester or bug hunter might want to add to their arsenal

Stars: ✭ 208 (-19.07%)

Mutual labels: crawler

Fawkes

Fawkes is a tool to search for targets vulnerable to SQL injection. It performs the search using the Google search engine.

Stars: ✭ 108 (-57.98%)

Mutual labels: crawler

Gargantua

The fast website crawler

Stars: ✭ 35 (-86.38%)

Mutual labels: crawler

Proxy pool

Proxy IP pool for Python crawlers (proxy pool)

Stars: ✭ 13,964 (+5333.46%)

Mutual labels: crawler

Diskover

File system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch

Stars: ✭ 977 (+280.16%)

Mutual labels: crawler

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-93.39%)

Mutual labels: crawling

News Please

news-please - an integrated web crawler and information extractor for news that just works.

Stars: ✭ 969 (+277.04%)

Mutual labels: crawler

Fun crawler

Crawl some pictures for fun

Stars: ✭ 169 (-34.24%)

Mutual labels: crawler

Douyin Crawler

Douyin crawler. Crawls a user's posts and likes through a mobile phone proxy

Stars: ✭ 33 (-87.16%)

Mutual labels: crawler

Vw Crawler

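🐞 A simple, lightweight Java crawler framework. With a little knowledge of basic regular expressions and CSS selectors, you can collect data easily.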

Stars: ✭ 32 (-87.55%)

Mutual labels: crawler

Douyin crawler

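Douyin crawler (TikTok crawler): Douyin data collection API and watermark removal for Douyin videos. 100% success rate, no server required, no proxy IP required.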

Stars: ✭ 169 (-34.24%)

Mutual labels: crawler

Universityrecruitment Ssurvey

Using rigorous data to answer the question: what kinds of companies recruit at what kinds of universities?

Stars: ✭ 30 (-88.33%)

Mutual labels: crawler

Raspagem-de-dados-para-iniciantes

Data scraping for beginners using Scrapy and other basic libraries

Stars: ✭ 113 (-56.03%)

Mutual labels: web-crawler

Papercrawler

Crawler used to crawl papers

Stars: ✭ 20 (-92.22%)

Mutual labels: crawler

Scrapingoutsourcing

ScrapingOutsourcing focuses on sharing crawler code, aiming to add one new crawler each week

Stars: ✭ 164 (-36.19%)

Mutual labels: crawler

Onion Crawler

Tor website crawler (specific for Alphabay at the time)

Stars: ✭ 15 (-94.16%)

Mutual labels: crawler

TumblTwo

TumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.

Stars: ✭ 57 (-77.82%)

Mutual labels: crawler

Axegrinder

Crawl websites for accessibility issues from the command line.

Stars: ✭ 12 (-95.33%)

Mutual labels: crawler

Datmusic Api

Alternative for VK Audio API

Stars: ✭ 160 (-37.74%)

Mutual labels: crawler

Ccrawl

Simple CORPORA list crawler

Stars: ✭ 11 (-95.72%)

Mutual labels: crawler

auctus

Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index

Stars: ✭ 34 (-86.77%)

Mutual labels: crawling

Goods Crawling

Crawls product details from Amazon/Best Buy/Costco/6pm

Stars: ✭ 9 (-96.5%)

Mutual labels: crawler

Yispider

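A distributed crawler platform that helps you manage and develop crawlers. It includes a built-in set of crawler definition rules (templates): use the templates to define crawlers quickly, or use it as a framework to develop crawlers by hand. (A hobby project; updated whenever something becomes annoying to use.)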

Stars: ✭ 158 (-38.52%)

Mutual labels: crawler

Symfony Crawler Bundle

Implements the crawler package into Symfony

Stars: ✭ 8 (-96.89%)

Mutual labels: crawler

crawler

A simple and flexible web crawler framework for Java.

Stars: ✭ 20 (-92.22%)

Mutual labels: crawler

Sqliv

Massive SQL injection vulnerability scanner

Stars: ✭ 840 (+226.85%)

Mutual labels: crawler

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-90.27%)

Mutual labels: crawler

doc crawler.py

Explore a website recursively and download all the wanted documents (PDF, ODT…)

Stars: ✭ 22 (-91.44%)

Mutual labels: web-crawler

Tumblthree

A Tumblr Blog Backup Application

Stars: ✭ 923 (+259.14%)

Mutual labels: crawler

Crawler

An easy-to-use, powerful crawler implemented in PHP. Can execute JavaScript.

Stars: ✭ 2,055 (+699.61%)

Mutual labels: crawler

eastmoney

Python requests + Django + Node.js Koa + MySQL to crawl eastmoney fund and stock data, for data analysis and visualization.

Stars: ✭ 56 (-78.21%)

Mutual labels: crawler

Tumblthree

A Tumblr Backup Application

Stars: ✭ 211 (-17.9%)

Mutual labels: crawler

Webmagic

A scalable web crawler framework for Java.

Stars: ✭ 10,186 (+3863.42%)

Mutual labels: crawler

Zhihu Crawler

zhihu-crawler is a high-performance, distributed crawler project based on Java that supports a free HTTP proxy pool and horizontal scaling

Stars: ✭ 890 (+246.3%)

Mutual labels: crawler

Python3 Spider

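Python crawlers in practice: simulated login to major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️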

Stars: ✭ 2,129 (+728.4%)

Mutual labels: crawler

Python

Python scripts: simulated Zhihu login, crawlers, Excel manipulation, WeChat official accounts, remote power-on

Stars: ✭ 7,355 (+2761.87%)

Mutual labels: crawler

socials

👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.

Stars: ✭ 37 (-85.6%)

Mutual labels: crawling

Py3 scripts

Life is short, *****.

Stars: ✭ 5 (-98.05%)

Mutual labels: crawler

Jlitespider

A lite distributed Java spider framework :-)

Stars: ✭ 151 (-41.25%)

Mutual labels: crawler

TaobaoAnalysis

A project for practicing NLP by analyzing Taobao reviews

Stars: ✭ 28 (-89.11%)

Mutual labels: crawler

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-58.37%)

Mutual labels: crawler

lostark-wait-notifier

🐤️ Lost Ark wait notifier

Stars: ✭ 38 (-85.21%)

Mutual labels: crawler

crawlkit

A crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.

Stars: ✭ 23 (-91.05%)

Mutual labels: crawling

Goose Parser

Universal scraping tool, which allows you to extract data using multiple environments

Stars: ✭ 211 (-17.9%)

Mutual labels: crawler

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+502.72%)

Mutual labels: crawler

Crawler

Crawlers, HTTP proxies, simulated login!

Stars: ✭ 106 (-58.75%)

Mutual labels: crawler

Algoliasearch Netlify

Official Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler

Stars: ✭ 208 (-19.07%)

Mutual labels: crawler

D4n155

OWASP D4N155 - Intelligent and dynamic wordlist using OSINT

Stars: ✭ 105 (-59.14%)

Mutual labels: crawler

scrapy-distributed

A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.

Stars: ✭ 38 (-85.21%)

Mutual labels: crawling

301-360 of 487 similar projects
