AV 电影管理系统， avmoo , javbus , javlibrary 爬虫，线上 AV 影片图书馆，AV 磁力链接数据库，Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (+2264.24%)

Mutual labels: crawler, spider, scraper

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+55.81%)

Mutual labels: crawler, spider, scraper

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (+127.03%)

Mutual labels: crawler, spider, scraper

Freshonions Torscraper

Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion

Stars: ✭ 348 (+1.16%)

Mutual labels: crawler, spider, scraper

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+90.7%)

Mutual labels: crawler, spider, scraper

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-68.9%)

Mutual labels: crawler, spider, scraper

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-92.73%)

Mutual labels: crawler, spider, scraper

Awesome Crawler

A collection of awesome web crawler,spider in different languages

Stars: ✭ 4,793 (+1293.31%)

Mutual labels: crawler, spider, scraper

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-50.29%)

Mutual labels: crawler, spider, scraper

arachnod

High performance crawler for Nodejs

Stars: ✭ 17 (-95.06%)

Mutual labels: crawler, scraper, spider

Weixin Spider

微信公众号爬虫，公众号历史文章，文章评论，文章阅读及在看数据，可视化web页面，可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现，高效微信爬虫，微信公众号爬虫，历史文章，文章评论，数据更新。

Stars: ✭ 287 (-16.57%)

Mutual labels: crawler, spider

Tianyancha

pip安装的天眼查爬虫API，指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.

Stars: ✭ 206 (-40.12%)

Mutual labels: crawler, scraper

Goose Parser

Universal scrapping tool, which allows you to extract data using multiple environments

Stars: ✭ 211 (-38.66%)

Mutual labels: crawler, scraper

Chromium for spider

dynamic crawler for web vulnerability scanner

Stars: ✭ 220 (-36.05%)

Mutual labels: crawler, spider

Webvideobot

Web crawler.

Stars: ✭ 214 (-37.79%)

Mutual labels: crawler, spider

Ruiji.net

crawler framework, distributed crawler extractor

Stars: ✭ 220 (-36.05%)

Mutual labels: crawler, scraper

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+1085.17%)

Mutual labels: crawler, scraper

Hacker News Digest

📰 A responsive interface of Hacker News with summaries and thumbnails.

Stars: ✭ 278 (-19.19%)

Mutual labels: crawler, spider

Fast Lianjia Crawler

直接通过链家 API 抓取数据的极速爬虫，宇宙最快~~ 🚀

Stars: ✭ 247 (-28.2%)

Mutual labels: crawler, spider

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-19.48%)

Mutual labels: crawler, spider

Jssoup

JavaScript + BeautifulSoup = JSSoup

Stars: ✭ 203 (-40.99%)

Mutual labels: crawler, spider

ant

A web crawler for Go

Stars: ✭ 264 (-23.26%)

Mutual labels: scraper, spider

Media Scraper

Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok

Stars: ✭ 206 (-40.12%)

Mutual labels: crawler, scraper

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (-34.01%)

Mutual labels: crawler, spider

Polite

Be nice on the web

Stars: ✭ 253 (-26.45%)

Mutual labels: crawler, scraper

91porn Api

🌭💦 91porn爬虫在线无限制API接口（永久有效，口令每日更新）及在线web预览

Stars: ✭ 341 (-0.87%)

Mutual labels: crawler, spider

Toapi

Every web site provides APIs.

Stars: ✭ 3,209 (+832.85%)

Mutual labels: crawler, spider

Gospider

golang实现的爬虫框架，使用者只需关心页面规则，提供web管理界面。基于colly开发。

Stars: ✭ 285 (-17.15%)

Mutual labels: crawler, spider

Jd mask robot

京东口罩库存监控爬虫(非selenium)，扫码登录、查价、加购、下单、秒杀

Stars: ✭ 216 (-37.21%)

Mutual labels: crawler, spider

Crawlertutorial

爬蟲極簡教學（fetch, parse, search, multiprocessing, API）- PTT 為例

Stars: ✭ 282 (-18.02%)

Mutual labels: crawler, spider

Zhihuspider

多线程知乎用户爬虫，基于python3

Stars: ✭ 201 (-41.57%)

Mutual labels: crawler, spider

Ppspider

web spider built by puppeteer, support task-queue and task-scheduling by decorators，support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架，提供灵活的任务队列管理调度方案，提供便捷的数据保存方案（nedb/mongodb），提供数据可视化和用户交互的实现方案

Stars: ✭ 237 (-31.1%)

Mutual labels: crawler, spider

Skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (-32.85%)

Mutual labels: crawler, scraper

Magic google

Google search results crawler, get google search results that you need

Stars: ✭ 247 (-28.2%)

Mutual labels: crawler, spider

Annie

👾 Fast and simple video download library and CLI tool written in Go

Stars: ✭ 16,369 (+4658.43%)

Mutual labels: crawler, scraper

Java Spider

一个基于webmagic框架二次开发的java爬虫框架实战，已实现能爬取腾讯，搜狐，今日头条（单独集成功能）等资讯内容，配合elasticsearch框架用法，实现了自动爬虫，已投入线上生产使用。

Stars: ✭ 276 (-19.77%)

Mutual labels: spider, scraper

crawler-chrome-extensions

爬虫工程师常用的 Chrome 插件 | Chrome extensions used by crawler developer

Stars: ✭ 53 (-84.59%)

Mutual labels: scraper, spider

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (-20.35%)

Mutual labels: crawler, scraper

Ok ip proxy pool

🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池

Stars: ✭ 196 (-43.02%)

Mutual labels: crawler, spider

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-84.88%)

Mutual labels: scraper, spider

robotstxt

robots.txt file parsing and checking for R

Stars: ✭ 65 (-81.1%)

Mutual labels: scraper, spider

Bt Btt

磁力網站U3C3介紹以及域名更新

Stars: ✭ 261 (-24.13%)

Mutual labels: crawler, spider

aliexscrape

Get Aliexpress product details in JSON

Stars: ✭ 80 (-76.74%)

Mutual labels: scraper, spider

TikTokDownloader PyWebIO

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具，支持API调用，在线批量解析及下载。

Stars: ✭ 919 (+167.15%)

Mutual labels: scraper, spider

OpenScraper

An open source webapp for scraping: towards a public service for webscraping

Stars: ✭ 80 (-76.74%)

Mutual labels: scraper, spider

Spydan

A web spider for shodan.io without using the Developer API.

Stars: ✭ 30 (-91.28%)

Mutual labels: scraper, spider

crawler

A simple and flexible web crawler framework for java.

Stars: ✭ 20 (-94.19%)

Mutual labels: crawler, spider

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-95.64%)

Mutual labels: crawler, scraper

Weibo terminator workflow

Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!

Stars: ✭ 259 (-24.71%)

Mutual labels: crawler, scraper

Zhihu Login

知乎模拟登录，支持提取验证码和保存 Cookies

Stars: ✭ 340 (-1.16%)

Mutual labels: crawler, spider

scraper

图片爬取下载工具，极速爬取下载站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户上传的图片/照片/插画。