394 open source projects that are alternatives to or similar to landchina-spider

A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.

Stars: ✭ 97 (+646.15%)

Mutual labels: spider

small-spider-project

Everyday web crawlers

Stars: ✭ 14 (+7.69%)

Mutual labels: spider

A PHP crawler

Stars: ✭ 13 (+0%)

Mutual labels: spider

ZUCC ZhenFangHelper

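Automatic login, course selection, and information retrieval for the student edition of the Zhengfang (正方) academic affairs management system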

Stars: ✭ 36 (+176.92%)

Mutual labels: spider

A novel-crawling tool written in Python

Stars: ✭ 75 (+476.92%)

Mutual labels: spider

A movie crawler (spider) written in Golang

Stars: ✭ 168 (+1192.31%)

Mutual labels: spider

DSpiderDemo-Android

Android demo of a client-side crawler

Stars: ✭ 43 (+230.77%)

Mutual labels: spider

🎊 Design and implementation of a lightweight crawler framework.

Stars: ✭ 322 (+2376.92%)

Mutual labels: spider

Python crawlers (amazon, confluence ...)

Stars: ✭ 21 (+61.54%)

Mutual labels: spider

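ICP filing lookup. Queries the ICP filing (备案) information of a company or domain, completes the slider verification automatically, and saves the results to an Excel spreadsheet. Works with the 2022 version of the Ministry of Industry and Information Technology (MIIT) filing management system website. No more repeated slider dragging, and no more paying for VIP on a certain site* tool just to view filing information.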

Stars: ✭ 119 (+815.38%)

Mutual labels: spider

Some scripts and tools

Stars: ✭ 20 (+53.85%)

Mutual labels: spider

Your web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.

Stars: ✭ 422 (+3146.15%)

Mutual labels: spider

A web spider framework

Stars: ✭ 25 (+92.31%)

Mutual labels: spider

Crawler for original-resolution images from the desktop version of the Dongman Zhijia (动漫之家) manga site

Stars: ✭ 67 (+415.38%)

Mutual labels: spider

node-html-crawler

Simple-to-use Node HTML crawler (spider) for a site's web pages

Stars: ✭ 30 (+130.77%)

Mutual labels: spider

bangumi yearly report

No description or website provided.

Stars: ✭ 24 (+84.62%)

Mutual labels: spider

The Spider project will be continuously updated with the crawling methods I have learned and used!

Stars: ✭ 16 (+23.08%)

Mutual labels: spider

Crawler management system with cluster support and elastic scaling. Runs a variety of frameworks and scripts, including feapder, scrapy, selenium, and playwright.

Stars: ✭ 42 (+223.08%)

Mutual labels: spider

Image and video crawler at the scale of tens of millions of items [open-source edition]. Image Spider

Stars: ✭ 122 (+838.46%)

Mutual labels: spider

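Image crawling and download tool. Quickly crawls and downloads images, photos, and illustrations uploaded by designers and users on ZCOOL (站酷, https://www.zcool.com.cn/) and CNU (CNU 视觉, http://www.cnu.cc/).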

Stars: ✭ 64 (+392.31%)

Mutual labels: spider

Computer science fundamentals, data structures, design patterns, and an implementation of the Tomcat middleware

Stars: ✭ 19 (+46.15%)

Mutual labels: spider

crawlBaiduWenku

Possibly the most complete project for crawling Baidu Wenku (百度文库)

Stars: ✭ 63 (+384.62%)

Mutual labels: spider

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (+300%)

Mutual labels: spider

Spider Solitaire for Mac

Stars: ✭ 29 (+123.08%)

Mutual labels: spider

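Research and study of various blocking techniques: anti-crawling, ad blocking, preventing ad injection, fighting scalpers, and so on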

Stars: ✭ 59 (+353.85%)

Mutual labels: spider

Multithreaded crawler for popular recruitment sites in the internet industry

Stars: ✭ 28 (+115.38%)

Mutual labels: spider

Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data

Stars: ✭ 83 (+538.46%)

Mutual labels: spider

Full-site scrape of Meizitu (妹子图), 10 GB of photo sets

Stars: ✭ 80 (+515.38%)

Mutual labels: spider

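Automatic reservation and check-in program for the Li'ang (利昂) library reservation system. Supports the library systems of 38 universities, including Renmin University of China, Beijing Normal University, University of Jinan, and Harbin Institute of Technology.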

Stars: ✭ 39 (+200%)

Mutual labels: spider

Automatic question-answering program 🎉

Stars: ✭ 37 (+184.62%)

Mutual labels: spider

Bilibili manga download

Bilibili manga download tool with a graphical interface

Stars: ✭ 52 (+300%)

Mutual labels: spider

Subbranch-China

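Bank and branch names. A crawler for the names of bank branches across all regions of China, sourced from the WeChat merchant platform, with ready-to-import SQL files already prepared.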

Stars: ✭ 31 (+138.46%)

Mutual labels: spider

Tieba-Birthday-Spider

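Baidu Tieba birthday crawler. Scrapes the birthdays of members in a Tieba forum and automatically sends greetings on the corresponding dates.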

Stars: ✭ 28 (+115.38%)

Mutual labels: spider

An open source webapp for scraping: towards a public service for webscraping

Stars: ✭ 80 (+515.38%)

Mutual labels: spider

PTT Beauty Spider

Crawler and image downloader for the PTT Beauty board

Stars: ✭ 47 (+261.54%)

Mutual labels: spider

Sina crawler based on Python + Selenium. Saves cookies after a simulated login to persist the login state. Crawls trending Weibo posts related to the keywords you enter.

Stars: ✭ 25 (+92.31%)

Mutual labels: spider

Uses the sharing feature of MaiMemo (墨墨背单词) to claim the daily reward of 20 extra words on the word limit (multithreaded)

Stars: ✭ 45 (+246.15%)

Mutual labels: spider

Scrapy IPProxyPool

Free IP proxy pool. A plugin for the Scrapy crawler framework.

Stars: ✭ 100 (+669.23%)

Mutual labels: spider

Some small projects and some articles

Stars: ✭ 56 (+330.77%)

Mutual labels: spider

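Collects Weibo posts by topic keyword and from individual accounts, and deletes Weibo posts in one click. Cookies are obtained with selenium, and the rest is handled with requests.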

Stars: ✭ 28 (+115.38%)

Mutual labels: spider

TikTokDownloader PyWebIO

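🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin/TikTok data crawling tool that supports API calls and online batch parsing and downloading.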

Stars: ✭ 919 (+6969.23%)

Mutual labels: spider

Get movie info from Douban (豆瓣) and display it in your terminal

Stars: ✭ 17 (+30.77%)

Mutual labels: spider

Golang module to detect bots and crawlers via the user agent

Stars: ✭ 22 (+69.23%)

Mutual labels: spider

A web search engine built with Python which uses TF-IDF and PageRank to sort search results.

Stars: ✭ 52 (+300%)

Mutual labels: spider

Generate an object for testing if a request is sent, request is Mikeal's request.

Stars: ✭ 42 (+223.08%)

Mutual labels: spider

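🌟 Powered by Python 3 (simple learning of spiders). Crawlers for Baidu Wenku, NetEase Cloud Music songs, Douban movies, GitHub, JD.com, QQ Zone, weather, a VIP parsing helper, TED transcripts, a Wi-Fi cracking script, setting Bing images as the desktop wallpaper, and more.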

Stars: ✭ 124 (+853.85%)

Mutual labels: spider

blinkist-m4a-downloader

Grabs all of the audio files from all of the Blinkist books

Stars: ✭ 100 (+669.23%)

Mutual labels: spider

NScrapy is a cross-platform distributed spider framework for .NET Core that provides an easy way to write your own spider

Stars: ✭ 88 (+576.92%)

Mutual labels: spider

bet365-websocket-crawler

bet365 bot: real-time match scores and real-time odds from bet365

Stars: ✭ 67 (+415.38%)

Mutual labels: spider

photo-spider-scrapy

10 photo website spiders: Scrapy crawler code for 10 international stock photo sites

Stars: ✭ 17 (+30.77%)

Mutual labels: spider

A web crawler for Go

Stars: ✭ 264 (+1930.77%)

Mutual labels: spider

robots.txt file parsing and checking for R

Stars: ✭ 65 (+400%)

Mutual labels: spider

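A crawler application built on celery as its core framework. Crawler tasks can be added flexibly, and crawls of multiple sites can run at the same time. All components natively support large-scale concurrency and distribution, and together with celery's native distributed invocation this enables large-scale concurrent crawling.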

Stars: ✭ 38 (+192.31%)

Mutual labels: spider

163music spider by scrapy.

Stars: ✭ 60 (+361.54%)

Mutual labels: spider

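A from-scratch implementation that crawls Zhihu on a schedule, mines valuable information from it, and visualizes the crawled data on a web page.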

Stars: ✭ 56 (+330.77%)

Mutual labels: spider

scrapy-distributed

A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.

Stars: ✭ 38 (+192.31%)

Mutual labels: spider

Get Aliexpress product details in JSON

Stars: ✭ 80 (+515.38%)

Mutual labels: spider

ChineseStarsRelationship

Chinese celebrity data crawling. You can even get the relationships between all the people on the internet, and then take it from there! Based on this data, you can do more interesting things, such as social network analysis, relationship network visualization, algorithm research, and more.

Stars: ✭ 26 (+100%)

Mutual labels: spider

😚 Q & A website based on Spring Boot.

Stars: ✭ 46 (+253.85%)

Mutual labels: spider

🧊 A cute and headstrong Bilibili video downloader (bilili V2)

Stars: ✭ 383 (+2846.15%)

Mutual labels: spider
