All Projects → fetchurls → Similar Projects or Alternatives

436 Open source projects that are alternatives of or similar to fetchurls

wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-46.39%)
Mutual labels:  spider, wget, crawl
gathertool
gathertool是golang脚本化开发库,目的是提高对应场景程序开发的效率;轻量级爬虫库,接口测试&压力测试库,DB操作库等。
Stars: ✭ 36 (-62.89%)
Mutual labels:  spider, crawl
Infospider
INFO-SPIDER 是一个集众多数据源于一身的爬虫工具箱🧰,旨在安全快捷的帮助用户拿回自己的数据,工具代码开源,流程透明。支持数据源包括GitHub、QQ邮箱、网易邮箱、阿里邮箱、新浪邮箱、Hotmail邮箱、Outlook邮箱、京东、淘宝、支付宝、中国移动、中国联通、中国电信、知乎、哔哩哔哩、网易云音乐、QQ好友、QQ群、生成朋友圈相册、浏览器浏览历史、12306、博客园、CSDN博客、开源中国博客、简书。
Stars: ✭ 5,984 (+6069.07%)
Mutual labels:  spider, crawl
gospider
⚡ Light weight Golang spider framework | 轻量的 Golang 爬虫框架
Stars: ✭ 183 (+88.66%)
Mutual labels:  spider, crawl
Pspider
一个简单的分布式爬虫框架
Stars: ✭ 102 (+5.15%)
Mutual labels:  spider, crawl
Bitextor
Bitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (+73.2%)
Mutual labels:  wget, crawl
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+452.58%)
Mutual labels:  spider, crawl
Proxy pool
Python爬虫代理IP池(proxy pool)
Stars: ✭ 13,964 (+14295.88%)
Mutual labels:  spider, crawl
Scrapy IPProxyPool
免费 IP 代理池。Scrapy 爬虫框架插件
Stars: ✭ 100 (+3.09%)
Mutual labels:  spider, crawl
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+601.03%)
Mutual labels:  spider, crawl
Geetest
geetest,滑动验证码
Stars: ✭ 293 (+202.06%)
Mutual labels:  spider, crawl
Nodespider
[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-65.98%)
Mutual labels:  spider, crawl
crawler-chrome-extensions
爬虫工程师常用的 Chrome 插件 | Chrome extensions used by crawler developer
Stars: ✭ 53 (-45.36%)
Mutual labels:  spider, crawl
Novel Plus
小说精品屋-plus是一个多端(PC、WAP)阅读、功能完善的原创文学CMS系统,由前台门户系统、作家后台管理系统、平台后台管理系统、爬虫管理系统等多个子系统构成,支持多模版、会员充值、订阅模式、新闻发布和实时统计报表等功能,新书自动入库,老书自动更新。
Stars: ✭ 1,122 (+1056.7%)
Mutual labels:  spider, crawl
Geetest
滑动验证码,希望对你们有所帮助❤️
Stars: ✭ 114 (+17.53%)
Mutual labels:  spider, crawl
Zhihu Login
知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (+250.52%)
Mutual labels:  spider, crawl
Python3 Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+2094.85%)
Mutual labels:  spider, crawl
Crack Js Spider
破解JS反爬虫加密参数,已破解中国裁判文书网(2020-06-30更新),淘宝密码,天安保险登录,b站登录,房天下登录,WPS登录,微博登录,有道翻译,网易登录,微信公众号登录,空中网登录,今目标登录,学生信息管理系统登录,共赢金融登录,重庆科技资源共享平台登录,网易云音乐下载,一键解析视频链接,财联社登录。
Stars: ✭ 175 (+80.41%)
Mutual labels:  spider, crawl
crawl
Lightweight library for scalable crawlers in Go.
Stars: ✭ 20 (-79.38%)
Mutual labels:  crawl
bet365-websocket-crawler
bet365 bot: bet365的比赛实时比分数据、实时赔率
Stars: ✭ 67 (-30.93%)
Mutual labels:  spider
ZSpider
基于Electron爬虫程序
Stars: ✭ 37 (-61.86%)
Mutual labels:  spider
Instagram Stalker Scraper
(UNMAINTAINED) Fetch data of any public Instagram profile, without using api
Stars: ✭ 39 (-59.79%)
Mutual labels:  wget
small-spider-project
日常爬虫
Stars: ✭ 14 (-85.57%)
Mutual labels:  spider
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-45.36%)
Mutual labels:  crawl
weixin article spiders
A spiders' program for weixin which made by Express & cheerio
Stars: ✭ 33 (-65.98%)
Mutual labels:  spider
github-endpoints
Find endpoints on GitHub.
Stars: ✭ 118 (+21.65%)
Mutual labels:  urls
crawlBaiduWenku
这可能是爬百度文库最全的项目了
Stars: ✭ 63 (-35.05%)
Mutual labels:  spider
ant
A web crawler for Go
Stars: ✭ 264 (+172.16%)
Mutual labels:  spider
Web-Iota
Iota is a web scraper which can find all of the images and links/suburls on a webpage
Stars: ✭ 60 (-38.14%)
Mutual labels:  spider
article-spider
文章采集工具 Article collection tool
Stars: ✭ 130 (+34.02%)
Mutual labels:  spider
TikTokDownloader PyWebIO
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具,支持API调用,在线批量解析及下载。
Stars: ✭ 919 (+847.42%)
Mutual labels:  spider
TaobaoSpider
This taobao spider has been archived
Stars: ✭ 28 (-71.13%)
Mutual labels:  spider
Novel-crawler
这是一个用Python写的小说爬虫软件
Stars: ✭ 75 (-22.68%)
Mutual labels:  spider
grapy
Grapy, a fast high-level web crawling framework for Python 3.3 or later base on asyncio.
Stars: ✭ 18 (-81.44%)
Mutual labels:  spider
MoMo
利用墨墨背单词的分享功能拿每日20个的单词上限奖励(多线程
Stars: ✭ 45 (-53.61%)
Mutual labels:  spider
scrapy helper
Dynamic configurable crawl (动态可配置化爬虫)
Stars: ✭ 84 (-13.4%)
Mutual labels:  spider
main project
基于nodejs的网络聊天室、爬虫,vue音乐播放器,及php后台开发的管理系统等项目
Stars: ✭ 49 (-49.48%)
Mutual labels:  spider
crawlerdetect
Golang module to detect bots and crawlers via the user agent
Stars: ✭ 22 (-77.32%)
Mutual labels:  spider
GitHub-Trending-Crawler
Crawling GitHub Trending Pages every day
Stars: ✭ 55 (-43.3%)
Mutual labels:  spider
url-regex-safe
Regular expression matching for URL's. Maintained, safe, and browser-friendly version of url-regex. Resolves CVE-2020-7661 for Node.js servers.
Stars: ✭ 59 (-39.18%)
Mutual labels:  urls
bangumi yearly report
No description or website provided.
Stars: ✭ 24 (-75.26%)
Mutual labels:  spider
bilibili-smallvideo
🕷️用于爬取B站前top100的小视频
Stars: ✭ 133 (+37.11%)
Mutual labels:  spider
DSpiderDemo-Android
客户端爬虫安卓端demo
Stars: ✭ 43 (-55.67%)
Mutual labels:  spider
scraper
图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
Stars: ✭ 64 (-34.02%)
Mutual labels:  spider
SpiderCard
蜘蛛纸牌 for mac
Stars: ✭ 29 (-70.1%)
Mutual labels:  spider
spider
裁判文书网爬虫
Stars: ✭ 19 (-80.41%)
Mutual labels:  spider
HTML-DEV-ToolLink
HTML Development Tool Link-常用的在线字符串编解码、代码压缩、美化、JSON格式化、正则表达式、时间转换工具、二维码生成与解码等工具,支持在线搜索和Chrome插件。
Stars: ✭ 44 (-54.64%)
Mutual labels:  urls
DeadPool
该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery原生的分布式调用,实现大规模并发。
Stars: ✭ 38 (-60.82%)
Mutual labels:  spider
Spider
资讯爬虫App
Stars: ✭ 24 (-75.26%)
Mutual labels:  spider
ben-ben-spider
犇犇爬虫
Stars: ✭ 36 (-62.89%)
Mutual labels:  spider
seenreq
Generate an object for testing if a request is sent, request is Mikeal's request.
Stars: ✭ 42 (-56.7%)
Mutual labels:  spider
glyphhanger
Your web font utility belt. It can subset web fonts. It can find unicode-ranges for you automatically. It makes julienne fries.
Stars: ✭ 422 (+335.05%)
Mutual labels:  spider
BaiduSpider
项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (-70.1%)
Mutual labels:  spider
young-crawler
scala结合actor编写的分布式网络爬虫
Stars: ✭ 15 (-84.54%)
Mutual labels:  spider
sede
Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
Stars: ✭ 83 (-14.43%)
Mutual labels:  spider
simpyder
超高速异步协程Python爬虫
Stars: ✭ 74 (-23.71%)
Mutual labels:  spider
imdb-spider
scrapy spider for scraping imdb {movie_id: [recommended, ...]}
Stars: ✭ 23 (-76.29%)
Mutual labels:  spider
goSpider
some small project and some articles
Stars: ✭ 56 (-42.27%)
Mutual labels:  spider
spider
python 爬虫(amazon, confluence ...)
Stars: ✭ 21 (-78.35%)
Mutual labels:  spider
spider-mzitu
妹子图
Stars: ✭ 13 (-86.6%)
Mutual labels:  spider
1-60 of 436 similar projects