A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+637.08%)

Mutual labels: spider, web-crawler

Examples Of Web Crawlers

一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )

Stars: ✭ 10,724 (+11949.44%)

Mutual labels: spider, selenium

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+211.24%)

Mutual labels: spider, web-crawler

zhihu-crawler

徒手实现定时爬取知乎，从中发掘有价值的信息，并可视化爬取的数据作网页展示。

Stars: ✭ 56 (-37.08%)

Mutual labels: spider, selenium

Pulsar

Turn large Web sites into tables and charts using simple SQLs.

Stars: ✭ 100 (+12.36%)

Mutual labels: web-crawler, selenium

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+9329.21%)

Mutual labels: spider, web-crawler

Spider

Spider项目将会不断更新本人学习使用过的爬虫方法！！！

Stars: ✭ 16 (-82.02%)

Mutual labels: spider, selenium

Spider Flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

Stars: ✭ 365 (+310.11%)

Mutual labels: spider, web-crawler

Python Spider

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

Stars: ✭ 615 (+591.01%)

Mutual labels: spider, selenium

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+2103.37%)

Mutual labels: spider, web-crawler

ant

A web crawler for Go

Stars: ✭ 264 (+196.63%)

Mutual labels: spider, web-crawler

Netdiscovery

NetDiscovery 是一款基于 Vert.x、RxJava 2 等框架实现的通用爬虫框架/中间件。

Stars: ✭ 573 (+543.82%)

Mutual labels: spider, selenium

Infospider

INFO-SPIDER 是一个集众多数据源于一身的爬虫工具箱🧰，旨在安全快捷的帮助用户拿回自己的数据，工具代码开源，流程透明。支持数据源包括GitHub、QQ邮箱、网易邮箱、阿里邮箱、新浪邮箱、Hotmail邮箱、Outlook邮箱、京东、淘宝、支付宝、中国移动、中国联通、中国电信、知乎、哔哩哔哩、网易云音乐、QQ好友、QQ群、生成朋友圈相册、浏览器浏览历史、12306、博客园、CSDN博客、开源中国博客、简书。

Stars: ✭ 5,984 (+6623.6%)

Mutual labels: spider, selenium

Alipayspider Scrapy

AlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)

Stars: ✭ 70 (-21.35%)

Mutual labels: spider, selenium

Maman

Rust Web Crawler saving pages on Redis

Stars: ✭ 39 (-56.18%)

Mutual labels: spider, web-crawler

Zhihu Crawler People

A simple distributed crawler for zhihu && data analysis

Stars: ✭ 182 (+104.49%)

Mutual labels: spider, web-crawler

Crawlab Lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

Stars: ✭ 122 (+37.08%)

Mutual labels: spider, web-crawler

weibo topic

微博话题关键词,个人微博采集, 微博博文一键删除 selenium获取cookie,requests处理

Stars: ✭ 28 (-68.54%)

Mutual labels: spider, selenium

OpenYspider

千万级图片爬虫、视频爬虫 [开源版本] Image Spider

Stars: ✭ 122 (+37.08%)

Mutual labels: spider, selenium

Z-Spider

一些爬虫开发的技巧和案例

Stars: ✭ 33 (-62.92%)

Mutual labels: spider

RomanceBreaker

Python script which sends a custom morning message to your significant other every morning at a given time range on Facebook Messenger, WhatsApp, Telegram or SMS, for lazy people

Stars: ✭ 36 (-59.55%)

Mutual labels: selenium

douyin-api

抖音接口、抖音API、抖音数据爬虫、抖音直播数据、抖音直播Api、抖音视频Api、抖音爬虫、抖音去水印、抖音视频下载、抖音视频解析、抖音直播监控、抖音数据采集

Stars: ✭ 41 (-53.93%)

Mutual labels: spider

gcf-packs

Library packs for google cloud functions

Stars: ✭ 48 (-46.07%)

Mutual labels: selenium

geetest test

极验滑动验证码研究报告

Stars: ✭ 66 (-25.84%)

Mutual labels: selenium

FofaMap

FofaMap是一款基于Python3开发的跨平台FOFA数据采集器，支持网站图标查询、批量查询和自定义查询FOFA数据，能够根据查询结果自动去重并生成对应的Excel表格。另外春节特别版还可以调用Nuclei对目标进行漏洞扫描，让你在挖洞路上快人一步。

Stars: ✭ 118 (+32.58%)

Mutual labels: spider

impf-bot

💉🤖 Bot for the German "ImpfterminService - 116117"

Stars: ✭ 167 (+87.64%)

Mutual labels: selenium

carina-demo

Carina demo project.

Stars: ✭ 40 (-55.06%)

Mutual labels: selenium

nightwatch-boilerplate

boilerplate for nightwatch.js with selenium

Stars: ✭ 16 (-82.02%)

Mutual labels: selenium

image-crawler

An image scraper that scraps images from unsplash.com

Stars: ✭ 12 (-86.52%)

Mutual labels: selenium

Scrapy-Spiders

一个基于Scrapy的数据采集爬虫代码库

Stars: ✭ 34 (-61.8%)

Mutual labels: spider

XMQ-BackUp

小密圈备份，圈子/话题/图片/文件。

Stars: ✭ 22 (-75.28%)

Mutual labels: selenium

web-data-extractor

Extracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web format like HTML, XML and JSON.

Stars: ✭ 52 (-41.57%)

Mutual labels: spider

kick-off-web-scraping-python-selenium-beautifulsoup

A tutorial-based introduction to web scraping with Python.

Stars: ✭ 18 (-79.78%)

Mutual labels: selenium

Instagram-Comments-Scraper

Instagram comment scraper using python and selenium. Save the comments into excel.

Stars: ✭ 73 (-17.98%)

Mutual labels: selenium

nivinEdu

拟物校园，一个开源的高校教务移动化解决方案。

Stars: ✭ 24 (-73.03%)

Mutual labels: spider

documentDownloader

download document from book118 for free

Stars: ✭ 72 (-19.1%)

Mutual labels: spider

hupu Album Downloader

虎扑网相册下载工具

Stars: ✭ 17 (-80.9%)

Mutual labels: spider

PyWhatsapp

Python script to control whatsapp web using terminal

Stars: ✭ 20 (-77.53%)

Mutual labels: selenium

scrapy-admin

A django admin site for scrapy

Stars: ✭ 44 (-50.56%)

Mutual labels: spider

test login

问卷星

Stars: ✭ 53 (-40.45%)

Mutual labels: selenium

rb-spider

基于 RabbitMQ 中间件的爬虫的 Ruby 实现 [Developing]

Stars: ✭ 13 (-85.39%)

Mutual labels: spider

instagram-post-scheduler

Python Program To Schedule Your Instagram Posts

Stars: ✭ 30 (-66.29%)

Mutual labels: selenium

L-Spider

A DHT Spider allows you to sniff the torrents and magnets.You can download them directly.

Stars: ✭ 64 (-28.09%)

Mutual labels: spider

zucc xk ZhengFang

ZUCC正方教务系统抢课助手。针对ZUCC正方教务系统模拟登录，爬取课程信息，自动抓包发包抢课。具体实现流程可参考README中的实现原理链接

Stars: ✭ 40 (-55.06%)

Mutual labels: spider

Selenium.WebDriver.Extensions

Extensions for Selenium WebDriver including jQuery/Sizzle selector support.

Stars: ✭ 46 (-48.31%)

Mutual labels: selenium

headless-chrome

Implementation of the new headless chrome with chromedriver and selenium.

Stars: ✭ 34 (-61.8%)

Mutual labels: selenium

aliexpress

An AliExpress spider for Node

Stars: ✭ 39 (-56.18%)

Mutual labels: spider

primefaces-selenium

PrimeFaces testing support for Selenium

Stars: ✭ 16 (-82.02%)

Mutual labels: selenium

hcaptcha-solver-python-selenium

hCaptcha solver and bypasser for Python Selenium. Simple website to try to solve hCaptcha.

Stars: ✭ 32 (-64.04%)

Mutual labels: selenium

arquillian-graphene

Robust Functional Tests leveraging WebDriver with flavour of neat AJAX-ready API

Stars: ✭ 91 (+2.25%)

Mutual labels: selenium

SJS DROPS

Script using requests module to register accounts to Slam Jam Socialism raffles.

Stars: ✭ 21 (-76.4%)

Mutual labels: selenium

Ucampus

解放双手，u校园的题再也不用写啦(暂停维护

Stars: ✭ 28 (-68.54%)

Mutual labels: selenium

1-60 of 818 similar projects

›

next*5