All Projects → luolongfei → qzone-spider

luolongfei / qzone-spider

Licence: other
QQ 空间爬虫,基于 selenium 模拟登录空间,破解滑动验证码,拿到 cookies,然后使用 requests 抓取好友留言板的所有留言与回复,并生成词图。只抓了留言,本来还想抓说说,不过因为我已经好多年不玩 QQ 空间,感觉它对我已经没什么意义了,遂作罢。

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to qzone-spider

whatsapp-web
Simon is a Python library that helps made easy the browser automation for WhatsApp Web service
Stars: ✭ 67 (+191.3%)
Mutual labels:  selenium
giulius-selenium-tests
A test harness that allows Selenium tests to be run using JUnit and test fixtures to be created and injected by a WebDriver-aware Guice
Stars: ✭ 12 (-47.83%)
Mutual labels:  selenium
vaccipy
Automatische Impfterminbuchung für www.impfterminservice.de
Stars: ✭ 548 (+2282.61%)
Mutual labels:  selenium
tithiwa
Automate Web WhatsApp with selenium in python.
Stars: ✭ 17 (-26.09%)
Mutual labels:  selenium
facebook-cleaner
It is almost spring, so time for a pre spring cleaning. This time: taking care of your Facebook. This script can safe you a lot of time if you would try to do that by hand.
Stars: ✭ 52 (+126.09%)
Mutual labels:  selenium
dnevnik mos ru
Python module to get easy machine access to dnevnik.mos.ru
Stars: ✭ 21 (-8.7%)
Mutual labels:  selenium
linkedinBot
Automate the process of sending referral request and cold mailing on LinkedIn
Stars: ✭ 25 (+8.7%)
Mutual labels:  selenium
python-crawler
爬虫学习仓库,适合零基础的人学习,对新手比较友好
Stars: ✭ 37 (+60.87%)
Mutual labels:  selenium
Easytax
A simple automation script that logs into your kra account and files your taxes with one command
Stars: ✭ 13 (-43.48%)
Mutual labels:  selenium
google-image-downloader
A script to download images from images.google.com
Stars: ✭ 28 (+21.74%)
Mutual labels:  selenium
SeleniumTDD
A Selenium TDD framework that incorporates key features of Selenium and TestNG which can be used to create web-based automation scripts.
Stars: ✭ 23 (+0%)
Mutual labels:  selenium
zhihu-crawler
徒手实现定时爬取知乎,从中发掘有价值的信息,并可视化爬取的数据作网页展示。
Stars: ✭ 56 (+143.48%)
Mutual labels:  selenium
automation-for-humans
Converts English statements to automation.
Stars: ✭ 67 (+191.3%)
Mutual labels:  selenium
spydriver
🕵️ Lightweight utility to intercept WebDriver and WebElement method calls.
Stars: ✭ 24 (+4.35%)
Mutual labels:  selenium
throughout
🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-30.43%)
Mutual labels:  selenium
Whatsapp-Bot
Web.whatsapp.com bot made with selenium
Stars: ✭ 39 (+69.57%)
Mutual labels:  selenium
resgen
Keep track of jobs you've applied to, automate resume & cover letter creation; generate PDFs from .odt templates on the fly while scraping the job post and tracking employer status.
Stars: ✭ 31 (+34.78%)
Mutual labels:  selenium
selenium-java
This is the sample repository that we use in the Complete Selenium WebDriver with Java Bootcamp
Stars: ✭ 45 (+95.65%)
Mutual labels:  selenium
devtools-proxy
Multiplexing proxy for Chrome DevTools. Fully compatible with Selenium and ChromeDriver
Stars: ✭ 64 (+178.26%)
Mutual labels:  selenium
TikTok
Download public videos on TikTok using Python with Selenium
Stars: ✭ 37 (+60.87%)
Mutual labels:  selenium

闲话

QQ 空间爬虫,基于 selenium 模拟登录空间,拿到 cookies,然后使用 requests 抓取好友留言板的所有留言与回复,并生成词图。只抓了留言,本来还想抓说说,不过因为我已经好多年不玩 QQ 空间,感觉它对我已经没什么意义了,遂作罢。

演示

这是我初中老号的留言板词云图,这个号已经弃用很多年了,里面黑历史满满,回忆满满,时间过得真他妈快。

我初中老号的留言板词云图

使用

获取源码

$ git clone https://github.com/luolongfei/qzone_spider.git qzone_spider/
$ cd qzone_spider/

安装依赖包

$ pip install -r requirements.txt

配置

# 复制配置
$ cp .env.example .env

# 根据 .env 文件中的注释,将其中对应的项目改为你自己的
$ vim .env

抓取

$ python qzone_spider.py

注意:chromedriver 路径需要根据你本地浏览器的实际情况填写,这个玩意儿必须和你浏览器版本相对应,否则启动不了。你可以去 https://chromedriver.chromium.org/downloads 下载对应的 chromedriver 版本,并在 .env 文件中指明 chromedriver 执行文件的路径。

参考

开源协议

MIT

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].