simoncos / Zhihu Analysis Python
Social Network Analysis of Zhihu with Python
Stars: ✭ 215
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Zhihu Analysis Python
Data-Mining-on-Social-Media
Python scripts to extract tweets and facebook posts from public users.
Stars: ✭ 99 (-53.95%)
Mutual labels: data-mining, social-media
Swiftdailyapi
A Swift API framework for ZhiHu's Daily News.
Stars: ✭ 204 (-5.12%)
Mutual labels: zhihu
Supertinyicons
Under 1KB each! Super Tiny Icons are miniscule SVG versions of your favourite website and app logos
Stars: ✭ 13,177 (+6028.84%)
Mutual labels: social-media
Ail Framework
AIL framework - Analysis Information Leak framework
Stars: ✭ 191 (-11.16%)
Mutual labels: data-mining
Instascrape
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-6.05%)
Mutual labels: data-mining
Dataaspirant codes
Complete machine learning model codes
Stars: ✭ 185 (-13.95%)
Mutual labels: data-mining
Graphjs
A set of widgets for a meaningfully social web.
Stars: ✭ 212 (-1.4%)
Mutual labels: social-media
Estadistica Con R
Apuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (-6.51%)
Mutual labels: data-mining
Awesome Ensemble Learning
Ensemble learning related books, papers, videos, and toolboxes
Stars: ✭ 195 (-9.3%)
Mutual labels: data-mining
Pyss3
A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-11.16%)
Mutual labels: data-mining
Awesome Startup Tools List
List of all tools (apps, services) that startups should use.
Stars: ✭ 188 (-12.56%)
Mutual labels: social-media
Awesome Deep Learning And Machine Learning Questions
【不定期更新】收集整理的一些网站中(如知乎、Quora、Reddit、Stack Exchange等)与深度学习、机器学习、强化学习、数据科学相关的有价值的问题
Stars: ✭ 203 (-5.58%)
Mutual labels: zhihu
Smartproxy
HTTP(S) Rotating Residential proxies - Code examples & General information
Stars: ✭ 205 (-4.65%)
Mutual labels: data-mining
Free Gophers Pack
✨ This pack of 100+ gopher pictures and elements will help you to build own design of almost anything related to Go Programming Language: presentations, posts in blogs or social media, courses, videos and many, many more.
Stars: ✭ 2,343 (+989.77%)
Mutual labels: social-media
Qminer
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (-4.19%)
Mutual labels: data-mining
知乎社交网络分析
简介
项目包含基于zhihu-python的多线程爬虫,数据I/O(SQLite
,csv
),以及基于用户关注网络的分析(使用networkx作为图算法库)。
注:本项目所使用的zhihu-python已与原版存在差异
详细内容
- Dataset
- 中文
- English
文件说明
-
crawler
文件夹:爬虫部分,以广度优先策略爬取知乎数据,并以csv格式储存(这一部分代码目前版本有误,爬到的数据文件与zhihu_database.py
无法衔接,此外存在topic爬漏的问题,待修复) -
zhihu_schema.sql
:SQLite数据库的schema -
zhihu_database.py
:将csv中的数据导入至数据库中 -
zhihu_analysis.py
:从数据库中提取数据并进行分析
爬虫部分已知问题及(可能)原因
爬虫部分已年久失修,由于这个project的重点不在于爬虫,所以不打算更新了,还请谨慎入坑:)
- zhihu-python InsecureRequestWarning | urlib
- topic.py 会爬漏话题标签 | 原因未知
未来计划
之后考虑利用已有数据集再做一些分析,比如用户聚类、用户-话题-问题网络之类。
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].