simoncos / Zhihu Analysis Python

Social Network Analysis of Zhihu with Python

Programming Languages

python

Projects that are alternatives to or similar to Zhihu Analysis Python

Data-Mining-on-Social-Media
Python scripts to extract tweets and facebook posts from public users.
Stars: ✭ 99 (-53.95%)
Mutual labels:  data-mining, social-media
Swiftdailyapi
A Swift API framework for ZhiHu's Daily News.
Stars: ✭ 204 (-5.12%)
Mutual labels:  zhihu
Supertinyicons
Under 1KB each! Super Tiny Icons are miniscule SVG versions of your favourite website and app logos
Stars: ✭ 13,177 (+6028.84%)
Mutual labels:  social-media
Mosint
An automated e-mail OSINT tool
Stars: ✭ 184 (-14.42%)
Mutual labels:  social-media
Ail Framework
AIL framework - Analysis Information Leak framework
Stars: ✭ 191 (-11.16%)
Mutual labels:  data-mining
Instascrape
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-6.05%)
Mutual labels:  data-mining
Dataaspirant codes
Complete machine learning model codes
Stars: ✭ 185 (-13.95%)
Mutual labels:  data-mining
Graphjs
A set of widgets for a meaningfully social web.
Stars: ✭ 212 (-1.4%)
Mutual labels:  social-media
Estadistica Con R
Apuntes personales sobre estadística, machine learning y lenguaje de programación R
Stars: ✭ 201 (-6.51%)
Mutual labels:  data-mining
Awesome Ensemble Learning
Ensemble learning related books, papers, videos, and toolboxes
Stars: ✭ 195 (-9.3%)
Mutual labels:  data-mining
Pyss3
A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-11.16%)
Mutual labels:  data-mining
Awesome Startup Tools List
List of all tools (apps, services) that startups should use.
Stars: ✭ 188 (-12.56%)
Mutual labels:  social-media
Awesome Deep Learning And Machine Learning Questions
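[Updated irregularly] A curated collection of valuable questions about deep learning, machine learning, reinforcement learning, and data science from sites such as Zhihu, Quora, Reddit, and Stack Exchange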
Stars: ✭ 203 (-5.58%)
Mutual labels:  zhihu
Social Media Data Scripts
Stars: ✭ 188 (-12.56%)
Mutual labels:  social-media
Smartproxy
HTTP(S) Rotating Residential proxies - Code examples & General information
Stars: ✭ 205 (-4.65%)
Mutual labels:  data-mining
Griffith
A React-based web video player
Stars: ✭ 2,287 (+963.72%)
Mutual labels:  zhihu
Free Gophers Pack
✨ This pack of 100+ gopher pictures and elements will help you to build own design of almost anything related to Go Programming Language: presentations, posts in blogs or social media, courses, videos and many, many more.
Stars: ✭ 2,343 (+989.77%)
Mutual labels:  social-media
Zhihuspider
A multithreaded Zhihu user crawler, based on Python 3
Stars: ✭ 201 (-6.51%)
Mutual labels:  zhihu
Vue Zhihudaily
🗞 Zhihu Daily web app, built on Vue 2.3
Stars: ✭ 213 (-0.93%)
Mutual labels:  zhihu
Qminer
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (-4.19%)
Mutual labels:  data-mining

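Zhihu Social Network Analysis

Introduction

The project contains a multithreaded crawler based on zhihu-python, data I/O (SQLite, csv), and analysis of the user follow network (using networkx as the graph algorithm library).

Note: the copy of zhihu-python used in this project has diverged from the original version.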
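
As a rough illustration of the data flow described in the introduction (csv, then SQLite, then follow-network analysis), the sketch below loads follow edges from csv text into an in-memory SQLite table and ranks users by follower count, i.e. in-degree in the follow graph. The table name `follow`, its columns, and the sample rows are hypothetical and are not taken from zhihu_schema.sql. Plain SQL stands in for networkx here only to keep the example dependency-free.

```python
import csv
import io
import sqlite3

# Hypothetical sample data: each row means "follower follows followee".
SAMPLE_CSV = """follower,followee
alice,bob
carol,bob
bob,alice
dave,bob
"""


def load_edges(conn, csv_file):
    """Create a (hypothetical) follow table and fill it from a csv file object."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS follow ("
        "follower TEXT NOT NULL, followee TEXT NOT NULL, "
        "PRIMARY KEY (follower, followee))"
    )
    reader = csv.DictReader(csv_file)
    rows = [(r["follower"], r["followee"]) for r in reader]
    # INSERT OR IGNORE skips duplicate edges, e.g. from repeated crawls.
    conn.executemany("INSERT OR IGNORE INTO follow VALUES (?, ?)", rows)
    conn.commit()


def follower_counts(conn):
    """Return (user, follower_count) pairs, most-followed first."""
    cur = conn.execute(
        "SELECT followee, COUNT(*) AS n FROM follow "
        "GROUP BY followee ORDER BY n DESC, followee ASC"
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")
load_edges(conn, io.StringIO(SAMPLE_CSV))
ranking = follower_counts(conn)
print(ranking)  # [('bob', 3), ('alice', 1)]
```

With networkx, the same edges could instead be loaded into an `nx.DiGraph` via `add_edges_from`, after which `G.in_degree()` yields the same counts and other graph algorithms become available.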

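Details

File descriptions

  • crawler folder: the crawler, which collects Zhihu data breadth-first and stores it in csv format (the current version of this code is broken: the data files it produces are not compatible with zhihu_database.py, and some topics are missed during crawling; both issues are still to be fixed)
  • zhihu_schema.sql: the schema of the SQLite database
  • zhihu_database.py: imports the csv data into the database
  • zhihu_analysis.py: extracts data from the database and analyzes it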
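
The breadth-first strategy mentioned for the crawler can be sketched as follows. This is a minimal single-threaded illustration, not the project's crawler: `fetch_followees` is a hypothetical stand-in for whatever zhihu-python call returns a user's followees, and here it reads from an in-memory dictionary instead of the network. The csv column names are likewise assumptions.

```python
import csv
import io
from collections import deque

# Hypothetical follow data standing in for real Zhihu responses.
FAKE_NETWORK = {
    "seed": ["a", "b"],
    "a": ["c"],
    "b": ["a", "c"],
    "c": ["seed"],
}


def fetch_followees(user):
    """Hypothetical stand-in for a network call returning a user's followees."""
    return FAKE_NETWORK.get(user, [])


def bfs_crawl(seed, max_users, out_file):
    """Visit users breadth-first from seed; write (follower, followee) rows as csv."""
    writer = csv.writer(out_file)
    writer.writerow(["follower", "followee"])
    visited = {seed}          # users already queued, to avoid revisiting
    queue = deque([seed])     # FIFO queue gives breadth-first order
    order = []
    while queue and len(order) < max_users:
        user = queue.popleft()
        order.append(user)
        for followee in fetch_followees(user):
            writer.writerow([user, followee])
            if followee not in visited:
                visited.add(followee)
                queue.append(followee)
    return order


buffer = io.StringIO()
crawl_order = bfs_crawl("seed", max_users=10, out_file=buffer)
print(crawl_order)  # ['seed', 'a', 'b', 'c']
print(buffer.getvalue())
```

A multithreaded variant would typically swap the deque for a thread-safe `queue.Queue` and guard `visited` with a lock; how the project itself handles this is not shown here.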

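Known crawler issues and (possible) causes

The crawler has gone unmaintained for a long time. Since crawling is not the focus of this project, there are no plans to update it, so proceed with caution :)

  • zhihu-python raises InsecureRequestWarning | cause: urllib3
  • topic.py misses some topic tags while crawling | cause unknown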

Future plans

Later on, the existing dataset may be used for further analysis, such as user clustering or a user-topic-question network.
