
Henryhaohao / Xiecheng_Comment

Licence: other
🌈 Xiecheng_Comment: multithreaded (Threading) scraper for Ctrip comments on the Lijiang Old Town scenic spot, with word cloud generation

Programming Languages

python


Multithreaded scraping of Ctrip comments on the Lijiang Old Town scenic spot, with word cloud generation

Ctrip official site - http://you.ctrip.com/

Author 😎Henryhaohao😎
Email ♥️[email protected]♥️

🐬Disclaimer

This software is for learning and exchange only. Please do not use it for any commercial purpose. Thank you!

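🐬Introduction

This project uses multithreading (Threading) to scrape Ctrip comments on the Lijiang Old Town scenic spot and generate a word cloud from them.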
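
The multithreaded scraping idea can be sketched roughly as follows. This is a minimal illustration, not the code in xiecheng_comment.py: the names `crawl_pages` and `fetch_page` are hypothetical, and `fetch_page` stands in for whatever HTTP request and parsing the real spider performs for one page of comments.

```python
import queue
import threading


def crawl_pages(page_numbers, fetch_page, num_threads=8):
    """Fetch every page with a pool of worker threads and collect the comments.

    fetch_page is a hypothetical callable: page number -> list of comment strings.
    """
    tasks = queue.Queue()
    for page in page_numbers:
        tasks.put(page)

    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                page = tasks.get_nowait()
            except queue.Empty:
                return  # no pages left for this thread
            try:
                comments = fetch_page(page)
            except Exception as exc:  # one failed page should not stop the others
                comments = []
                print(f"page {page} failed: {exc}")
            with lock:
                results[page] = comments

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Flatten in page order so the output does not depend on thread timing.
    return [c for page in sorted(results) for c in results[page]]


if __name__ == "__main__":
    # Stand-in fetcher so the sketch runs offline; swap in a real request.
    def fake_fetch(page):
        return [f"comment {page}-{i}" for i in range(3)]

    print(len(crawl_pages(range(1, 11), fake_fetch)))
```

Because the work is network-bound, threads spend most of their time waiting on responses, which is why a thread pool helps here even under Python's global interpreter lock.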

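  • Project overview: scrapes the comments on the Lijiang Old Town scenic spot from Ctrip (3,010 in total), applies Chinese word segmentation (Jieba library) and preprocessing to them, and finally generates a word cloud image for "Lijiang Old Town" (Wordcloud library)
  • Spider file: xiecheng_comment.py in the Spiders directory
  • Word cloud generation: run comment_wordcloud.py in the Spiders directory
  • Database configuration: before running, edit config.py in the Spiders directory, which holds the MongoDB settings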
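
The segmentation and word cloud steps listed above might look roughly like the sketch below. It is an assumption-laden illustration rather than the contents of comment_wordcloud.py: the function names, the stopword handling, the minimum token length, and the image size are all hypothetical choices. The tokenizer is passed in as a parameter so the counting logic can be tried without Jieba installed.

```python
from collections import Counter


def word_frequencies(comments, tokenize, stopwords=frozenset(), min_length=2):
    """Count tokens across all comments, skipping stopwords and short tokens."""
    counts = Counter()
    for text in comments:
        for token in tokenize(text):
            token = token.strip()
            if len(token) >= min_length and token not in stopwords:
                counts[token] += 1
    return counts


def render_wordcloud(frequencies, font_path, out_path):
    """Draw the word cloud image; requires the third-party wordcloud package."""
    # Imported lazily so that word_frequencies works without wordcloud installed.
    from wordcloud import WordCloud

    cloud = WordCloud(
        font_path=font_path,  # assumed path to a font that has Chinese glyphs
        width=800,
        height=600,
        background_color="white",
    )
    cloud.generate_from_frequencies(frequencies)
    cloud.to_file(out_path)
```

With Jieba installed, passing `jieba.lcut` as `tokenize` would segment each Chinese comment into a list of words. The resulting counts can then be handed to `render_wordcloud` together with a font file that covers Chinese characters, since the default font of the wordcloud package generally does not render them.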

🐬Runtime Environment

Version: Python3

🐬Install Dependencies

pip3 install -r requirements.txt

🐬Screenshots

  • Word cloud
  • Ctrip scenic spot comments
  • Scraping comments (3,010 comments in 1 second)
  • Comment content
  • Comment count

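🐬Summary

Finally, if you like this project or find it helpful, please give it a Star. It would be real encouragement on my learning journey!
Haha, thank you all! Sending love~
💘💘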
