All Projects → xfgryujk → TaobaoAnalysis

xfgryujk / TaobaoAnalysis

Licence: MIT License
练习NLP,分析淘宝评论的项目

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to TaobaoAnalysis

Decryptlogin
APIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+6546.43%)
Mutual labels:  crawler, taobao
Examples Of Web Crawlers
一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
Stars: ✭ 10,724 (+38200%)
Mutual labels:  crawler, taobao
Lxspider
爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (+114.29%)
Mutual labels:  crawler, taobao
Python3 Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+7503.57%)
Mutual labels:  crawler, taobao
xianyu
一个解锁闲鱼搜索框的Chrome插件!
Stars: ✭ 49 (+75%)
Mutual labels:  taobao
Fast Lianjia Crawler
直接通过链家 API 抓取数据的极速爬虫,宇宙最快~~ 🚀
Stars: ✭ 247 (+782.14%)
Mutual labels:  crawler
Ppspider
web spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
Stars: ✭ 237 (+746.43%)
Mutual labels:  crawler
Ecommercecrawlers
码云仓库链接:AJay13/ECommerceCrawlers Github 仓库链接:DropsDevopsOrg/ECommerceCrawlers 项目展示平台链接:http://wechat.doonsec.com
Stars: ✭ 3,073 (+10875%)
Mutual labels:  crawler
Python3Webcrawler
🌈Python3网络爬虫实战:QQ音乐歌曲、京东商品信息、房天下、破解有道翻译、构建代理池、豆瓣读书、百度图片、破解网易登录、B站模拟扫码登录、小鹅通、荔枝微课
Stars: ✭ 208 (+642.86%)
Mutual labels:  crawler
ToolsCollection
No description or website provided.
Stars: ✭ 20 (-28.57%)
Mutual labels:  taobao
DeadPool
该项目是一个使用celery作为主体框架的爬虫应用,能够灵活的添加爬虫任务,并且同时运行多站点的爬虫工作,所有组件都能够原生支持规模并发和分布式,加上celery原生的分布式调用,实现大规模并发。
Stars: ✭ 38 (+35.71%)
Mutual labels:  taobao
Weibopicdownloader
免登录下载微博图片 爬虫 Download Weibo Images without Logging-in
Stars: ✭ 247 (+782.14%)
Mutual labels:  crawler
CoolFrame
iOS搭建高可用APP框架,实现快速开发 。
Stars: ✭ 38 (+35.71%)
Mutual labels:  taobao
Strong Web Crawler
基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。
Stars: ✭ 238 (+750%)
Mutual labels:  crawler
opentaobao-go
🎉淘宝Api、淘宝开放平台Api请求基础SDK
Stars: ✭ 12 (-57.14%)
Mutual labels:  taobao
Skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+725%)
Mutual labels:  crawler
three-platformize
一个让 THREE 平台化的项目,目前已适配微信,淘宝,头条小程序,微信小游戏
Stars: ✭ 418 (+1392.86%)
Mutual labels:  taobao
taobaoke
淘宝客小程序源码
Stars: ✭ 17 (-39.29%)
Mutual labels:  taobao
Pyspider
A Powerful Spider(Web Crawler) System in Python.
Stars: ✭ 15,241 (+54332.14%)
Mutual labels:  crawler
Polite
Be nice on the web
Stars: ✭ 253 (+803.57%)
Mutual labels:  crawler

TaobaoAnalysis

练习NLP,分析淘宝评论的项目

项目结构

为了方便,约定本项目中所有程序的当前目录都是项目根目录

.
├─analyze             分析用的主要程序
│  ├─dataprocess      准备训练数据
│  └─models           机器学习模型
├─crawler             爬虫
│  └─Taobao           Scrapy爬虫项目
│     └─spiders       Scrapy爬虫
├─data                所有程序数据
│  ├─crawler          爬虫数据,如要爬的商品ID
│  ├─models           机器学习模型数据
│  ├─plots            统计图
│  └─train            机器学习训练数据,如语料库、正负样本
└─utils               辅助模块,如读写数据库

依赖

Python库见requirements.txt

另外需要安装PhantomJS,并将其所在目录添加到环境变量Path

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].