All Projects → iHealth-ecnu → iHealth_crawler

iHealth-ecnu / iHealth_crawler

Licence: GPL-2.0 license
iHealth 项目的内容爬虫(一个基于 python 和 MongoDB 的医疗咨询爬虫)

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to iHealth crawler

dnevnikru
dnevnik.ru parser
Stars: ✭ 20 (-16.67%)
Mutual labels:  requests, lxml
Tieba-Birthday-Spider
百度贴吧生日爬虫,可抓取贴吧内吧友生日,并且在对应日期自动发送祝福
Stars: ✭ 28 (+16.67%)
Mutual labels:  pymongo, requests
crawler
requests+lxml爬虫,简单爬虫架构
Stars: ✭ 72 (+200%)
Mutual labels:  requests, lxml
Instagram Stalker Scraper
(UNMAINTAINED) Fetch data of any public Instagram profile, without using api
Stars: ✭ 39 (+62.5%)
Mutual labels:  requests, lxml
Requests Html
Pythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+51016.67%)
Mutual labels:  requests, lxml
megadlbot oss
Megatron was a telegram file management bot that helped a lot of users, specially movie channel managers to upload their files to telegram by just providing a link to it. The project initially started as roanuedhuru_bot which lately retired and came back as Megatron which was a side project of the famous Maldivian Telegram community - @baivaru u…
Stars: ✭ 151 (+529.17%)
Mutual labels:  pymongo
buscaimoveis
Agregador de anúncios de imóveis a venda
Stars: ✭ 15 (-37.5%)
Mutual labels:  pymongo
axiol
🚀 An advanced Python Discord bot for everyone
Stars: ✭ 39 (+62.5%)
Mutual labels:  pymongo
Pulse
✨ Pulse is a global state and logic framework for reactive Typescript & Javascript applications. Supporting frameworks like VueJS, React and React Native.
Stars: ✭ 243 (+912.5%)
Mutual labels:  requests
doc2vec pymongo
Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)
Stars: ✭ 36 (+50%)
Mutual labels:  pymongo
Mongoengine
MongoEngine is a Python Object-Document Mapper for working with MongoDB. Documentation is available at https://mongoengine-odm.readthedocs.io - there is currently a tutorial, a user guide, and an API reference.
Stars: ✭ 3,632 (+15033.33%)
Mutual labels:  pymongo
Python-MongoDB-Example
A Live working Example Application of Python, Qt, PySide2, MongoDB, PyMongo, QTreeView, QAbstractTableModel
Stars: ✭ 41 (+70.83%)
Mutual labels:  pymongo
ask-hadith
🔎 A Hadith search engine
Stars: ✭ 33 (+37.5%)
Mutual labels:  pymongo
pymongo inmemory
A mongo mocking library with an ephemeral MongoDB running in memory.
Stars: ✭ 25 (+4.17%)
Mutual labels:  pymongo
serlist
Search engine results page scraper
Stars: ✭ 12 (-50%)
Mutual labels:  lxml
Curequests
Curio + Requests: Async HTTP for Humans
Stars: ✭ 243 (+912.5%)
Mutual labels:  requests
Deploy-ML-model
No description or website provided.
Stars: ✭ 57 (+137.5%)
Mutual labels:  pymongo
flask-admin-boilerplate
Flask Admin Boilerplate with MongoDB
Stars: ✭ 63 (+162.5%)
Mutual labels:  pymongo
quart-motor
Motor support for Quart applications
Stars: ✭ 14 (-41.67%)
Mutual labels:  pymongo
mongu
🌱 Yet another Python Object-Document Mapper on top of PyMongo. It's lightweight, intuitive to use and easy to understand.
Stars: ✭ 15 (-37.5%)
Mutual labels:  pymongo

iHealth_crawler

iHealth 项目的内容爬虫

TODO

安装依赖

  • requests
  • pymongo
  • lxml

MongoDB 配置

  1. 开启 MongoDB 权限认证:在配置文件中加入 auth = true

  2. 创建管理员用户(如果你是第一次使用 MongoDB)

use admin
db.createUser({user:"admin",pwd:"admin123",roles:["userAdminAnyDatabase"]})

管理员用户用来创建其他数据库和用户

  1. 使用管理员账户远程登录
C:\Users\cs>mongo [your_ip]:27017
> use admin
switched to db admin
> db.auth('admin','admin123')
1
  1. 创建 iHealth 数据库,以及操作该数据库的用户
use iHealth         // 创建数据库,并作为认证数据库
db.createUser({
    user:'admin',   // 用户名
    pwd:'admin123', // 用户密码
    roles:[{role:'readWrite',db:'iHealth'}]     // 为该用户赋予数据库的读写权限
})
  1. 使用该用户远程登录 iHealth 数据库
C:\Users\cs>mongo [your_ip]:27017
> use iHealth
switched to db iHealth
> db.auth('admin','admin123')
1
> db.getCollectionNames()
[ ]

数据库刚刚创建,所以没有数据

启动说明

  1. 安装环境:Python 环境和依赖 + MongoDB 配置

  2. 配置 common.py 中的数据库信息

# 数据库配置
mongo_dbname = 'iHealth'
mongo_host = 'your_ip'          # mongodb 主机地址
mongo_port = 27017              # mongodb 主机端口
mongo_user = 'your_user'        # mongodb 登陆用户
mongo_pwd  = 'your_password'    # mongodb 用户密码
  1. 运行
    • Windows :
    python iHealth_crawler.py
    
    • Linux :
    sh server.sh start
    

注意

  • 脚本功能:
    • server.sh:启动/停止/重启/查看状态/查看日志 heatbox 服务,用法:
    Usages: sh server.sh [start|stop|restart|status|log]
    

参考资料

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].