Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → tim232385 → Webvideobot

tim232385 / Webvideobot

Licence: mit

Web crawler.

Programming Languages

68154 projects - #9 most used programming language

Labels

Projects that are alternatives of or similar to Webvideobot

多线程知乎用户爬虫，基于python3

Stars: ✭ 201 (-6.07%)

Mutual labels: crawler, spider

Ncov2019 data crawler

疫情数据爬虫，2019新型冠状病毒数据仓库，轨迹数据，同乘数据，报道

Stars: ✭ 175 (-18.22%)

Mutual labels: crawler, spider

Python爬虫代理IP池(proxy pool)

Stars: ✭ 13,964 (+6425.23%)

Mutual labels: crawler, spider

Crawl some picture for fun

Stars: ✭ 169 (-21.03%)

Mutual labels: crawler, spider

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (-11.21%)

Mutual labels: crawler, spider

Web crawling framework based on asyncio.

Stars: ✭ 2,002 (+835.51%)

Mutual labels: crawler, spider

🥄 A package for building specific Proxy Pool for different Sites.

Stars: ✭ 173 (-19.16%)

Mutual labels: crawler, spider

一款分布式爬虫平台，帮助你更好的管理和开发爬虫。内置一套爬虫定义规则（模版），可使用模版快速定义爬虫，也可当作框架手动开发爬虫。(兴趣使然的项目，用的不爽了就更新)

Stars: ✭ 158 (-26.17%)

Mutual labels: crawler, spider

💐Marmot | Web Crawler/HTTP protocol Download Package 🐭

Stars: ✭ 186 (-13.08%)

Mutual labels: crawler, spider

Lianjia Beike Spider

链家网和贝壳网房价爬虫，采集北京上海广州深圳等21个中国主要城市的房价数据（小区，二手房，出租房，新房），稳定可靠快速！支持csv,MySQL, MongoDB,Excel, json存储，支持Python2和3，图表展示数据，注释丰富，点星支持，仅供学习参考，请勿用于商业用途，后果自负。

Stars: ✭ 2,257 (+954.67%)

Mutual labels: crawler, spider

JavaScript + BeautifulSoup = JSSoup

Stars: ✭ 203 (-5.14%)

Mutual labels: crawler, spider

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+7159.35%)

Mutual labels: crawler, spider

Scrapingoutsourcing

ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个

Stars: ✭ 164 (-23.36%)

Mutual labels: crawler, spider

🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

Stars: ✭ 2,392 (+1017.76%)

Mutual labels: crawler, spider

JS逆向研究

Stars: ✭ 159 (-25.7%)

Mutual labels: crawler, spider

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-20.09%)

Mutual labels: crawler, spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+894.86%)

Mutual labels: crawler, spider

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+816.36%)

Mutual labels: crawler, spider

Zhihu Crawler People

A simple distributed crawler for zhihu && data analysis

Stars: ✭ 182 (-14.95%)

Mutual labels: crawler, spider

稳健高效的评分制-针对性- IP代理池 + API服务，可以自己插入采集器进行代理IP的爬取，针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库，支持MongoDB 4.0 使用 Python3.7（Scored IP proxy pool ,customise proxy data crawler can be added anytime）

Stars: ✭ 195 (-8.88%)

Mutual labels: crawler, spider

View All Similar Projects ➔

Web Vedeo Bot

Easy install, all dependencies are already included, just need to have a java environment.

Environment, Architecture

Java1.8
Crawler4j
Spring Boot x H2 Db

How to use

git clone https://github.com/tim232385/WebVideoBot.git
cd WebVideoBot
java -jar PornBot.jar
**DOWNLOAD_VIDEO default is N** See Default Configuration

Database Description

http://localhost:8000/h2-console/

JDBC URL: jdbc:h2:~/porn/porn-db

User Name: sa

Password: ~~empty~~

Record Table：

 Table_Name               ：PORN_RECORD
 viewKey                  ：The website's video unique key.
 imageUrl                 ：Image url of video.
 linkUrl                  ：Video jump to Website`s link
 videoUrl                 ：Video adrress.
 videoTitle               ：Title of video.
 videoDuration            ：Video click count.
 videoQuality             ：Defualt quality - 240, 480, 960, 1280p.
 download                 ：Has been downloaded. True or false.
 createdTime              ：The record created time.
 filePath                 ：The video downloaded path.

Configuration

 FILE_PATH                ：Video download path.
 VIDEO_DOWNLOAD_SIZE      ：Maximum download size of the video.(Byte)
 MAX_PAGE_SIZE            ：Crawling page size.
 CONCURRENT_THREAD_SIZE   ：Muilti-threads request amount.
 START_URL                ：Crawling url.
 DOWNLOAD_VIDEO           ：Download video. Y or N.

[See Default Configuration](https://github.com/tim232385/PornBot/blob/master/config.properties

Stargazers over time

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 214

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗