All Projects → tim232385 → Webvideobot

tim232385 / Webvideobot

Licence: mit
Web crawler.

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to Webvideobot

Zhihuspider
多线程知乎用户爬虫,基于python3
Stars: ✭ 201 (-6.07%)
Mutual labels:  crawler, spider
Ncov2019 data crawler
疫情数据爬虫,2019新型冠状病毒数据仓库,轨迹数据,同乘数据,报道
Stars: ✭ 175 (-18.22%)
Mutual labels:  crawler, spider
Proxy pool
Python爬虫代理IP池(proxy pool)
Stars: ✭ 13,964 (+6425.23%)
Mutual labels:  crawler, spider
Fun crawler
Crawl some picture for fun
Stars: ✭ 169 (-21.03%)
Mutual labels:  crawler, spider
Goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-11.21%)
Mutual labels:  crawler, spider
Gain
Web crawling framework based on asyncio.
Stars: ✭ 2,002 (+835.51%)
Mutual labels:  crawler, spider
Spoon
🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-19.16%)
Mutual labels:  crawler, spider
Yispider
一款分布式爬虫平台,帮助你更好的管理和开发爬虫。 内置一套爬虫定义规则(模版),可使用模版快速定义爬虫,也可当作框架手动开发爬虫。(兴趣使然的项目,用的不爽了就更新)
Stars: ✭ 158 (-26.17%)
Mutual labels:  crawler, spider
Marmot
💐Marmot | Web Crawler/HTTP protocol Download Package 🐭
Stars: ✭ 186 (-13.08%)
Mutual labels:  crawler, spider
Lianjia Beike Spider
链家网和贝壳网房价爬虫,采集北京上海广州深圳等21个中国主要城市的房价数据(小区,二手房,出租房,新房),稳定可靠快速!支持csv,MySQL, MongoDB,Excel, json存储,支持Python2和3,图表展示数据,注释丰富 ,点星支持,仅供学习参考,请勿用于商业用途,后果自负。
Stars: ✭ 2,257 (+954.67%)
Mutual labels:  crawler, spider
Jssoup
JavaScript + BeautifulSoup = JSSoup
Stars: ✭ 203 (-5.14%)
Mutual labels:  crawler, spider
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+7159.35%)
Mutual labels:  crawler, spider
Scrapingoutsourcing
ScrapingOutsourcing专注分享爬虫代码 尽量每周更新一个
Stars: ✭ 164 (-23.36%)
Mutual labels:  crawler, spider
Querylist
🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+1017.76%)
Mutual labels:  crawler, spider
Js Reverse
JS逆向研究
Stars: ✭ 159 (-25.7%)
Mutual labels:  crawler, spider
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-20.09%)
Mutual labels:  crawler, spider
Python3 Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+894.86%)
Mutual labels:  crawler, spider
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+816.36%)
Mutual labels:  crawler, spider
Zhihu Crawler People
A simple distributed crawler for zhihu && data analysis
Stars: ✭ 182 (-14.95%)
Mutual labels:  crawler, spider
Fooproxy
稳健高效的评分制-针对性- IP代理池 + API服务,可以自己插入采集器进行代理IP的爬取,针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库,支持MongoDB 4.0 使用 Python3.7(Scored IP proxy pool ,customise proxy data crawler can be added anytime)
Stars: ✭ 195 (-8.88%)
Mutual labels:  crawler, spider

Web Vedeo Bot

java-1.8 license: MIT

  • Easy install, all dependencies are already included, just need to have a java environment.

porn_flow

Environment, Architecture

  • Java1.8

  • Crawler4j

  • Spring Boot x H2 Db

How to use

git clone https://github.com/tim232385/WebVideoBot.git
cd WebVideoBot
java -jar PornBot.jar
**DOWNLOAD_VIDEO default is N** See Default Configuration

Demo h2_console

Database Description

http://localhost:8000/h2-console/

JDBC URL: jdbc:h2:~/porn/porn-db

User Name: sa

Password: empty

Record Table:

 Table_Name               :PORN_RECORD
 viewKey                  :The website's video unique key.
 imageUrl                 :Image url of video.
 linkUrl                  :Video jump to Website`s link
 videoUrl                 :Video adrress.
 videoTitle               :Title of video.
 videoDuration            :Video click count.
 videoQuality             :Defualt quality - 240, 480, 960, 1280p.
 download                 :Has been downloaded. True or false.
 createdTime              :The record created time.
 filePath                 :The video downloaded path.

Configuration

 FILE_PATH                :Video download path.
 VIDEO_DOWNLOAD_SIZE      :Maximum download size of the video.(Byte)
 MAX_PAGE_SIZE            :Crawling page size.
 CONCURRENT_THREAD_SIZE   :Muilti-threads request amount.
 START_URL                :Crawling url.
 DOWNLOAD_VIDEO           :Download video. Y or N.

[See Default Configuration](https://github.com/tim232385/PornBot/blob/master/config.properties

Stargazers over time

Stargazers over time

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].