Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → botsphp → Douyin_crawler

botsphp / Douyin_crawler

抖音爬虫，tiktok crawler，抖音数据采集接口，抖音视频去水印，百分百成功，不需要服务器，不需要代理 IP。

Labels

crawler

Projects that are alternatives of or similar to Douyin crawler

Pachong

一些爬虫的代码

Stars: ✭ 147 (-13.02%)

Mutual labels: crawler

Weibo wordcloud

根据关键词抓取微博数据，再生成词云

Stars: ✭ 154 (-8.88%)

Mutual labels: crawler

Js Reverse

JS逆向研究

Stars: ✭ 159 (-5.92%)

Mutual labels: crawler

Cocrawler

CoCrawler is a versatile web crawler built using modern tools and concurrency.

Stars: ✭ 148 (-12.43%)

Mutual labels: crawler

Ngmeta

Dynamic meta tags in your AngularJS single page application

Stars: ✭ 152 (-10.06%)

Mutual labels: crawler

Instagram Scraper

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

Stars: ✭ 2,209 (+1207.1%)

Mutual labels: crawler

Th Music Video Generator

Touhou Project random music video generator/player, crawling image and video from websites to generate MV.

Stars: ✭ 146 (-13.61%)

Mutual labels: crawler

Scrapingoutsourcing

ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个

Stars: ✭ 164 (-2.96%)

Mutual labels: crawler

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+1159.76%)

Mutual labels: crawler

Yispider

一款分布式爬虫平台，帮助你更好的管理和开发爬虫。内置一套爬虫定义规则（模版），可使用模版快速定义爬虫，也可当作框架手动开发爬虫。(兴趣使然的项目，用的不爽了就更新)

Stars: ✭ 158 (-6.51%)

Mutual labels: crawler

Dxy Covid 19 Crawler

2019新型冠状病毒疫情实时爬虫及API | COVID-19/2019-nCoV Realtime Infection Crawler and API

Stars: ✭ 1,865 (+1003.55%)

Mutual labels: crawler

Jlitespider

A lite distributed Java spider framework :-)

Stars: ✭ 151 (-10.65%)

Mutual labels: crawler

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+1060.36%)

Mutual labels: crawler

Rendora

dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites

Stars: ✭ 1,853 (+996.45%)

Mutual labels: crawler

Datmusic Api

Alternative for VK Audio API

Stars: ✭ 160 (-5.33%)

Mutual labels: crawler

Httpcode.core

简单、易用、高效一个有态度的开源.Net Http请求框架!可以用制作爬虫，api请求等等。

Stars: ✭ 146 (-13.61%)

Mutual labels: crawler

Crawler

An easy to use, powerful crawler implemented in PHP. Can execute Javascript.

Stars: ✭ 2,055 (+1115.98%)

Mutual labels: crawler

Bitextor

Bitextor generates translation memories from multilingual websites.

Stars: ✭ 168 (-0.59%)

Mutual labels: crawler

Gocrawl

Polite, slim and concurrent web crawler.

Stars: ✭ 1,962 (+1060.95%)

Mutual labels: crawler

Downzemall

DownZemAll! is a download manager for Windows, MacOS and Linux

Stars: ✭ 157 (-7.1%)

Mutual labels: crawler

View All Similar Projects ➔

抖音、Tiktok分布式爬虫使用文档 ENGLISH

2019.12.20 增加评论接口

2020.12.20 增加tiktok接口，除商品接口外全兼容抖音接口

2020.01.13 增加抖音商品接口

核心功能和优势

现在抖音升级特别频繁，老版本的限制越来越多，以前的签名算法，要么已经过期，要么就不返回数据，要么就是经常封 IP，造成接口和代码要经常改动，维护代理 IP 的成本也高
现在的方案是通过最新的分布式采集系统实现数据抓取，一个任务同时在后台，使用多个 IP 不同的协议和签名实现，保证返回数据，减少单点系统的瓶颈和成本
目前可以实现每天上千万次抓取，接口简单，效率极高
采用云方案，不需要部署采集服务器，不需要购买代理 IP，不需要处理升级
支持抖音、Tiktok

在线演示地址

https://yundou.me/

采集任务投放地址

https://service.yundou.me/

采集任务投放格式

curl -s -m 5 -d '{"token":"36ea7692e261cc32f593b2cd7eb7dc6c","type":"crawler_search_user","search":"面膜","num":20}' \
https://service.yundou.me/

采集成功后，会回调到客户的 http 接口

回调接口处理成功，必须返回 {"code":200} 的字符串回调接口请求超时时间为 10 秒，响应超时视为失败回调失败只重试一次，如果回调接收失败，可重新发起任务抓取任务会进行多次校验，采集失败会自动更换 IP 重新采集，一个抓取任务最多重试 9 次任务超过5分钟视为失败，可重新发起任务只支持 http 接口

支持的接口列表

crawler_search_user: 根据抖音号搜索 UID
crawler_search_video: 搜索视频
crawler_user_info: 根据 UID 返回用户详情
crawler_user_following: 根据 UID 返回用户关注列表
crawler_user_follower: 根据 UID 返回用户粉丝列表
crawler_user_post: 根据 UID 返回用户作品列表
crawler_user_favorite: 根据 UID 返回用户喜欢的列表
crawler_nearby_feed: 根据城市返回用户作品列表
crawler_comment_list: 根据视频 ID 返回评论列表
crawler_search_goods: 搜索商品，返回商品和对应主播的列表
crawler_user_goods: 根据 UID 返回用户橱窗商品列表

接受的参数列表

{
    "token": "",
    "num": 20,
    "type": "crawler_search_user",
    "uid": "85635793",
    "vid": "6763872129701124",
    "sec_uid": "MS4wLjABAAAA6FJbgV0BY17eGBY",
    "city_id": "510100",
    "search": "abcdefg",
    "task":"your_uniq_id",
    "result": []
}

> 不同的任务，需要传递的参数不一样，详细参数见下表
> `token` 为购买时分配的密钥，请注意续费
> `task` 是预留的字段，只支持16位以内字符串，可以用于标记客户自己的唯一任务
> `result` 为采集的结果
> 该请求参数，也是回调内容

响应内容

{
    "code": 200
}

{
    "code": 500,
    "msg": "异常内容"
}

详细任务接口

tiktok 需要增加一个参数 "app":"tiktok"

{
    "num": 20,
    "type": "crawler_search_user",
    "search": "abc12345678"
}

{
    "num": 20,
    "type": "crawler_search_video",
    "search": "abc12345678"
}

必须包含 uid 和 sec_uid

{
    "type": "crawler_user_info",
    "uid": "9338953804",
    "sec_uid": "MS4wLjABAAAAQ4xCNiRbRwIg"
}

必须包含 uid 和 sec_uid

{
    "num": 20,
    "type": "crawler_user_favorite",
    "uid": "632494600",
    "sec_uid": "MS4wLjABAAAAQ4xCNiRbRwIg"
}

必须包含 uid 和 sec_uid

{
    "num": 20,
    "type": "crawler_user_post",
    "uid": "632494600",
    "sec_uid": "MS4wLjABAAAAQ4xCNiRbRwIg"
}

必须包含 vid

{
    "num": 20,
    "type": "crawler_comment_list",
    "vid": "66082937525764932"
}

必须包含 uid 和 sec_uid

{
    "num": 20,
    "type": "crawler_user_follower",
    "uid": "16361944337",
    "sec_uid": "MS4wLjABAAAAQ4xCNiRbRwIg"
}

必须包含 uid 和 sec_uid

{
    "num": 20,
    "type": "crawler_user_following",
    "uid": "163619337",
    "sec_uid": "tWyVTUdvPOg90efQ7E"
}

city_id 和坐标必须包含其中一项

{
    "num": 20,
    "type": "crawler_nearby_feed",
    "city_id": "510900",
    "longitude": "105.389997",
    "latitude": "30.87346",
}

{
    "num": 20,
    "type": "crawler_search_goods",
    "search": "玩具"
}

必须包含 uid 和 sec_uid

{
    "num": 20,
    "type": "crawler_user_goods",
    "uid": "16361944337",
    "sec_uid": "MS4wLjABAAAAQ4xCNiRbRwIg"
}

说明

仅供学习研究使用，勿用于商业用途
请在获得授权后使用
欢迎交流学习

联系方式

[email protected]
tg: +639288446666
wx: wxfpy5201 botyan1980 (QQ 被盗，弃用)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 169

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗