WeiboCrawler无cookie版微博爬虫,可以连续爬取一个或多个新浪微博用户信息、用户微博及其微博评论转发。
Stars: ✭ 45 (-10%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-50%)
DecryptloginAPIs for loginning some websites by using requests.
Stars: ✭ 1,861 (+3622%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+972%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (+1462%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (+44%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+2340%)
JvppeteerHeadless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (+286%)
Datmusic ApiAlternative for VK Audio API
Stars: ✭ 160 (+220%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+30970%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+9574%)
bots-zooNo description or website provided.
Stars: ✭ 59 (+18%)
dijnet-botAz összes számlád még egy helyen :)
Stars: ✭ 17 (-66%)
Lxspider爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (+20%)
ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+2544%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-72%)
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+188%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-66%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+242%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+362%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+32638%)
PoliteBe nice on the web
Stars: ✭ 253 (+406%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (+806%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+9486%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+780%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+1212%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+1174%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+1478%)
Weibo AnalystSocial media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类
Stars: ✭ 430 (+760%)
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-6%)
Weibo Crawler新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
Stars: ✭ 1,019 (+1938%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (+2248%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+16166%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+2392%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (+702%)
Google Play ScraperGoogle play scraper for Python inspired by <facundoolano/google-play-scraper>
Stars: ✭ 143 (+186%)
OnegramThis repository is no longer maintained.
Stars: ✭ 137 (+174%)
Instagram Scraperscrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Stars: ✭ 2,209 (+4318%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+22990%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (+280%)
Instagram CrawlerCrawl instagram photos, posts and videos for download.
Stars: ✭ 178 (+256%)
Querylist🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+4684%)
Ruiji.netcrawler framework, distributed crawler extractor
Stars: ✭ 220 (+340%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+322%)
Weibopicdownloader免登录下载微博图片 爬虫 Download Weibo Images without Logging-in
Stars: ✭ 247 (+394%)
Media ScraperScrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (+312%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (+588%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+596%)
Tianyanchapip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (+312%)
zeekEyeA Fast and Powerful Scraping and Web Crawling Framework.
Stars: ✭ 36 (-28%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-70%)