Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+47.55%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+2751.05%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (+91.61%)
Instagram CrawlerCrawl instagram photos, posts and videos for download.
Stars: ✭ 178 (+24.48%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+451.75%)
Tianyanchapip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (+44.06%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-58.74%)
OnegramThis repository is no longer maintained.
Stars: ✭ 137 (-4.2%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+358.74%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+7973.43%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+771.33%)
Datmusic ApiAlternative for VK Audio API
Stars: ✭ 160 (+11.89%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+10763.64%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (+32.87%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-89.51%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-88.11%)
Weibo terminator workflowUpdate Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
Stars: ✭ 259 (+81.12%)
Ruiji.netcrawler framework, distributed crawler extractor
Stars: ✭ 220 (+53.85%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (+216.78%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+345.45%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+3251.75%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (+720.98%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-49.65%)
Instagram Scraperscrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Stars: ✭ 2,209 (+1444.76%)
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+0.7%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+19.58%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-90.21%)
Querylist🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+1572.73%)
JvppeteerHeadless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (+34.97%)
Media ScraperScrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (+44.06%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+753.15%)
PoliteBe nice on the web
Stars: ✭ 253 (+76.92%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+61.54%)
dijnet-botAz összes számlád még egy helyen :)
Stars: ✭ 17 (-88.11%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+11346.85%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (-37.76%)
Hquery.phpAn extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (+106.29%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+5587.41%)
BookcorpusCrawl BookCorpus
Stars: ✭ 443 (+209.79%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+207.69%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+3282.52%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (+180.42%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (+274.83%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (+446.15%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+143.36%)
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-67.13%)
Xcrawler快速、简洁且强大的PHP爬虫框架
Stars: ✭ 344 (+140.56%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-82.52%)
ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+824.48%)
MwofflinerScrape any online Mediawiki motorised wiki (like Wikipedia) to your local filesystem
Stars: ✭ 121 (-15.38%)
ArxivscraperA python module to scrape arxiv.org for specific date range and categories
Stars: ✭ 121 (-15.38%)
Youtube Comment SuiteDownload YouTube comments from numerous videos, playlists, and channels for archiving, general search, and showing activity.
Stars: ✭ 120 (-16.08%)
Go Jd京东自动登录,在线商品自动下单
Stars: ✭ 139 (-2.8%)
ProxyscrapePython library for retrieving free proxies (HTTP, HTTPS, SOCKS4, SOCKS5).
Stars: ✭ 134 (-6.29%)