Linkedin Profile Scraper🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-86.28%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-64.69%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+1146.79%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+288.2%)
GosintOSINT Swiss Army Knife
Stars: ✭ 401 (-67.82%)
Awesome CrawlerA collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+284.67%)
NewcrawlerFree Web Scraping Tool with Java
Stars: ✭ 589 (-52.73%)
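Awesome Python PrimerAn index of high-quality Chinese-language resources for self-taught Python beginners, including books, documentation, and videos, covering web scraping, web development, data analysis, and machine learning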
Stars: ✭ 57 (-95.43%)
Querylist🕷️ The elegant, progressive PHP crawler framework!
Stars: ✭ 2,392 (+91.97%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (-47.35%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (-37.32%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-95.83%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-97.99%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-98.64%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-36.68%)
FbcrawlA Facebook crawler
Stars: ✭ 536 (-56.98%)
AvbookAV movie management system; crawler for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database; Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+552.73%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-98.8%)
Goose ParserUniversal scraping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-83.07%)
Goribot[Crawler/Scraper for Golang] 🕷 A lightweight, distributed-friendly Golang crawler framework.
Stars: ✭ 190 (-84.75%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-95.26%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-77.77%)
XcrawlerA fast, concise, and powerful PHP crawler framework
Stars: ✭ 344 (-72.39%)
scrapy facebookerCollection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-98.23%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+227.21%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-72.07%)
Spiderpython crawler spider
Stars: ✭ 70 (-94.38%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (-63.64%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-62.76%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-94.54%)
Bilili🍻 bilibili video (including bangumi) and danmaku downloader
Stars: ✭ 379 (-69.58%)
DataflowkitExtract structured data from websites. Website scraping.
Stars: ✭ 456 (-63.4%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (-2.09%)
XsrfprobeThe Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (-57.3%)
NetdiscoveryNetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.
Stars: ✭ 573 (-54.01%)
Go jobsA look at the Golang job market
Stars: ✭ 526 (-57.78%)
Xxl CrawlerA distributed web crawler framework (XXL-CRAWLER).
Stars: ✭ 561 (-54.98%)
DouyinAPI of DouYin for Humans used to Crawl Popular Videos and Music
Stars: ✭ 580 (-53.45%)
Imagescraper✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (-49.44%)
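Signature algorithmRequest-signing or encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom (自如), Xiaohongshu (小红书), Danke Apartment (蛋壳公寓), luckin coffee (瑞幸咖啡), and bangkokair (Bangkok Airways)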
Stars: ✭ 380 (-69.5%)
Haipproxy💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis
Stars: ✭ 4,993 (+300.72%)
IcrawlerA multi-threaded crawler framework with many built-in image crawlers.
Stars: ✭ 629 (-49.52%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (-48.88%)
GospiderGospider - Fast web spider written in Go
Stars: ✭ 785 (-37%)
Zhihu Crawlerzhihu-crawler is a high-performance, horizontally scalable, distributed crawler project written in Java, with support for a free HTTP proxy pool
Stars: ✭ 890 (-28.57%)
TorbotDark Web OSINT Tool
Stars: ✭ 821 (-34.11%)
PypatentSearch for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-97.51%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-98.88%)