Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+134.75%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+3961.86%)
flink-crawlerContinuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-59.32%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+7011.86%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+455.93%)
InfinitycrawlerA simple but powerful web crawler library for .NET
Stars: ✭ 97 (-17.8%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+67.8%)
Crawlab LiteLite version of Crawlab. 轻量版 Crawlab 爬虫管理平台
Stars: ✭ 122 (+3.39%)
Spider Flow新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+209.32%)
D4n155OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-11.02%)
AbotCross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+1561.86%)
SpidyThe simple, easy to use command line web crawler.
Stars: ✭ 257 (+117.8%)
SupercrawlerA web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (+159.32%)
Pspider简单易用的Python爬虫框架,QQ交流群:597510560
Stars: ✭ 1,611 (+1265.25%)
MamanRust Web Crawler saving pages on Redis
Stars: ✭ 39 (-66.95%)
Strong Web Crawler基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。
Stars: ✭ 238 (+101.69%)
OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-87.29%)
WebCrawlerJust a simple web crawler which return crawled links as IObservable using reactive extension and async await.
Stars: ✭ 55 (-53.39%)
learncpp-downloadScrape bot, to get you an offline copy of tutorials
Stars: ✭ 23 (-80.51%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-87.29%)
bolsaBiblioteca feita em Python com o objetivo de facilitar o acesso a dados de seus investimentos na bolsa de valores(B3/CEI) através do Portal CEI.
Stars: ✭ 46 (-61.02%)
leekDistributed task redisqueue(最简单python分布式函数调度框架)
Stars: ✭ 60 (-49.15%)
siteshooter📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (-46.61%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-87.29%)
StackOverflow-CrawlerIt is a web crawler which crawls the stackoverfolw website (http://stackoverflow.com/) and finds the most popular technologies at current point of time by getting the tags info of the newest questions asked on the website.
Stars: ✭ 25 (-78.81%)
Brutal-wordlist-GeneratorBrutal Wordlist Generator is a java based Application software used to generate the wordlist with best of UX interface
Stars: ✭ 24 (-79.66%)
spiderable-middleware🤖 Prerendering for JavaScript powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites based on top of front-end JavaScript frameworks
Stars: ✭ 29 (-75.42%)
crawlerA simple and flexible web crawler framework for java.
Stars: ✭ 20 (-83.05%)
crackena fast password wordlist generator, Smartlist creation and password hybrid-mask analysis tool written in pure safe Rust
Stars: ✭ 192 (+62.71%)
BilibiliCrawler🌀 crawl bilibili user info and video info for data analysis | BiliBili爬虫
Stars: ✭ 25 (-78.81%)
ComPPCompany Passwords Profiler (aka ComPP) helps making a bruteforce wordlist for a targeted company.
Stars: ✭ 44 (-62.71%)
json-web-crawlerUse JSON to list all elements (with css 3 and jquery selector) that you want to crawl.
Stars: ✭ 17 (-85.59%)
domfindA Python DNS crawler to find identical domain names under different TLDs.
Stars: ✭ 22 (-81.36%)
WeReadScan扫描“微信读书”已购图书并下载本地PDF的爬虫
Stars: ✭ 273 (+131.36%)
Python3Webcrawler🌈Python3网络爬虫实战:QQ音乐歌曲、京东商品信息、房天下、破解有道翻译、构建代理池、豆瓣读书、百度图片、破解网易登录、B站模拟扫码登录、小鹅通、荔枝微课
Stars: ✭ 208 (+76.27%)
ronin-supportA support library for Ronin. Like activesupport, but for hacking!
Stars: ✭ 23 (-80.51%)
medium-stat-boxPractical pinned gist which show your latest medium status 📌
Stars: ✭ 29 (-75.42%)
SchweizerMesser🎯Python 3 网络爬虫实战、数据分析合集 | 当当 | 网易云音乐 | unsplash | 必胜客 | 猫眼 |
Stars: ✭ 89 (-24.58%)
antA web crawler for Go
Stars: ✭ 264 (+123.73%)
longtongueCustomized Password/Passphrase List inputting Target Info
Stars: ✭ 61 (-48.31%)
evineInteractive CLI Web Crawler
Stars: ✭ 140 (+18.64%)
WiCrackFiPython Script to help/automate the WiFi hacking exercises.
Stars: ✭ 61 (-48.31%)
tmpleakLeak other players' temporary workspaces for ctf and wargames.
Stars: ✭ 76 (-35.59%)
php-googleGoogle search results crawler, get google search results that you need - php
Stars: ✭ 23 (-80.51%)
pyCreeper一个用来快速提取网页内容的信息采集(爬虫)框架, 实现了对网页的动态加载与控制。
Stars: ✭ 25 (-78.81%)
doc crawler.pyExplore a website recursively and download all the wanted documents (PDF, ODT…)
Stars: ✭ 22 (-81.36%)
N-WEBWEB PENETRATION TESTING TOOL 💥
Stars: ✭ 56 (-52.54%)
simplemmaSimple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (-72.88%)
roboxtractorExtract endpoints marked as disallow in robots files to generate wordlists.
Stars: ✭ 40 (-66.1%)
SourceWolfAmazingly fast response crawler to find juicy stuff in the source code! 😎🔥
Stars: ✭ 132 (+11.86%)