All Projects → CrawlBox → Similar Projects or Alternatives

487 Open source projects that are alternatives of or similar to CrawlBox

Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+134.75%)
Mutual labels:  crawler, web-crawler
Awesome Crawler
A collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+3961.86%)
Mutual labels:  crawler, web-crawler
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-59.32%)
Mutual labels:  crawler, web-crawler
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+7011.86%)
Mutual labels:  crawler, web-crawler
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+455.93%)
Mutual labels:  crawler, web-crawler
Infinitycrawler
A simple but powerful web crawler library for .NET
Stars: ✭ 97 (-17.8%)
Mutual labels:  crawler, web-crawler
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-46.61%)
Mutual labels:  crawler, web-crawler
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+67.8%)
Mutual labels:  crawler, web-crawler
Crawlab Lite
Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台
Stars: ✭ 122 (+3.39%)
Mutual labels:  crawler, web-crawler
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+209.32%)
Mutual labels:  crawler, web-crawler
D4n155
OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-11.02%)
Mutual labels:  crawler, wordlist
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+1561.86%)
Mutual labels:  crawler, web-crawler
Spidy
The simple, easy to use command line web crawler.
Stars: ✭ 257 (+117.8%)
Mutual labels:  crawler, web-crawler
Supercrawler
A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (+159.32%)
Mutual labels:  crawler, web-crawler
Pspider
简单易用的Python爬虫框架,QQ交流群:597510560
Stars: ✭ 1,611 (+1265.25%)
Mutual labels:  crawler, web-crawler
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-66.95%)
Mutual labels:  crawler, web-crawler
Zhihu Crawler People
A simple distributed crawler for zhihu && data analysis
Stars: ✭ 182 (+54.24%)
Mutual labels:  crawler, web-crawler
Strong Web Crawler
基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。
Stars: ✭ 238 (+101.69%)
Mutual labels:  crawler, web-crawler
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-87.29%)
Mutual labels:  web-crawler
auto crawler ptt beauty image
Auto Crawler Ptt Beauty Image Use Python Schedule
Stars: ✭ 35 (-70.34%)
Mutual labels:  crawler
WebCrawler
Just a simple web crawler which return crawled links as IObservable using reactive extension and async await.
Stars: ✭ 55 (-53.39%)
Mutual labels:  web-crawler
learncpp-download
Scrape bot, to get you an offline copy of tutorials
Stars: ✭ 23 (-80.51%)
Mutual labels:  web-crawler
ptt-web-crawler
PTT 網路版爬蟲
Stars: ✭ 20 (-83.05%)
Mutual labels:  crawler
img-cli
An interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-87.29%)
Mutual labels:  crawler
bolsa
Biblioteca feita em Python com o objetivo de facilitar o acesso a dados de seus investimentos na bolsa de valores(B3/CEI) através do Portal CEI.
Stars: ✭ 46 (-61.02%)
Mutual labels:  web-crawler
leek
Distributed task redisqueue(最简单python分布式函数调度框架)
Stars: ✭ 60 (-49.15%)
Mutual labels:  web-crawler
siteshooter
📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (-46.61%)
Mutual labels:  web-crawler
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-87.29%)
Mutual labels:  crawler
StackOverflow-Crawler
It is a web crawler which crawls the stackoverfolw website (http://stackoverflow.com/) and finds the most popular technologies at current point of time by getting the tags info of the newest questions asked on the website.
Stars: ✭ 25 (-78.81%)
Mutual labels:  web-crawler
Brutal-wordlist-Generator
Brutal Wordlist Generator is a java based Application software used to generate the wordlist with best of UX interface
Stars: ✭ 24 (-79.66%)
Mutual labels:  wordlist
spiderable-middleware
🤖 Prerendering for JavaScript powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites based on top of front-end JavaScript frameworks
Stars: ✭ 29 (-75.42%)
Mutual labels:  crawler
python-wordlist-generator
Create awesome wordlist with python, demo: https://asciinema.org/a/101677
Stars: ✭ 87 (-26.27%)
Mutual labels:  wordlist
crawler
A simple and flexible web crawler framework for java.
Stars: ✭ 20 (-83.05%)
Mutual labels:  crawler
cracken
a fast password wordlist generator, Smartlist creation and password hybrid-mask analysis tool written in pure safe Rust
Stars: ✭ 192 (+62.71%)
Mutual labels:  wordlist
BilibiliCrawler
🌀 crawl bilibili user info and video info for data analysis | BiliBili爬虫
Stars: ✭ 25 (-78.81%)
Mutual labels:  crawler
ComPP
Company Passwords Profiler (aka ComPP) helps making a bruteforce wordlist for a targeted company.
Stars: ✭ 44 (-62.71%)
Mutual labels:  wordlist
TaobaoAnalysis
练习NLP,分析淘宝评论的项目
Stars: ✭ 28 (-76.27%)
Mutual labels:  crawler
json-web-crawler
Use JSON to list all elements (with css 3 and jquery selector) that you want to crawl.
Stars: ✭ 17 (-85.59%)
Mutual labels:  web-crawler
domfind
A Python DNS crawler to find identical domain names under different TLDs.
Stars: ✭ 22 (-81.36%)
Mutual labels:  crawler
WeReadScan
扫描“微信读书”已购图书并下载本地PDF的爬虫
Stars: ✭ 273 (+131.36%)
Mutual labels:  web-crawler
Python3Webcrawler
🌈Python3网络爬虫实战:QQ音乐歌曲、京东商品信息、房天下、破解有道翻译、构建代理池、豆瓣读书、百度图片、破解网易登录、B站模拟扫码登录、小鹅通、荔枝微课
Stars: ✭ 208 (+76.27%)
Mutual labels:  crawler
ronin-support
A support library for Ronin. Like activesupport, but for hacking!
Stars: ✭ 23 (-80.51%)
Mutual labels:  wordlist
Raspagem-de-dados-para-iniciantes
Raspagem de dados para iniciante usando Scrapy e outras libs básicas
Stars: ✭ 113 (-4.24%)
Mutual labels:  web-crawler
Crawling-CV-Conference-Papers
Crawling CV conference papers with Python.
Stars: ✭ 32 (-72.88%)
Mutual labels:  crawler
UltimateCMSWordlists
📚 An ultimate collection wordlists of the best-known CMS
Stars: ✭ 54 (-54.24%)
Mutual labels:  wordlist
medium-stat-box
Practical pinned gist which show your latest medium status 📌
Stars: ✭ 29 (-75.42%)
Mutual labels:  crawler
SchweizerMesser
🎯Python 3 网络爬虫实战、数据分析合集 | 当当 | 网易云音乐 | unsplash | 必胜客 | 猫眼 |
Stars: ✭ 89 (-24.58%)
Mutual labels:  web-crawler
ant
A web crawler for Go
Stars: ✭ 264 (+123.73%)
Mutual labels:  web-crawler
longtongue
Customized Password/Passphrase List inputting Target Info
Stars: ✭ 61 (-48.31%)
Mutual labels:  wordlist
evine
Interactive CLI Web Crawler
Stars: ✭ 140 (+18.64%)
Mutual labels:  web-crawler
WiCrackFi
Python Script to help/automate the WiFi hacking exercises.
Stars: ✭ 61 (-48.31%)
Mutual labels:  wordlist
tmpleak
Leak other players' temporary workspaces for ctf and wargames.
Stars: ✭ 76 (-35.59%)
Mutual labels:  wordlist
php-google
Google search results crawler, get google search results that you need - php
Stars: ✭ 23 (-80.51%)
Mutual labels:  crawler
pyCreeper
一个用来快速提取网页内容的信息采集(爬虫)框架, 实现了对网页的动态加载与控制。
Stars: ✭ 25 (-78.81%)
Mutual labels:  web-crawler
doc crawler.py
Explore a website recursively and download all the wanted documents (PDF, ODT…)
Stars: ✭ 22 (-81.36%)
Mutual labels:  web-crawler
N-WEB
WEB PENETRATION TESTING TOOL 💥
Stars: ✭ 56 (-52.54%)
Mutual labels:  admin-finder
simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (-72.88%)
Mutual labels:  wordlist
roboxtractor
Extract endpoints marked as disallow in robots files to generate wordlists.
Stars: ✭ 40 (-66.1%)
Mutual labels:  wordlist
SourceWolf
Amazingly fast response crawler to find juicy stuff in the source code! 😎🔥
Stars: ✭ 132 (+11.86%)
Mutual labels:  wordlist
lostark-wait-notifier
🐤️ Lost Ark wait notifier
Stars: ✭ 38 (-67.8%)
Mutual labels:  crawler
1-60 of 487 similar projects