All Projects → Linkcrawler → Similar Projects or Alternatives

398 Open source projects that are alternatives of or similar to Linkcrawler

It covers the blockade principle of most anti-climbing strategies and corresponding solutions.👽👽👽👽（涵盖了大部分的反爬策略的封锁原理以及对应的解决方案。）

Stars: ✭ 77 (-29.36%)

Mutual labels: crawler

Python Testing Crawler

A crawler for automated functional testing of a web application

Stars: ✭ 68 (-37.61%)

Mutual labels: crawler

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+1112.84%)

Mutual labels: crawler

Swiftlinkpreview

It makes a preview from an URL, grabbing all the information such as title, relevant texts and images.

Stars: ✭ 1,216 (+1015.6%)

Mutual labels: crawler

Chemrtron

A document viewer; fuzzy match incremental search.

Stars: ✭ 59 (-45.87%)

Mutual labels: crawler

Infinitycrawler

A simple but powerful web crawler library for .NET

Stars: ✭ 97 (-11.01%)

Mutual labels: crawler

Jd Autobuy

Python爬虫，京东自动登录，在线抢购商品

Stars: ✭ 1,174 (+977.06%)

Mutual labels: crawler

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-8.26%)

Mutual labels: crawler

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (-42.2%)

Mutual labels: crawler

Weibo Album Crawler

新浪微博相册大图多线程爬虫。

Stars: ✭ 83 (-23.85%)

Mutual labels: crawler

Work crawler

Download comics novels 小说漫画下载工具小説漫画のダウンローダ小說漫畫下載:腾讯漫画大角虫漫画有妖气知音漫客咪咕 SF漫画哦漫画看漫画漫画柜汗汗酷漫動漫伊甸園快看漫画微博动漫 733动漫网大古漫画网漫画DB 無限動漫動漫狂卡推漫画动漫之家动漫屋古风漫画网 36漫画网亲亲漫画网乙女漫画 comico webtoons 咚漫ニコニコ静画 ComicWalker ヤングエースUP モアイ pixivコミックサイコミ;アルファポリスカクヨムハーメルン小説家になろう起点中文网八一中文网顶点小说落霞小说网努努书坊笔趣阁→epub.

Stars: ✭ 1,224 (+1022.94%)

Mutual labels: crawler

Car Prices

Golang爬虫爬取汽车之家二手车产品库

Stars: ✭ 57 (-47.71%)

Mutual labels: crawler

Thesaurusspider

下载搜狗、百度、QQ输入法的词库文件的 python 爬虫，可用于构建不同行业的词汇库

Stars: ✭ 98 (-10.09%)

Mutual labels: crawler

Poopak

POOPAK - TOR Hidden Service Crawler

Stars: ✭ 78 (-28.44%)

Mutual labels: crawler

D4n155

OWASP D4N155 - Intelligent and dynamic wordlist using OSINT

Stars: ✭ 105 (-3.67%)

Mutual labels: crawler

Bee University

Project thu thập điểm chuẩn đại học 2014 - 2018 và phân tích dữ liệu

Stars: ✭ 73 (-33.03%)

Mutual labels: crawler

Lightcrawler

Crawl a website and run it through Google lighthouse

Stars: ✭ 1,339 (+1128.44%)

Mutual labels: crawler

Spider

python crawler spider

Stars: ✭ 70 (-35.78%)

Mutual labels: crawler

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (-1.83%)

Mutual labels: crawler

Tracker Radar Collector

🕸 Modular, multithreaded, puppeteer-based crawler

Stars: ✭ 67 (-38.53%)

Mutual labels: crawler

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-15.6%)

Mutual labels: crawler

Hproxy

hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)

Stars: ✭ 62 (-43.12%)

Mutual labels: crawler

Crawlerpack

Java 網路資料爬蟲包

Stars: ✭ 99 (-9.17%)

Mutual labels: crawler

Auto Lighthouse

A utility package for automating lighthouse reporting

Stars: ✭ 58 (-46.79%)

Mutual labels: crawler

Tumblr crawler

tumblr解析网站

Stars: ✭ 83 (-23.85%)

Mutual labels: crawler

Is Google

Verify that a request is from Google crawlers using Google's DNS verification steps

Stars: ✭ 82 (-24.77%)

Mutual labels: crawler

Awesome Python Primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向

Stars: ✭ 57 (-47.71%)

Mutual labels: crawler

Gopa Abandoned

GOPA, a spider written in Go.（NOTE: this project moved to https://github.com/infinitbyte/gopa ）

Stars: ✭ 98 (-10.09%)

Mutual labels: crawler

Wombat

Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.

Stars: ✭ 1,220 (+1019.27%)

Mutual labels: crawler

Skycaiji

蓝天采集器是一款免费的数据采集发布爬虫软件，采用php+mysql开发，可部署在云服务器，几乎能采集所有类型的网页，无缝对接各类CMS建站程序，免登录实时发布数据，全自动无需人工干预！是网页大数据采集软件中完全跨平台的云端爬虫系统

Stars: ✭ 1,514 (+1288.99%)

Mutual labels: crawler

Puppeteer Walker

a puppeteer walker 🕷 🕸

Stars: ✭ 78 (-28.44%)

Mutual labels: crawler

Amazonrobot

Amazon商品引流的 python 爬虫

Stars: ✭ 97 (-11.01%)

Mutual labels: crawler

Webb

Python: An all-in-one Web Crawler, Web Parser and Web Scrapping library!

Stars: ✭ 77 (-29.36%)

Mutual labels: crawler

Webmagic

A scalable web crawler framework for Java.

Stars: ✭ 10,186 (+9244.95%)

Mutual labels: crawler

Crawler examples

Some classic web crawler projects.一些经典的爬虫

Stars: ✭ 74 (-32.11%)

Mutual labels: crawler

Scaleable Crawler With Docker Cluster

a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine

Stars: ✭ 96 (-11.93%)

Mutual labels: crawler

Goscraper

Golang pkg to quickly return a preview of a webpage (title/description/images)

Stars: ✭ 72 (-33.94%)

Mutual labels: crawler

Andvaranaut

A dungeon crawler

Stars: ✭ 103 (-5.5%)

Mutual labels: crawler

Scrapy Examples

Some scrapy and web.py exmaples

Stars: ✭ 71 (-34.86%)

Mutual labels: crawler

Gf Secrets

Secret and/ credential patterns used for gf.

Stars: ✭ 96 (-11.93%)

Mutual labels: crawler

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-37.61%)

Mutual labels: crawler

Fawkes

Fawkes is a tool to search for targets vulnerable to SQL Injection. Performs the search using Google search engine.

Stars: ✭ 108 (-0.92%)

Mutual labels: crawler

Zhihuvapi

优雅地玩知乎

Stars: ✭ 67 (-38.53%)

Mutual labels: crawler

Hotnewsanalysis

利用文本挖掘技术进行新闻热点关注问题分析

Stars: ✭ 93 (-14.68%)

Mutual labels: crawler

Lxspider

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》

Stars: ✭ 60 (-44.95%)

Mutual labels: crawler

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (+1153.21%)

Mutual labels: crawler

Tumblr Crawler

Easily download all the photos/videos from tumblr blogs. 下载指定的 Tumblr 博客中的图片，视频

Stars: ✭ 1,118 (+925.69%)

Mutual labels: crawler

Proxy Pool

爬虫代理IP池服务，可供其他爬虫程序通过restapi获取

Stars: ✭ 91 (-16.51%)

Mutual labels: crawler

Boj Autocommit

When you solve the problem of Baekjoon Online Judge, it automatically commits and pushes to the remote repository.

Stars: ✭ 60 (-44.95%)

Mutual labels: crawler

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+1321.1%)

Mutual labels: crawler

Beanbun

Beanbun 是用 PHP 编写的多进程网络爬虫框架，具有良好的开放性、高可扩展性，基于 Workerman。

Stars: ✭ 1,096 (+905.5%)

Mutual labels: crawler

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+1043.12%)

Mutual labels: crawler

Crawlergo

A powerful dynamic crawler for web vulnerability scanners

Stars: ✭ 1,088 (+898.17%)

Mutual labels: crawler

Antispider

Stars: ✭ 99 (-9.17%)

Mutual labels: crawler

Taiwan News Crawlers

Scrapy-based Crawlers for news of Taiwan

Stars: ✭ 83 (-23.85%)

Mutual labels: crawler

Lumberjack

An automated website accessibility scanner and cli

Stars: ✭ 109 (+0%)

Mutual labels: crawler

Scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Stars: ✭ 42,343 (+38746.79%)

Mutual labels: crawler

Crawler

爬虫, http代理, 模拟登陆!

Stars: ✭ 106 (-2.75%)

Mutual labels: crawler

Douyinsdk

抖音 SDK，数据采集，爬虫抓取不是梦

Stars: ✭ 99 (-9.17%)

Mutual labels: crawler

Acm Statistics

An online tool (crawler) to analyze users performance in online judges (coding competition websites). Supported OJ: POJ, HDU, ZOJ, HYSBZ, CodeForces, UVA, ICPC Live Archive, FZU, SPOJ, Timus (URAL), LeetCode_CN, CSU, LibreOJ, 洛谷, 牛客OJ, Lutece (UESTC), AtCoder, AIZU, CodeChef, El Judge, BNUOJ, Codewars, UOJ, NBUT, 51Nod, DMOJ, VJudge

Stars: ✭ 83 (-23.85%)

Mutual labels: crawler

1-60 of 398 similar projects

›

next*5