53 open source projects that are alternatives to or similar to json-web-crawler

A web crawler that crawls the Stack Overflow website (http://stackoverflow.com/) and finds the currently most popular technologies by collecting the tags of the newest questions asked on the site.

Stars: ✭ 25 (+47.06%)

Mutual labels: web-crawler

WeReadScan

A crawler that scans the books you have purchased on WeRead (微信读书) and downloads them as local PDFs

Stars: ✭ 273 (+1505.88%)

Mutual labels: web-crawler

Raspagem-de-dados-para-iniciantes

Data scraping for beginners using Scrapy and other basic libraries

Stars: ✭ 113 (+564.71%)

Mutual labels: web-crawler

ant

A web crawler for Go

Stars: ✭ 264 (+1452.94%)

Mutual labels: web-crawler

doc crawler.py

Explore a website recursively and download all the wanted documents (PDF, ODT…)

Stars: ✭ 22 (+29.41%)

Mutual labels: web-crawler

Market-Trend-Prediction

A project for a course on building knowledge graphs. It leverages historical stock prices and integrates social media listening from customers to predict market trends on the Dow Jones Industrial Average (DJIA).

Stars: ✭ 57 (+235.29%)

Mutual labels: web-crawler

Strong Web Crawler

An advanced web crawler built on C#/.NET, PhantomJS, and Selenium. It can execute JavaScript code, trigger various events, and manipulate the page's DOM structure.

Stars: ✭ 238 (+1300%)

Mutual labels: web-crawler

Kochat

Open-source Korean chatbot framework

Stars: ✭ 204 (+1100%)

Mutual labels: web-crawler

Antch

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

Stars: ✭ 198 (+1064.71%)

Mutual labels: web-crawler

Nutch

Apache Nutch is an extensible and scalable web crawler

Stars: ✭ 2,277 (+13294.12%)

Mutual labels: web-crawler

Zhihu Crawler People

A simple distributed crawler for Zhihu, plus data analysis

Stars: ✭ 182 (+970.59%)

Mutual labels: web-crawler

Crawler Commons

A set of reusable Java components that implement functionality common to any web crawler

Stars: ✭ 173 (+917.65%)

Mutual labels: web-crawler

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+11435.29%)

Mutual labels: web-crawler

Awesome Web Scraper

A collection of awesome web scrapers and crawlers.

Stars: ✭ 147 (+764.71%)

Mutual labels: web-crawler

Collector Http

Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

Stars: ✭ 130 (+664.71%)

Mutual labels: web-crawler

Proxy

A simple tool for fetching usable proxies from several websites.

Stars: ✭ 124 (+629.41%)

Mutual labels: web-crawler

Crawlab Lite

Lite version of Crawlab, a crawler management platform.

Stars: ✭ 122 (+617.65%)

Mutual labels: web-crawler

Pspider

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

Stars: ✭ 1,611 (+9376.47%)

Mutual labels: web-crawler

Pulsar

Turn large websites into tables and charts using simple SQL.

Stars: ✭ 100 (+488.24%)

Mutual labels: web-crawler

Infinitycrawler

A simple but powerful web crawler library for .NET

Stars: ✭ 97 (+470.59%)

Mutual labels: web-crawler

Ultimate Dork

Web Crawler

Stars: ✭ 79 (+364.71%)

Mutual labels: web-crawler

Ospider

An open-source tool for acquiring and preprocessing vector geographic data (POI, AOI, administrative districts, road networks, land use)

Stars: ✭ 74 (+335.29%)

Mutual labels: web-crawler

Cvpr2019

Displays all accepted CVPR 2019 papers in an easy-to-parse format.

Stars: ✭ 65 (+282.35%)

Mutual labels: web-crawler

Abotx

Cross Platform C# Web crawler framework, headless browser, parallel crawler. Please star this project! +1.

Stars: ✭ 63 (+270.59%)

Mutual labels: web-crawler

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (+270.59%)

Mutual labels: web-crawler

Crawlab

Distributed web crawler admin platform for managing spiders regardless of language or framework.

Stars: ✭ 8,392 (+49264.71%)

Mutual labels: web-crawler

Maman

Rust Web Crawler saving pages on Redis

Stars: ✭ 39 (+129.41%)

Mutual labels: web-crawler

Dutsso

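A Python module for quickly logging in to the Dalian University of Technology unified identity authentication system (SSO). It makes it easy to implement grade notifications, course grabbing, Yulan card information lookup, personal information queries, and similar features.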

Stars: ✭ 32 (+88.24%)

Mutual labels: web-crawler

Storm Crawler

A scalable, mature and versatile web crawler based on Apache Storm

Stars: ✭ 703 (+4035.29%)

Mutual labels: web-crawler

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+3758.82%)

Mutual labels: web-crawler

Awesome Crawler

A collection of awesome web crawlers and spiders in different languages

Stars: ✭ 4,793 (+28094.12%)

Mutual labels: web-crawler

Spider Flow

A next-generation crawler platform that defines crawler workflows graphically, so you can build a crawler without writing code.

Stars: ✭ 365 (+2047.06%)

Mutual labels: web-crawler

Sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Stars: ✭ 362 (+2029.41%)

Mutual labels: web-crawler

Ache

ACHE is a web crawler for domain-specific search.

Stars: ✭ 320 (+1782.35%)

Mutual labels: web-crawler

Supercrawler

A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

Stars: ✭ 306 (+1700%)

Mutual labels: web-crawler

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+1529.41%)

Mutual labels: web-crawler

Spidy

The simple, easy to use command line web crawler.

Stars: ✭ 257 (+1411.76%)

Mutual labels: web-crawler

Lagoujob

Job data mining repo for lagou.com