All Projects → Cnkispider → Similar Projects or Alternatives

227 Open source projects that are alternatives of or similar to Cnkispider

Scrapy Azuresearch Crawler Samples

Scrapy as a Web Crawler for Azure Search Samples

Stars: ✭ 20 (-82.91%)

Mutual labels: scrapy

Funpyspidersearchengine

Word2vec 千人千面个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索

Stars: ✭ 782 (+568.38%)

Mutual labels: scrapy

Taobao duoshou

使用Scrapy采集淘宝数据，Flask展示

Stars: ✭ 63 (-46.15%)

Mutual labels: scrapy

App comments spider

爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy)，过滤器采用了bloomfilter。

Stars: ✭ 38 (-67.52%)

Mutual labels: scrapy

Python Spider

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

Stars: ✭ 615 (+425.64%)

Mutual labels: scrapy

Capturer

capture pictures from website like sina, lofter, huaban and so on

Stars: ✭ 76 (-35.04%)

Mutual labels: scrapy

Pdf downloader

A Scrapy Spider for downloading PDF files from a webpage.

Stars: ✭ 18 (-84.62%)

Mutual labels: scrapy

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-14.53%)

Mutual labels: scrapy

Webhubbot

Python + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.

Stars: ✭ 5,427 (+4538.46%)

Mutual labels: scrapy

Scrapy Craigslist

Web Scraping Craigslist's Engineering Jobs in NY with Scrapy

Stars: ✭ 54 (-53.85%)

Mutual labels: scrapy

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+7072.65%)

Mutual labels: scrapy

Spider python

python爬虫

Stars: ✭ 557 (+376.07%)

Mutual labels: scrapy

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (-30.77%)

Mutual labels: scrapy

Place2live

Analysis of the characteristics of different countries

Stars: ✭ 30 (-74.36%)

Mutual labels: scrapy

Scrapyd Cluster On Heroku

Set up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉

Stars: ✭ 106 (-9.4%)

Mutual labels: scrapy

Scrapy Cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

Stars: ✭ 921 (+687.18%)

Mutual labels: scrapy

Image Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

Stars: ✭ 1,173 (+902.56%)

Mutual labels: scrapy

Seeker

Seeker - another job board aggregator.

Stars: ✭ 16 (-86.32%)

Mutual labels: scrapy

Programer log

最新动态在这里【我的程序员日志】

Stars: ✭ 112 (-4.27%)

Mutual labels: scrapy

Jd spider

两只蠢萌京东的分布式爬虫.

Stars: ✭ 738 (+530.77%)

Mutual labels: scrapy

Warta Scrap

Indonesia Index News Crawler, including 10 online media

Stars: ✭ 57 (-51.28%)

Mutual labels: scrapy

Scrapyrt

HTTP API for Scrapy spiders

Stars: ✭ 637 (+444.44%)

Mutual labels: scrapy

Proxy server crawler

an awesome public proxy server crawler based on scrapy framework

Stars: ✭ 94 (-19.66%)

Mutual labels: scrapy

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (+398.29%)

Mutual labels: scrapy

Scrapy Pyppeteer

Pyppeteer integration for Scrapy

Stars: ✭ 48 (-58.97%)

Mutual labels: scrapy

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+775.21%)

Mutual labels: scrapy

Scrapy Selenium

Scrapy middleware to handle javascript pages using selenium

Stars: ✭ 550 (+370.09%)

Mutual labels: scrapy

Taiwan News Crawlers

Scrapy-based Crawlers for news of Taiwan

Stars: ✭ 83 (-29.06%)

Mutual labels: scrapy

Articlespider

慕课网python分布式爬虫源码-长期更新维护

Stars: ✭ 40 (-65.81%)

Mutual labels: scrapy

Crawler

爬虫, http代理, 模拟登陆!

Stars: ✭ 106 (-9.4%)

Mutual labels: scrapy

Scrapymon

Simple Web UI for Scrapy spider management via Scrapyd

Stars: ✭ 35 (-70.09%)

Mutual labels: scrapy

Olxscraper

OLX Scraper in Python Scrapy

Stars: ✭ 76 (-35.04%)

Mutual labels: scrapy

Jspider

JSpider会每周更新至少一个网站的JS解密方式，欢迎 Star，交流微信：13298307816

Stars: ✭ 914 (+681.2%)

Mutual labels: scrapy

Scrala

Unmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege

Stars: ✭ 113 (-3.42%)

Mutual labels: scrapy

Voyages Sncf Api

A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.

Stars: ✭ 7 (-94.02%)

Mutual labels: scrapy

Scrapy Examples

Some scrapy and web.py exmaples

Stars: ✭ 71 (-39.32%)

Mutual labels: scrapy

Mailinglistscraper

A python web scraper for public email lists.

Stars: ✭ 19 (-83.76%)

Mutual labels: scrapy

Decoration Design Crawler

土巴兔和谷居装修网站爬虫

Stars: ✭ 105 (-10.26%)

Mutual labels: scrapy

Scrapy Finance

[OUTDATED] scrapy spiders to crawl the financial text data 📚 📜 pertinent to train word vectors 🚀

Stars: ✭ 17 (-85.47%)

Mutual labels: scrapy

Alipayspider Scrapy

AlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)

Stars: ✭ 70 (-40.17%)

Mutual labels: scrapy

Py3 scripts

Life is short, *****.

Stars: ✭ 5 (-95.73%)

Mutual labels: scrapy

Patentcrawler

scrapy专利爬虫（停止维护）

Stars: ✭ 114 (-2.56%)

Mutual labels: scrapy

House Renting

Possibly the best practice of Scrapy 🕷 and renting a house 🏡

Stars: ✭ 741 (+533.33%)

Mutual labels: scrapy

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (-46.15%)

Mutual labels: scrapy

Tweetscraper

TweetScraper is a simple crawler/spider for Twitter Search without using API

Stars: ✭ 694 (+493.16%)

Mutual labels: scrapy

Experiments

Some research experiments

Stars: ✭ 95 (-18.8%)

Mutual labels: scrapy

Faster Than Requests

Faster requests on Python 3

Stars: ✭ 639 (+446.15%)

Mutual labels: scrapy

Scrapy S3pipeline

Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.

Stars: ✭ 57 (-51.28%)

Mutual labels: scrapy

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+437.61%)

Mutual labels: scrapy

Wswp

Code for the second edition Web Scraping with Python book by Packt Publications

Stars: ✭ 112 (-4.27%)

Mutual labels: scrapy

Pythonspidernotes

Python入门网络爬虫之精华版

Stars: ✭ 5,634 (+4715.38%)

Mutual labels: scrapy

Reptile

🏀 Python3 网络爬虫实战（部分含详细教程）猫眼腾讯视频豆瓣研招网微博笔趣阁小说百度热点 B站 CSDN 网易云阅读阿里文学百度股票今日头条微信公众号网易云音乐拉勾有道 unsplash 实习僧汽车之家英雄联盟盒子大众点评链家 LPL赛程台风梦幻西游、阴阳师藏宝阁天气牛客网百度文库睡前故事知乎 Wish

Stars: ✭ 1,048 (+795.73%)

Mutual labels: scrapy

Wechatsogou

基于搜狗微信搜索的微信公众号爬虫接口

Stars: ✭ 5,220 (+4361.54%)

Mutual labels: scrapy

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+1029.91%)

Mutual labels: scrapy

Wescraper

依赖Scrapy和搜狗搜索微信公众号文章

Stars: ✭ 46 (-60.68%)

Mutual labels: scrapy

Maria Quiteria

Backend para coleta e disponibilização dos dados 📜

Stars: ✭ 115 (-1.71%)

Mutual labels: scrapy

Weibo hot search

微博爬虫：每天定时爬取微博热搜榜的内容，留下互联网人的记忆。