All Projects → Spider_python → Similar Projects or Alternatives

496 Open source projects that are alternatives of or similar to Spider_python

Docs
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Stars: ✭ 118 (-78.82%)
Mutual labels:  scrapy, xpath, requests
python-crawler
爬虫学习仓库,适合零基础的人学习,对新手比较友好
Stars: ✭ 37 (-93.36%)
Mutual labels:  requests, xpath, scrapy
Reptile
🏀 Python3 网络爬虫实战(部分含详细教程)猫眼 腾讯视频 豆瓣 研招网 微博 笔趣阁小说 百度热点 B站 CSDN 网易云阅读 阿里文学 百度股票 今日头条 微信公众号 网易云音乐 拉勾 有道 unsplash 实习僧 汽车之家 英雄联盟盒子 大众点评 链家 LPL赛程 台风 梦幻西游、阴阳师藏宝阁 天气 牛客网 百度文库 睡前故事 知乎 Wish
Stars: ✭ 1,048 (+88.15%)
Mutual labels:  scrapy, requests
Place2live
Analysis of the characteristics of different countries
Stars: ✭ 30 (-94.61%)
Mutual labels:  scrapy, requests
Scrapingoutsourcing
ScrapingOutsourcing专注分享爬虫代码 尽量每周更新一个
Stars: ✭ 164 (-70.56%)
Mutual labels:  scrapy, requests
web full stack application
show full stack technology applications : Scrapy + webservice[restful] + websocket + VueJS + MongoDB
Stars: ✭ 16 (-97.13%)
Mutual labels:  requests, scrapy
Sourcecodeofbook
《Python爬虫开发 从入门到实战》配套源代码。
Stars: ✭ 226 (-59.43%)
Mutual labels:  scrapy, requests
Python Spider
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Stars: ✭ 615 (+10.41%)
Mutual labels:  scrapy, xpath
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+4.67%)
Mutual labels:  scrapy, requests
OpenScraper
An open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (-85.64%)
Mutual labels:  xpath, scrapy
python-fxxk-spider
收集各种免费的 Python 爬虫项目
Stars: ✭ 184 (-66.97%)
Mutual labels:  requests, scrapy
Asks
Async requests-like httplib for python.
Stars: ✭ 429 (-22.98%)
Mutual labels:  requests
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Stars: ✭ 368 (-33.93%)
Mutual labels:  scrapy
Awesome Scrapy
A curated list of awesome packages, articles, and other cool resources from the Scrapy community.
Stars: ✭ 360 (-35.37%)
Mutual labels:  scrapy
Cpr
C++ Requests: Curl for People, a spiritual port of Python Requests.
Stars: ✭ 4,200 (+654.04%)
Mutual labels:  requests
Scrapy Rotating Proxies
use multiple proxies with Scrapy
Stars: ✭ 488 (-12.39%)
Mutual labels:  scrapy
Django Request
django-request is a statistics module for django. It stores requests in a database for admins to see, it can also be used to get statistics on who is online etc.
Stars: ✭ 419 (-24.78%)
Mutual labels:  requests
Robotframework Requests
Robot Framework keyword library wrapper for requests
Stars: ✭ 345 (-38.06%)
Mutual labels:  requests
Webspider
在线地址: http://119.23.223.90:8000
Stars: ✭ 340 (-38.96%)
Mutual labels:  requests
Drissionpage
A module that integrates selenium and requests session, encapsulates common page operations, can achieve seamless switching between the two modes.
Stars: ✭ 409 (-26.57%)
Mutual labels:  requests
Jsoupxpath
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java,ha ha.Just try it.
Stars: ✭ 331 (-40.57%)
Mutual labels:  xpath
Fluentdom
A fluent api for working with XML in PHP
Stars: ✭ 327 (-41.29%)
Mutual labels:  xpath
Haipproxy
💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+796.41%)
Mutual labels:  scrapy
Jiekou Python3
接口自动化测试框架——python版,支持HTTP,dubbo协议接口
Stars: ✭ 468 (-15.98%)
Mutual labels:  requests
Scrapydouban
豆瓣电影/豆瓣读书 Scarpy 爬虫
Stars: ✭ 400 (-28.19%)
Mutual labels:  scrapy
Elves
🎊 Design and implement of lightweight crawler framework.
Stars: ✭ 315 (-43.45%)
Mutual labels:  scrapy
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (-34.47%)
Mutual labels:  xpath
Camaro
camaro is an utility to transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
Stars: ✭ 438 (-21.36%)
Mutual labels:  xpath
Requests Threads
🎭 Twisted Deferred Thread backend for Requests.
Stars: ✭ 366 (-34.29%)
Mutual labels:  requests
Basex
BaseX Main Repository.
Stars: ✭ 515 (-7.54%)
Mutual labels:  xpath
Proxy requests
a class that uses scraped proxies to make http GET/POST requests (Python requests)
Stars: ✭ 357 (-35.91%)
Mutual labels:  requests
Httmock
A mocking library for requests
Stars: ✭ 421 (-24.42%)
Mutual labels:  requests
Vault
swiss army knife for hackers
Stars: ✭ 346 (-37.88%)
Mutual labels:  scrapy
Scrapy Redis
Redis-based components for Scrapy.
Stars: ✭ 4,998 (+797.31%)
Mutual labels:  scrapy
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (-39.86%)
Mutual labels:  xpath
Requests Respectful
Minimalist Requests wrapper to work within rate limits of any amount of services simultaneously. Parallel processing friendly.
Stars: ✭ 417 (-25.13%)
Mutual labels:  requests
Htmlquery
htmlquery is golang XPath package for HTML query.
Stars: ✭ 338 (-39.32%)
Mutual labels:  xpath
Pycookiecheat
Borrow cookies from your browser's authenticated session for use in Python scripts.
Stars: ✭ 465 (-16.52%)
Mutual labels:  requests
Node Request Retry
💂 Wrap NodeJS request module to retry http requests in case of errors
Stars: ✭ 330 (-40.75%)
Mutual labels:  requests
Khttp
Kotlin HTTP requests library. Similar to Python requests.
Stars: ✭ 410 (-26.39%)
Mutual labels:  requests
J.a.r.v.i.s
python powered Intelligent System
Stars: ✭ 325 (-41.65%)
Mutual labels:  requests
Curl
Custom PHP curl library for the Laravel 5 framework - developed by Ixudra
Stars: ✭ 537 (-3.59%)
Mutual labels:  requests
Spiderman
基于 scrapy-redis 的通用分布式爬虫框架
Stars: ✭ 392 (-29.62%)
Mutual labels:  scrapy
Begoneads
BeGoneAds is a script that puts some popular hosts file lists into the systems hosts file as a adblocker measure.
Stars: ✭ 314 (-43.63%)
Mutual labels:  requests
Renrenbackup
A backup tool for renren.com
Stars: ✭ 309 (-44.52%)
Mutual labels:  requests
Crawlerforreader
Android 本地网络小说爬虫,基于jsoup及xpath
Stars: ✭ 312 (-43.99%)
Mutual labels:  xpath
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-16.7%)
Mutual labels:  scrapy
Files
Docs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
Stars: ✭ 390 (-29.98%)
Mutual labels:  scrapy
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (-44.52%)
Mutual labels:  scrapy
Turkce Python Kaynaklari
Türkçe olarak hazırlanmış Python programlama dili ile ilgili içeriklerin derlendiği sayfa.
Stars: ✭ 295 (-47.04%)
Mutual labels:  requests
Advanced Web Scraping Tutorial
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
Stars: ✭ 384 (-31.06%)
Mutual labels:  scrapy
Exist
eXist Native XML Database and Application Platform
Stars: ✭ 294 (-47.22%)
Mutual labels:  xpath
Dianping textmining
大众点评评论文本挖掘,包括点评数据爬取、数据清洗入库、数据分析、评论情感分析等的完整挖掘项目
Stars: ✭ 289 (-48.11%)
Mutual labels:  requests
Lassie
Web Content Retrieval for Humans™
Stars: ✭ 521 (-6.46%)
Mutual labels:  requests
Wring
Extract content from webpages using CSS Selectors, XPath, and JS expressions
Stars: ✭ 462 (-17.06%)
Mutual labels:  xpath
Many requests
Dead easy interface for executing many HTTP requests asynchronously. Also provides helper functions for executing embarrassingly parallel async coroutines.
Stars: ✭ 384 (-31.06%)
Mutual labels:  requests
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (-48.65%)
Mutual labels:  requests
Scrapy Crawlera
Crawlera middleware for Scrapy
Stars: ✭ 281 (-49.55%)
Mutual labels:  scrapy
Bilili
🍻 bilibili video (including bangumi) and danmaku downloader | B站视频(含番剧)、弹幕下载器
Stars: ✭ 379 (-31.96%)
Mutual labels:  requests
Alltheplaces
A set of spiders and scrapers to extract location information from places that post their location on the internet.
Stars: ✭ 277 (-50.27%)
Mutual labels:  scrapy
1-60 of 496 similar projects