All Projects → zhangchenchen → Proxyspider

zhangchenchen / Proxyspider

代理IP 采集程序

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Proxyspider

Mubeng
An incredibly fast proxy checker & IP rotator with ease.
Stars: ✭ 234 (-11.36%)
Mutual labels:  proxy
Icmptunnel
Transparently tunnel your IP traffic through ICMP echo and reply packets.
Stars: ✭ 2,753 (+942.8%)
Mutual labels:  proxy
Shadowsocks Windows
A C# port of shadowsocks
Stars: ✭ 53,572 (+20192.42%)
Mutual labels:  proxy
Flaresolverr
Proxy server to bypass Cloudflare protection
Stars: ✭ 241 (-8.71%)
Mutual labels:  proxy
Fasttunnel
NAT 内网穿透 远程内网计算机 域名访问内网站点 反向代理内网服务 花生壳 端口转发 http代理 微信 小程序 expose a local server behind a NAT or firewall to the internet like ngrok and frp. NAT ssh proxy tunnel reverse-proxy
Stars: ✭ 248 (-6.06%)
Mutual labels:  proxy
Bus
Bus 是一个基础框架、服务套件,它基于Java8编写,参考、借鉴了大量已有框架、组件的设计,可以作为后端服务的开发基础中间件。代码简洁,架构清晰,非常适合学习使用。
Stars: ✭ 253 (-4.17%)
Mutual labels:  proxy
Go Http Tunnel
Fast and secure tunnels over HTTP/2
Stars: ✭ 2,786 (+955.3%)
Mutual labels:  proxy
Multiple Host
虚拟host解决方案,轻松实现两套host环境
Stars: ✭ 258 (-2.27%)
Mutual labels:  proxy
Blackwhite
Domain, CIDR, IP black/white list, PAC generator
Stars: ✭ 251 (-4.92%)
Mutual labels:  proxy
Websocketd
Turn any program that uses STDIN/STDOUT into a WebSocket server. Like inetd, but for WebSockets.
Stars: ✭ 15,828 (+5895.45%)
Mutual labels:  proxy
Websocat
Command-line client for WebSockets, like netcat (or curl) for ws:// with advanced socat-like functions
Stars: ✭ 3,477 (+1217.05%)
Mutual labels:  proxy
Toxy
Hackable HTTP proxy for resiliency testing and simulated network conditions
Stars: ✭ 2,698 (+921.97%)
Mutual labels:  proxy
Websockify
Websockify is a WebSocket to TCP proxy/bridge. This allows a browser to connect to any application/server/service.
Stars: ✭ 2,942 (+1014.39%)
Mutual labels:  proxy
Kedge
kEdge - Kubernetes Edge Proxy for gRPC and HTTP Microservices
Stars: ✭ 244 (-7.58%)
Mutual labels:  proxy
Dorknet
Selenium powered Python script to automate searching for vulnerable web apps.
Stars: ✭ 256 (-3.03%)
Mutual labels:  proxy
Winxray
Xray / V2Ray( vmess/vless )、Shadowsocks、Trojan 通用客户端(Windows),可自动维持稳定上网 - 代理服务器异常自动切换,并提供一键部署代理服务端工具,使用 aardio 编写,绿色便携版免安装仅740KB、不需要.Net等外部运行库。
Stars: ✭ 241 (-8.71%)
Mutual labels:  proxy
Mallory
HTTP/HTTPS proxy over SSH
Stars: ✭ 251 (-4.92%)
Mutual labels:  proxy
Manba
HTTP API Gateway
Stars: ✭ 3,000 (+1036.36%)
Mutual labels:  proxy
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+999.62%)
Mutual labels:  proxy
Mtail
extract internal monitoring data from application logs for collection in a timeseries database
Stars: ✭ 3,028 (+1046.97%)
Mutual labels:  proxy

更新 ---2017-07-06

  • 因为代理IP源网站为了反爬取,会经常变更版版面,所以爬取策略做了一些修改,删除了一些无用的源。

  • 增加了代理IP的隐匿级别,最终输出的格式如下:

    proxy type anonymity_level
    212.126.113.179:80 HTTP Elite
    61.191.41.130:80 HTTP Elite
    47.52.18.182:80 HTTP Transparent

更新 ---2017-06-12

  • 因为vps的问题,该程序已经不再更新七牛云的代理IP 文件,也就是说该文件 不再是最新的代理IP。
  • 建议按照使用步骤自己爬取。

用途

用于采集网上的公用代理IP(源站见配置文件),目前该程序仍在跑,每隔三小时刷新一遍代理IP,并将可用代理IP 上传到七牛云,可用性测试用的百度,因为时效性,不保证完全可用,但能保证基本的可用性。

想直接获取可用代理IP 的可以直接使用这个

说明

文件更新慢问题

因为七牛CDN的原因,可能存在源文件已更新,但命中缓存仍然是之前没变的文件,这时 参考这里的第一种情况来解决。 可以在这个 url “http://7xrnwq.com1.z0.glb.clouddn.com/proxy_list.txt” 后面添加“?v=2000” ,比如 “http://7xrnwq.com1.z0.glb.clouddn.com/proxy_list.txt?v=2000” 数字可以随意,尽量大,即可获取最新的文件。

源码说明

config.py

相关配置文件,比如代理网站站点,代理IP 输出位置,超时时间等,详情见注释。

proxyspider.py

主程序,大致包括一个用于采集网上代理IP 的线程,多个用于测试代理IP 是否可用的线程,测试网站用的百度,详情见注释。

proxy_list.txt

最终代理IP 的输出文档。

qiniuupload.py

将proxy_list.txt,定期提交到七牛云存储,目前是维护在这里链接

如果要用此模块的话,需要先安装七牛的库,pip install qiniu

使用

注:建议使用virtualenv

git clone https://github.com/zhangchenchen/proxyspider.git

cd proxyspider

pip install -r requirements.txt

python proxyspider.py

注:程序跑完后,所有代理IP输出到当前目录下的proxy_list.txt

TO BE CONTINUED

  • 增加https 代理
  • 增加socks代理
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].