All Projects → Scrapoxy → Similar Projects or Alternatives

2287 Open source projects that are alternatives of or similar to Scrapoxy

Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-92.44%)
Mutual labels:  crawler, scrapy
Google Play Scraper
Node.js scraper to get data from Google Play
Stars: ✭ 1,606 (+21.48%)
Mutual labels:  crawler, scraper
Instagram Proxy Api
CORS compliant API to access Instagram's public data
Stars: ✭ 245 (-81.47%)
Mutual labels:  scraper, proxy
Free proxy website
获取免费socks/https/http代理的网站集合
Stars: ✭ 119 (-91%)
Mutual labels:  crawler, proxy
Docs
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Stars: ✭ 118 (-91.07%)
Mutual labels:  crawler, scrapy
Newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+773.3%)
Mutual labels:  crawler, scraper
Proxyscrape
Python library for retrieving free proxies (HTTP, HTTPS, SOCKS4, SOCKS5).
Stars: ✭ 134 (-89.86%)
Mutual labels:  scraper, proxy
Python3 Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+61.04%)
Mutual labels:  crawler, scrapy
Ngmeta
Dynamic meta tags in your AngularJS single page application
Stars: ✭ 152 (-88.5%)
Mutual labels:  crawler, angularjs
Scrapingoutsourcing
ScrapingOutsourcing专注分享爬虫代码 尽量每周更新一个
Stars: ✭ 164 (-87.59%)
Mutual labels:  crawler, scrapy
Youtube Projects
This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (-89.11%)
Mutual labels:  crawler, scraper
Spoon
🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-86.91%)
Mutual labels:  crawler, proxy
Seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (-91.15%)
Mutual labels:  scraper, scrapy
Media Scraper
Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (-84.42%)
Mutual labels:  crawler, scraper
Tianyancha
pip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (-84.42%)
Mutual labels:  crawler, scraper
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+1075.11%)
Mutual labels:  crawler, scraper
Social Scraper
Tổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-96.44%)
Mutual labels:  crawler, scraper
Skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (-82.53%)
Mutual labels:  crawler, scraper
Ecommercecrawlers
码云仓库链接:AJay13/ECommerceCrawlers Github 仓库链接:DropsDevopsOrg/ECommerceCrawlers 项目展示平台链接:http://wechat.doonsec.com
Stars: ✭ 3,073 (+132.45%)
Mutual labels:  crawler, scrapy
Gobetween
☁️ Modern & minimalistic load balancer for the Сloud era
Stars: ✭ 1,631 (+23.37%)
Mutual labels:  cloud, proxy
Querylist
🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+80.94%)
Mutual labels:  crawler, scraper
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-98.87%)
Mutual labels:  scraper, scrapy
scrapy-LBC
Araignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (-98.94%)
Mutual labels:  scraper, scrapy
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-98.34%)
Mutual labels:  scraper, scrapy
Skipper
An HTTP router and reverse proxy for service composition, including use cases like Kubernetes Ingress
Stars: ✭ 2,606 (+97.13%)
Mutual labels:  cloud, proxy
dijnet-bot
Az összes számlád még egy helyen :)
Stars: ✭ 17 (-98.71%)
Mutual labels:  crawler, scraper
ptt-web-crawler
PTT 網路版爬蟲
Stars: ✭ 20 (-98.49%)
Mutual labels:  crawler, scrapy
MyCrawler
我的爬虫合集
Stars: ✭ 55 (-95.84%)
Mutual labels:  crawler, scraper
Fp Server
Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器,基于Tornado和Scrapy,在本地搭建属于自己的代理池
Stars: ✭ 154 (-88.35%)
Mutual labels:  scrapy, proxy
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+208.4%)
Mutual labels:  crawler, scraper
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (-76.63%)
Mutual labels:  scraper, scrapy
Vault
swiss army knife for hackers
Stars: ✭ 346 (-73.83%)
Mutual labels:  crawler, scrapy
Hquery.php
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (-77.69%)
Mutual labels:  crawler, scraper
Advanced Web Scraping Tutorial
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
Stars: ✭ 384 (-70.95%)
Mutual labels:  scraper, scrapy
Spring Cloud Microservice Examples
spring-cloud-microservice-examples
Stars: ✭ 372 (-71.86%)
Mutual labels:  cloud, angularjs
Bookcorpus
Crawl BookCorpus
Stars: ✭ 443 (-66.49%)
Mutual labels:  crawler, scraper
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (-22.54%)
Mutual labels:  scraper, scrapy
Warta Scrap
Indonesia Index News Crawler, including 10 online media
Stars: ✭ 57 (-95.69%)
Mutual labels:  scraper, scrapy
Scrapy Rotating Proxies
use multiple proxies with Scrapy
Stars: ✭ 488 (-63.09%)
Mutual labels:  scrapy, proxy
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+265.89%)
Mutual labels:  crawler, scraper
Haipproxy
💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+277.69%)
Mutual labels:  crawler, scrapy
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-64.9%)
Mutual labels:  crawler, scrapy
Goscraper
Golang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-94.55%)
Mutual labels:  crawler, scraper
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+287.97%)
Mutual labels:  crawler, scraper
Rcrawler
An R web crawler and scraper
Stars: ✭ 274 (-79.27%)
Mutual labels:  crawler, scraper
Crawler
A high performance web crawler in Elixir.
Stars: ✭ 781 (-40.92%)
Mutual labels:  crawler, scraper
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (-50.38%)
Mutual labels:  crawler, scraper
Py3 scripts
Life is short, *****.
Stars: ✭ 5 (-99.62%)
Mutual labels:  crawler, scrapy
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-95.23%)
Mutual labels:  crawler, scrapy
Voyages Sncf Api
A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Stars: ✭ 7 (-99.47%)
Mutual labels:  scraper, scrapy
Scrapit
Scraping scripts for various websites.
Stars: ✭ 25 (-98.11%)
Mutual labels:  crawler, scraper
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+534.8%)
Mutual labels:  crawler, scrapy
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (-52.42%)
Mutual labels:  crawler, scrapy
Weibo terminator workflow
Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
Stars: ✭ 259 (-80.41%)
Mutual labels:  crawler, scraper
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (-55.9%)
Mutual labels:  crawler, scrapy
Pacbot
PacBot (Policy as Code Bot)
Stars: ✭ 1,017 (-23.07%)
Mutual labels:  cloud, angularjs
Hproxy
hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)
Stars: ✭ 62 (-95.31%)
Mutual labels:  crawler, proxy
Email Extractor
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (-93.87%)
Mutual labels:  scraper, scrapy
Gomplate
A flexible commandline tool for template rendering. Supports lots of local and remote datasources.
Stars: ✭ 1,270 (-3.93%)
Mutual labels:  cloud
Enseada
A Cloud native multi-package registry
Stars: ✭ 80 (-93.95%)
Mutual labels:  cloud
61-120 of 2287 similar projects