All Projects → scrapy-zyte-smartproxy → Similar Projects or Alternatives

435 Open source projects that are alternatives of or similar to scrapy-zyte-smartproxy

torchestrator

Spin up Tor containers and then proxy HTTP requests via these Tor instances

Stars: ✭ 32 (-89.91%)

Mutual labels: scraping, scrapy

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (+46.37%)

Mutual labels: scraping, scrapy

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+223.03%)

Mutual labels: scraping, scrapy

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (-78.55%)

Mutual labels: scraping, scrapy

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (+83.91%)

Mutual labels: scraping, scrapy

Seleniumcrawler

An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site

Stars: ✭ 117 (-63.09%)

Mutual labels: scraping, scrapy

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-93.06%)

Mutual labels: scraping, scrapy

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-68.45%)

Mutual labels: scraping, scrapy

memes-api

API for scrapping common meme sites

Stars: ✭ 17 (-94.64%)

Mutual labels: scraping, scrapy

Post Tuto Deployment

Build and deploy a machine learning app from scratch 🚀

Stars: ✭ 368 (+16.09%)

Mutual labels: scraping, scrapy

Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy

Stars: ✭ 309 (-2.52%)

Mutual labels: scraping, scrapy

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-94.64%)

Mutual labels: scraping, scrapy

RARBG-scraper

With Selenium headless browsing and CAPTCHA solving

Stars: ✭ 38 (-88.01%)

Mutual labels: scraping, scrapy

double-agent

A test suite of common scraper detection techniques. See how detectable your scraper stack is.

Stars: ✭ 123 (-61.2%)

Mutual labels: scraping, scrapy

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (-74.45%)

Mutual labels: scraping, scrapy

scrapy-distributed

A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.

Stars: ✭ 38 (-88.01%)

Mutual labels: scraping, scrapy

proxi

Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.

Stars: ✭ 32 (-89.91%)

Mutual labels: scraping, scrapy

Scrapy Crawlera

Crawlera middleware for Scrapy

Stars: ✭ 281 (-11.36%)

Mutual labels: scraping, scrapy

Scrapy Cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

Stars: ✭ 921 (+190.54%)

Mutual labels: scraping, scrapy

InstaBot

Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.

Stars: ✭ 32 (-89.91%)

Mutual labels: scraping, scrapy

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-93.06%)

Mutual labels: scraping, scrapy

Data-Engineering-Projects

Personal Data Engineering Projects

Stars: ✭ 167 (-47.32%)

Mutual labels: scrapy

web-clipper

Easily download the main content of a web page in html, markdown, and/or epub format from command line.

Stars: ✭ 15 (-95.27%)

Mutual labels: scraping

internet-affordability

🌍 Dataset that shows the Internet affordability by country (a shocking reality!)

Stars: ✭ 13 (-95.9%)

Mutual labels: scraping

logparser

A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.

Stars: ✭ 70 (-77.92%)

Mutual labels: scrapy

shup

A POSIX shell script to parse HTML

Stars: ✭ 28 (-91.17%)

Mutual labels: scraping

AngleParse

HTML parsing and processing tool for PowerShell.

Stars: ✭ 35 (-88.96%)

Mutual labels: scraping

bgmtools

Bangumi小工具

Stars: ✭ 66 (-79.18%)

Mutual labels: scrapy

python-spider

python爬虫小项目【持续更新】【笔趣阁小说下载、Tweet数据抓取、天气查询、网易云音乐逆向、天天基金网查询、微博数据抓取（生成cookie）、有道翻译逆向、企查查免登陆爬虫、大众点评svg加密破解、B站用户爬虫、拉钩免登录爬虫、自如租房字体加密、知乎问答

Stars: ✭ 45 (-85.8%)

Mutual labels: scrapy

naos

📉 Uptime and error monitoring CLI

Stars: ✭ 30 (-90.54%)

Mutual labels: scraping

dmi-instascraper

A GUI for Instaloader to scrape users and hashtags with on Instagram

Stars: ✭ 21 (-93.38%)

Mutual labels: scraping

scraping-ebay

Scraping Ebay's products using Scrapy Web Crawling Framework

Stars: ✭ 79 (-75.08%)

Mutual labels: scrapy

kuwala

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…

Stars: ✭ 474 (+49.53%)

Mutual labels: scraping

pomp

Screen scraping and web crawling framework

Stars: ✭ 61 (-80.76%)

Mutual labels: scraping

restaurant-finder-featureReviews

Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).

Stars: ✭ 21 (-93.38%)

Mutual labels: scrapy

IMDB-Scraper

Scrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.

Stars: ✭ 37 (-88.33%)

Mutual labels: scrapy

gunaydin

Your good mornings ☀️

Stars: ✭ 16 (-94.95%)

Mutual labels: scraping

OpenScraper

An open source webapp for scraping: towards a public service for webscraping

Stars: ✭ 80 (-74.76%)

Mutual labels: scrapy

hk0weather

Web scraper project to collect the useful Hong Kong weather data from HKO website

Stars: ✭ 49 (-84.54%)

Mutual labels: scrapy

top-github-scraper

Scape top GitHub repositories and users based on keywords

Stars: ✭ 40 (-87.38%)

Mutual labels: scraping

document-dl

Command line program to download documents from web portals.

Stars: ✭ 14 (-95.58%)

Mutual labels: scraping

chesf

CHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages

Stars: ✭ 18 (-94.32%)

Mutual labels: scraping

subscene scraper

Library to download subtitles from subscene.com

Stars: ✭ 14 (-95.58%)

Mutual labels: scraping

Intelligent Document Finder

Document Search Engine Tool

Stars: ✭ 45 (-85.8%)

Mutual labels: scrapy

photo-spider-scrapy

10 photo website spiders, 10 个国外图库的 scrapy 爬虫代码

Stars: ✭ 17 (-94.64%)

Mutual labels: scrapy

XMQ-BackUp

小密圈备份，圈子/话题/图片/文件。

Stars: ✭ 22 (-93.06%)

Mutual labels: scrapy

GPlayCrawler

No description or website provided.

Stars: ✭ 47 (-85.17%)

Mutual labels: scrapy

V2EX Spider

V2EX爬虫

Stars: ✭ 21 (-93.38%)

Mutual labels: scrapy

Scrapy-Spiders

一个基于Scrapy的数据采集爬虫代码库

Stars: ✭ 34 (-89.27%)

Mutual labels: scrapy

factory

Docker microservice & Crawler by scrapy

Stars: ✭ 56 (-82.33%)

Mutual labels: scrapy

scrapy.dart

Scrapy, a fast high-level web crawling & scraping framework for dart and Flutter

Stars: ✭ 50 (-84.23%)

Mutual labels: scrapy

BOC FER Spider

Use Scrapy crawl foreign exchange rate from BOC (Bank of China)

Stars: ✭ 18 (-94.32%)

Mutual labels: scrapy

go-scrapy

Web crawling and scraping framework for Golang

Stars: ✭ 17 (-94.64%)

Mutual labels: scraping

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-95.27%)

Mutual labels: scrapy

ImageGrabber

A Scrapy demo : Download all images from a site

Stars: ✭ 33 (-89.59%)

Mutual labels: scrapy

NovelCrawler

基于Scrapy的爬虫demo

Stars: ✭ 15 (-95.27%)

Mutual labels: scrapy

Scrapy IPProxyPool

免费 IP 代理池。Scrapy 爬虫框架插件

Stars: ✭ 100 (-68.45%)

Mutual labels: scrapy

rubium

Rubium is a lightweight alternative to Selenium/Capybara/Watir if you need to perform some operations (like web scraping) using Headless Chromium and Ruby

Stars: ✭ 65 (-79.5%)

Mutual labels: scraping

JustDownlink

基于Scrapy+Elasticsearch+Django搭建的分布式电影搜索

Stars: ✭ 28 (-91.17%)

Mutual labels: scrapy

elves

🎊 Design and implement of lightweight crawler framework.

Stars: ✭ 322 (+1.58%)

Mutual labels: scrapy

1-60 of 435 similar projects

›

next*5