All Projects → Scrapoxy → Similar Projects or Alternatives

2287 Open source projects that are alternatives of or similar to Scrapoxy

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-92.44%)

Mutual labels: crawler, scrapy

Google Play Scraper

Node.js scraper to get data from Google Play

Stars: ✭ 1,606 (+21.48%)

Mutual labels: crawler, scraper

Instagram Proxy Api

CORS compliant API to access Instagram's public data

Stars: ✭ 245 (-81.47%)

Mutual labels: scraper, proxy

Free proxy website

获取免费socks/https/http代理的网站集合

Stars: ✭ 119 (-91%)

Mutual labels: crawler, proxy

Docs

《数据采集从入门到放弃》源码。内容简介：爬虫介绍、就业情况、爬虫工程师面试题；HTTP协议介绍； Requests使用；解析器Xpath介绍； MongoDB与MySQL；多线程爬虫； Scrapy介绍；Scrapy-redis介绍；使用docker部署；使用nomad管理docker集群；使用EFK查询docker日志

Stars: ✭ 118 (-91.07%)

Mutual labels: crawler, scrapy

Newspaper

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Stars: ✭ 11,545 (+773.3%)

Mutual labels: crawler, scraper

Proxyscrape

Python library for retrieving free proxies (HTTP, HTTPS, SOCKS4, SOCKS5).

Stars: ✭ 134 (-89.86%)

Mutual labels: scraper, proxy

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+61.04%)

Mutual labels: crawler, scrapy

Ngmeta

Dynamic meta tags in your AngularJS single page application

Stars: ✭ 152 (-88.5%)

Mutual labels: crawler, angularjs

Scrapingoutsourcing

ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个

Stars: ✭ 164 (-87.59%)

Mutual labels: crawler, scrapy

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (-89.11%)

Mutual labels: crawler, scraper

Spoon

🥄 A package for building specific Proxy Pool for different Sites.

Stars: ✭ 173 (-86.91%)

Mutual labels: crawler, proxy

Seleniumcrawler

An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site

Stars: ✭ 117 (-91.15%)

Mutual labels: scraper, scrapy

Media Scraper

Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok

Stars: ✭ 206 (-84.42%)

Mutual labels: crawler, scraper

Tianyancha

pip安装的天眼查爬虫API，指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.

Stars: ✭ 206 (-84.42%)

Mutual labels: crawler, scraper

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+1075.11%)

Mutual labels: crawler, scraper

Social Scraper

Tổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt

Stars: ✭ 47 (-96.44%)

Mutual labels: crawler, scraper

Skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (-82.53%)

Mutual labels: crawler, scraper

Ecommercecrawlers

码云仓库链接:AJay13/ECommerceCrawlers Github 仓库链接:DropsDevopsOrg/ECommerceCrawlers 项目展示平台链接:http://wechat.doonsec.com

Stars: ✭ 3,073 (+132.45%)

Mutual labels: crawler, scrapy

Gobetween

☁️ Modern & minimalistic load balancer for the Сloud era

Stars: ✭ 1,631 (+23.37%)

Mutual labels: cloud, proxy

Querylist

🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

Stars: ✭ 2,392 (+80.94%)

Mutual labels: crawler, scraper

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-98.87%)

Mutual labels: scraper, scrapy

scrapy-LBC

Araignée LeBonCoin avec Scrapy et ElasticSearch

Stars: ✭ 14 (-98.94%)

Mutual labels: scraper, scrapy

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-98.34%)

Mutual labels: scraper, scrapy

Skipper

An HTTP router and reverse proxy for service composition, including use cases like Kubernetes Ingress

Stars: ✭ 2,606 (+97.13%)

Mutual labels: cloud, proxy

dijnet-bot

Az összes számlád még egy helyen :)

Stars: ✭ 17 (-98.71%)

Mutual labels: crawler, scraper

ptt-web-crawler

PTT 網路版爬蟲

Stars: ✭ 20 (-98.49%)

Mutual labels: crawler, scrapy

MyCrawler

我的爬虫合集

Stars: ✭ 55 (-95.84%)

Mutual labels: crawler, scraper

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (-88.35%)

Mutual labels: scrapy, proxy

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+208.4%)

Mutual labels: crawler, scraper

Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy

Stars: ✭ 309 (-76.63%)

Mutual labels: scraper, scrapy

Vault

swiss army knife for hackers

Stars: ✭ 346 (-73.83%)

Mutual labels: crawler, scrapy

Hquery.php

An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

Stars: ✭ 295 (-77.69%)

Mutual labels: crawler, scraper

Advanced Web Scraping Tutorial

The Zipru scraper developed in the Advanced Web Scraping Tutorial.

Stars: ✭ 384 (-70.95%)

Mutual labels: scraper, scrapy

Spring Cloud Microservice Examples

spring-cloud-microservice-examples

Stars: ✭ 372 (-71.86%)

Mutual labels: cloud, angularjs

Bookcorpus

Crawl BookCorpus

Stars: ✭ 443 (-66.49%)

Mutual labels: crawler, scraper

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (-22.54%)

Mutual labels: scraper, scrapy

Warta Scrap

Indonesia Index News Crawler, including 10 online media

Stars: ✭ 57 (-95.69%)

Mutual labels: scraper, scrapy

Scrapy Rotating Proxies

use multiple proxies with Scrapy

Stars: ✭ 488 (-63.09%)

Mutual labels: scrapy, proxy

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+265.89%)

Mutual labels: crawler, scraper

Haipproxy

💖 High available distributed ip proxy pool, powerd by Scrapy and Redis

Stars: ✭ 4,993 (+277.69%)

Mutual labels: crawler, scrapy

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (-64.9%)

Mutual labels: crawler, scrapy

Goscraper

Golang pkg to quickly return a preview of a webpage (title/description/images)

Stars: ✭ 72 (-94.55%)

Mutual labels: crawler, scraper

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+287.97%)

Mutual labels: crawler, scraper

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (-79.27%)

Mutual labels: crawler, scraper

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (-40.92%)

Mutual labels: crawler, scraper

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (-50.38%)

Mutual labels: crawler, scraper

Py3 scripts

Life is short, *****.

Stars: ✭ 5 (-99.62%)

Mutual labels: crawler, scrapy

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (-95.23%)

Mutual labels: crawler, scrapy

Voyages Sncf Api

A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.

Stars: ✭ 7 (-99.47%)

Mutual labels: scraper, scrapy

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-98.11%)

Mutual labels: crawler, scraper

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+534.8%)

Mutual labels: crawler, scrapy

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (-52.42%)

Mutual labels: crawler, scrapy

Weibo terminator workflow

Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!

Stars: ✭ 259 (-80.41%)

Mutual labels: crawler, scraper

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (-55.9%)

Mutual labels: crawler, scrapy

Pacbot

PacBot (Policy as Code Bot)

Stars: ✭ 1,017 (-23.07%)

Mutual labels: cloud, angularjs

Hproxy

hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)

Stars: ✭ 62 (-95.31%)

Mutual labels: crawler, proxy

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (-93.87%)

Mutual labels: scraper, scrapy

Gomplate

A flexible commandline tool for template rendering. Supports lots of local and remote datasources.

Stars: ✭ 1,270 (-3.93%)

Mutual labels: cloud

Enseada

A Cloud native multi-package registry

Stars: ✭ 80 (-93.95%)

Mutual labels: cloud

61-120 of 2287 similar projects

‹

›

next*5