1381 Open source projects that are alternatives of or similar to Dotnetcrawler

Webmagic
A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+10086%)
Mutual labels:  crawler, scraping
Docs
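Source code for "Data Collection: From Getting Started to Giving Up". Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK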
Stars: ✭ 118 (+18%)
Mutual labels:  crawler, scrapy
Crawler
Crawlers, HTTP proxies, and simulated login!
Stars: ✭ 106 (+6%)
Mutual labels:  crawler, scrapy
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (+25%)
Mutual labels:  crawler, crawling
Crawlab Lite
Lite version of Crawlab, a crawler management platform.
Stars: ✭ 122 (+22%)
Mutual labels:  crawler, scrapy
Youtube Projects
This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+44%)
Mutual labels:  crawler, webscraping
D4n155
OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (+5%)
Mutual labels:  crawler, scraping
N2h4
A tool for collecting Naver news
Stars: ✭ 177 (+77%)
Mutual labels:  crawler, crawling
Scrapingoutsourcing
ScrapingOutsourcing focuses on sharing crawler code, aiming to add a new one every week
Stars: ✭ 164 (+64%)
Mutual labels:  crawler, scrapy
Goose Parser
Universal scraping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+111%)
Mutual labels:  crawler, scraping
Spidermon
Scrapy Extension for monitoring spiders execution.
Stars: ✭ 309 (+209%)
Mutual labels:  scraping, crawling
Scrapy Examples
Some Scrapy and web.py examples
Stars: ✭ 71 (-29%)
Mutual labels:  crawler, scrapy
Seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (+17%)
Mutual labels:  scrapy, scraping
socials
👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-63%)
Mutual labels:  scraping, crawling
Simplcommerce
A simple, cross platform, modularized ecommerce system built on .NET Core
Stars: ✭ 3,474 (+3374%)
Vault
swiss army knife for hackers
Stars: ✭ 346 (+246%)
Mutual labels:  crawler, scrapy
scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
Stars: ✭ 42 (-58%)
Mutual labels:  scraping, crawling
zcrawl
An open source web crawling platform
Stars: ✭ 21 (-79%)
Mutual labels:  scraping, crawling
crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-78%)
Mutual labels:  scraping, crawling
InstaBot
Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (-68%)
Mutual labels:  scraping, scrapy
Polite
Be nice on the web
Stars: ✭ 253 (+153%)
Mutual labels:  crawler, webscraping
chesf
CHeSF is the Chrome Headless Scraping Framework, very alpha code for scraping JavaScript-intensive web pages
Stars: ✭ 18 (-82%)
Mutual labels:  scraping, webscraping
go-scrapy
Web crawling and scraping framework for Golang
Stars: ✭ 17 (-83%)
Mutual labels:  scraping, crawling
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-48%)
Mutual labels:  scraping, crawling
Module Shop
A simple, cross-platform, modular online store system built on .NET Core
Stars: ✭ 398 (+298%)
hk0weather
Web scraper project to collect the useful Hong Kong weather data from HKO website
Stars: ✭ 49 (-51%)
Mutual labels:  scrapy, webscraping
feedsearch-crawler
Crawl sites for RSS, Atom, and JSON feeds.
Stars: ✭ 23 (-77%)
Mutual labels:  scraping, crawling
pomp
Screen scraping and web crawling framework
Stars: ✭ 61 (-39%)
Mutual labels:  scraping, crawling
Wswp
Code for the second edition Web Scraping with Python book by Packt Publications
Stars: ✭ 112 (+12%)
Mutual labels:  scrapy, webscraping
Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Stars: ✭ 83 (-17%)
Mutual labels:  crawler, scrapy
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-52%)
Mutual labels:  crawler, crawling
memes-api
API for scraping common meme sites
Stars: ✭ 17 (-83%)
Mutual labels:  scraping, scrapy
img-cli
An interactive command-line interface built in Node.js for downloading one or more images to disk from a URL
Stars: ✭ 15 (-85%)
Mutual labels:  crawler, crawling
schedule-tweet
Schedules tweets using TweetDeck
Stars: ✭ 14 (-86%)
Mutual labels:  scraping, webscraping
Dotnetspider
DotnetSpider, a .NET Standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+3133%)
Mutual labels:  crawler, dotnetcore
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+264%)
Mutual labels:  crawler, crawling
Dataflowkit
Extract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+356%)
Mutual labels:  scraping, crawling
Email Extractor
The main functionality is to extract all the emails from one or several URLs
Stars: ✭ 81 (-19%)
Mutual labels:  scrapy, scraping
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+209%)
Mutual labels:  scrapy, scraping
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+436%)
Mutual labels:  crawler, scrapy
Scrapy Selenium
Scrapy middleware to handle javascript pages using selenium
Stars: ✭ 550 (+450%)
Mutual labels:  scrapy, crawling
Wechatsogou
A crawler interface for WeChat official accounts, based on Sogou WeChat search
Stars: ✭ 5,220 (+5120%)
Mutual labels:  crawler, scrapy
Post Tuto Deployment
Build and deploy a machine learning app from scratch 🚀
Stars: ✭ 368 (+268%)
Mutual labels:  scrapy, scraping
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (+529%)
Mutual labels:  crawler, scrapy
Awesome Python Primer
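An index of high-quality Chinese-language resources for self-taught Python beginners, including books, documentation, and videos, covering crawlers, web development, data analysis, and machine learning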
Stars: ✭ 57 (-43%)
Mutual labels:  crawler, scraping
Py3 scripts
Life is short, *****.
Stars: ✭ 5 (-95%)
Mutual labels:  crawler, scrapy
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-85%)
Mutual labels:  crawler, scraping
Scrapy Redis
Redis-based components for Scrapy.
Stars: ✭ 4,998 (+4898%)
Mutual labels:  crawler, scrapy
Gazpacho
🥫 The simple, fast, and modern web scraping library
Stars: ✭ 525 (+425%)
Mutual labels:  scraping, webscraping
Haipproxy
💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis
Stars: ✭ 4,993 (+4893%)
Mutual labels:  crawler, scrapy
Newcrawler
Free Web Scraping Tool with Java
Stars: ✭ 589 (+489%)
Mutual labels:  crawler, scraping
Scrapy Cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
Stars: ✭ 921 (+821%)
Mutual labels:  scrapy, scraping
Mailinglistscraper
A python web scraper for public email lists.
Stars: ✭ 19 (-81%)
Mutual labels:  scrapy, webscraping
Scrapy Azuresearch Crawler Samples
Scrapy as a Web Crawler for Azure Search Samples
Stars: ✭ 20 (-80%)
Mutual labels:  crawler, scrapy
Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+1628%)
Mutual labels:  scraping, crawling
Memorious
Distributed crawling framework for documents and structured data.
Stars: ✭ 248 (+148%)
Mutual labels:  scraping, crawling
scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
Stars: ✭ 317 (+217%)
Mutual labels:  scraping, scrapy
Pdf downloader
A Scrapy Spider for downloading PDF files from a webpage.
Stars: ✭ 18 (-82%)
Mutual labels:  scrapy, crawling
Configs
Public, free to use, repository with diggers configs for scraping / extracting data from various e-commerce websites and online stores
Stars: ✭ 37 (-63%)
Mutual labels:  scraping, webscraping
Crawlab
Distributed web crawler admin platform for managing spiders regardless of language or framework.
Stars: ✭ 8,392 (+8292%)
Mutual labels:  crawler, scrapy