1381 open source projects that are alternatives to or similar to Dotnetcrawler

A scalable web crawler framework for Java.

Stars: ✭ 10,186 (+10086%)

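Source code for "Data Collection: From Getting Started to Giving Up". Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing Docker clusters with Nomad; querying Docker logs with EFK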

Stars: ✭ 118 (+18%)

Mutual labels: crawler, scrapy

Crawler

Crawlers, HTTP proxies, simulated login!

Stars: ✭ 106 (+6%)

Mutual labels: crawler, scrapy

Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

Stars: ✭ 125 (+25%)

Mutual labels: crawler, crawling

Crawlab Lite

Lite version of Crawlab, the crawler management platform

Stars: ✭ 122 (+22%)

Mutual labels: crawler, scrapy

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (+44%)

Mutual labels: crawler, webscraping

D4n155

OWASP D4N155 - Intelligent and dynamic wordlist using OSINT

Stars: ✭ 105 (+5%)

Mutual labels: crawler, scraping

N2h4

A tool for collecting Naver news

Stars: ✭ 177 (+77%)

Mutual labels: crawler, crawling

Scrapingoutsourcing

ScrapingOutsourcing focuses on sharing crawler code, aiming to add one new crawler each week

Stars: ✭ 164 (+64%)

Mutual labels: crawler, scrapy

Goose Parser

Universal scraping tool, which allows you to extract data using multiple environments

Stars: ✭ 211 (+111%)

Mutual labels: crawler, scraping

Spidermon

Scrapy extension for monitoring spider execution.

Stars: ✭ 309 (+209%)

Mutual labels: scraping, crawling

Scrapy Examples

Some Scrapy and web.py examples

Stars: ✭ 71 (-29%)

Mutual labels: crawler, scrapy

Seleniumcrawler

An example using Selenium WebDriver for Python and the Scrapy framework to create a web scraper that crawls an ASP site

Stars: ✭ 117 (+17%)

Mutual labels: scrapy, scraping

socials

👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.

Stars: ✭ 37 (-63%)

Mutual labels: scraping, crawling

Simplcommerce

A simple, cross platform, modularized ecommerce system built on .NET Core

Stars: ✭ 3,474 (+3374%)

Mutual labels: entity-framework-core, dotnetcore

Vault

Swiss army knife for hackers

Stars: ✭ 346 (+246%)

Mutual labels: crawler, scrapy

scrape-github-trending

Tutorial for web scraping / crawling with Node.js.

Stars: ✭ 42 (-58%)

Mutual labels: scraping, crawling

zcrawl

An open source web crawling platform

Stars: ✭ 21 (-79%)

Mutual labels: scraping, crawling

crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

Stars: ✭ 22 (-78%)

Mutual labels: scraping, crawling

InstaBot

Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.

Stars: ✭ 32 (-68%)

Mutual labels: scraping, scrapy

Polite

Be nice on the web

Stars: ✭ 253 (+153%)

Mutual labels: crawler, webscraping

chesf

CHeSF is the Chrome Headless Scraping Framework, very early alpha code for scraping JavaScript-intensive web pages

Stars: ✭ 18 (-82%)

Mutual labels: scraping, webscraping

go-scrapy

Web crawling and scraping framework for Golang

Stars: ✭ 17 (-83%)

Mutual labels: scraping, crawling

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-48%)

Mutual labels: scraping, crawling

Module Shop

A simple, cross-platform, modular e-commerce system built on .NET Core

Stars: ✭ 398 (+298%)

Mutual labels: entity-framework-core, dotnetcore

hk0weather

Web scraper project to collect useful Hong Kong weather data from the HKO website

Stars: ✭ 49 (-51%)

Mutual labels: scrapy, webscraping

feedsearch-crawler

Crawl sites for RSS, Atom, and JSON feeds.

Stars: ✭ 23 (-77%)

Mutual labels: scraping, crawling

pomp

Screen scraping and web crawling framework

Stars: ✭ 61 (-39%)

Mutual labels: scraping, crawling

Wswp

Code for the second edition of the book Web Scraping with Python by Packt Publications

Stars: ✭ 112 (+12%)

Mutual labels: scrapy, webscraping

Taiwan News Crawlers

Scrapy-based crawlers for Taiwanese news

Stars: ✭ 83 (-17%)

Mutual labels: crawler, scrapy

flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons

Stars: ✭ 48 (-52%)

Mutual labels: crawler, crawling

memes-api

API for scraping common meme sites

Stars: ✭ 17 (-83%)

Mutual labels: scraping, scrapy

img-cli

An interactive command-line interface built in Node.js for downloading one or more images to disk from a URL

Stars: ✭ 15 (-85%)

Mutual labels: crawler, crawling

schedule-tweet

Schedules tweets using TweetDeck

Stars: ✭ 14 (-86%)

Mutual labels: scraping, webscraping

Dotnetspider

DotnetSpider, a .NET Standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework

Stars: ✭ 3,233 (+3133%)

Mutual labels: crawler, dotnetcore

Webster

A reliable high-level web crawling & scraping framework for Node.js.

Stars: ✭ 364 (+264%)

Mutual labels: crawler, crawling

Dataflowkit

Extract structured data from websites. Website scraping.

Stars: ✭ 456 (+356%)

Mutual labels: scraping, crawling

Email Extractor

The main functionality is to extract all the emails from one or several URLs

Stars: ✭ 81 (-19%)

Mutual labels: scrapy, scraping

LinkedIn scraper using Selenium WebDriver, headless Chromium, Docker, and Scrapy

Stars: ✭ 309 (+209%)

Mutual labels: scrapy, scraping

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+436%)

Mutual labels: crawler, scrapy

Scrapy Selenium

Scrapy middleware to handle JavaScript pages using Selenium

Stars: ✭ 550 (+450%)

Mutual labels: scrapy, crawling

Wechatsogou

A crawler interface for WeChat official accounts, based on Sogou WeChat search

Stars: ✭ 5,220 (+5120%)

Mutual labels: crawler, scrapy

Post Tuto Deployment

Build and deploy a machine learning app from scratch 🚀

Stars: ✭ 368 (+268%)

Mutual labels: scrapy, scraping

Icrawler

A multi-threaded crawler framework with many built-in image crawlers.

Stars: ✭ 629 (+529%)

Mutual labels: crawler, scrapy

Awesome Python Primer

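An index of high-quality Chinese-language resources for self-taught Python beginners, including books, documentation, and videos, suited to crawling, web, data analysis, and machine learning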

Stars: ✭ 57 (-43%)

Mutual labels: crawler, scraping

Py3 scripts

Life is short, *****.

Stars: ✭ 5 (-95%)

Mutual labels: crawler, scrapy

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-85%)

Mutual labels: crawler, scraping

Scrapy Redis

Redis-based components for Scrapy.

Stars: ✭ 4,998 (+4898%)

Mutual labels: crawler, scrapy

Gazpacho

🥫 The simple, fast, and modern web scraping library

Stars: ✭ 525 (+425%)

Mutual labels: scraping, webscraping

Haipproxy

💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis