DotnetCrawler is a straightforward, lightweight web crawling/scraping library based on .NET Core, with Entity Framework Core output. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible for your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-31.97%)

Mutual labels: crawler, crawling

Scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Stars: ✭ 42,343 (+28704.76%)

Mutual labels: crawler, crawling

Qqmusicspider

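A Scrapy-based QQ Music crawler (QQ Music Spider) that scrapes song information, lyrics, featured comments, and more. It also shares a music corpus of 490,000+ items from the top 6,400 mainland Chinese, Hong Kong, and Taiwanese singers on QQ Music.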

Stars: ✭ 120 (-18.37%)

Mutual labels: crawler

4chan Downloader

Python 3 script to continuously download all images/webms from multiple 4chan threads simultaneously, without installation

Stars: ✭ 136 (-7.48%)

Mutual labels: crawler

Awesome Puppeteer

A curated list of awesome puppeteer resources.

Stars: ✭ 1,728 (+1075.51%)

Mutual labels: crawling

Skill Share Crawler Dl

Download Skillshare videos by ID or by class

Stars: ✭ 122 (-17.01%)

Mutual labels: crawler

Onegram

This repository is no longer maintained.

Stars: ✭ 137 (-6.8%)

Mutual labels: crawler

Pspider

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

Stars: ✭ 1,611 (+995.92%)

Mutual labels: crawler

Massivedl

Download a large list of files concurrently

Stars: ✭ 141 (-4.08%)

Mutual labels: crawling

Tiebamanager

(Abandoned) A moderation tool for Baidu Tieba that automatically scans posts and handles rule-violating ones

Stars: ✭ 119 (-19.05%)

Mutual labels: crawler

Goclone

Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.

Stars: ✭ 134 (-8.84%)

Mutual labels: crawler

Php Crawler

A PHP crawler that finds emails on the internet

Stars: ✭ 119 (-19.05%)

Mutual labels: crawler

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (-2.04%)

Mutual labels: crawler

Oddish

Crawls all CS:GO skins from a website.

Stars: ✭ 139 (-5.44%)

Mutual labels: crawler

Free proxy website

A collection of websites for obtaining free SOCKS/HTTPS/HTTP proxies

Stars: ✭ 119 (-19.05%)

Mutual labels: crawler

Sentinel Crawler

Xenomorph Crawler, a concise, declarative, and observable distributed crawler (Node / Go / Java / Rust) for Web, RDB, and OS. It can also act as a monitor (with Prometheus) or as ETL for infrastructure 💫 Multi-language executor, distributed crawler

Stars: ✭ 118 (-19.73%)

Mutual labels: crawler

Docs

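Source code for "Data Collection: From Getting Started to Giving Up". Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK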

Stars: ✭ 118 (-19.73%)

Mutual labels: crawler

Red hawk

All-in-one tool for information gathering, vulnerability scanning, and crawling. A must-have tool for all penetration testers

Stars: ✭ 1,898 (+1191.16%)

Mutual labels: crawler

Moodle Downloader 2

A Moodle downloader that quickly downloads course content from Moodle (e.g. lecture PDFs)

Stars: ✭ 118 (-19.73%)

Mutual labels: crawler

Decryptlogin

APIs for logging in to some websites using requests.