DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (+203.03%)

Mutual labels: crawling

crawlzone

Crawlzone is a fast asynchronous internet crawling framework for PHP.

Stars: ✭ 70 (+112.12%)

Mutual labels: crawling-framework

Python Crawling Tutorial

Python crawling tutorial

Stars: ✭ 57 (+72.73%)

Mutual labels: crawling

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (+60.61%)

Mutual labels: crawling

MoalemYar

A personal project for class management, using various technologies like WPF, Entityframwork, CodeFirst, Sqlite, Migration and more

Stars: ✭ 53 (+60.61%)

Mutual labels: wpf-application

Scrapyrt

HTTP API for Scrapy spiders

Stars: ✭ 637 (+1830.3%)

Mutual labels: crawling

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+15442.42%)

Mutual labels: crawling

Memorious

Distributed crawling framework for documents and structured data.

Stars: ✭ 248 (+651.52%)

Mutual labels: crawling

xXx dead xXx

b̶̡̪̬͒l̸̰̗̝̀ỏ̷̡̩g̴͇̑g̶̲̱̽͐i̵̹͗n̶̤̥͂̅̆g̴̮̾̅͜ ̷̧͎͆i̷̛͒͜͠n̸̥̺͒ ̶͚͚͊̿͜t̸̺͙̭̆̊̈́ḧ̶̟́̐e̸̱͔̟̓̓͝ ̶̨͔̾͛̑d̵̥̣̏ȧ̷̼̊r̷̰̝̥̅̌͝k̵̟̥̞̉̍͛

Stars: ✭ 19 (-42.42%)

Mutual labels: crawling

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+46975.76%)

Mutual labels: crawling

telegram-crawler

🕷 Automatically detect changes made to the official Telegram sites, clients and servers.

Stars: ✭ 84 (+154.55%)

Mutual labels: crawling

Nutch

Apache Nutch is an extensible and scalable web crawler

Stars: ✭ 2,277 (+6800%)

Mutual labels: crawling

tech-seo-crawler

Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.

Stars: ✭ 57 (+72.73%)

Mutual labels: crawling

pumba

Fetch, store and access user agent strings for different browsers

Stars: ✭ 12 (-63.64%)

Mutual labels: crawling

Scrapy Selenium

Scrapy middleware to handle javascript pages using selenium

Stars: ✭ 550 (+1566.67%)

Mutual labels: crawling

Crawler

Go process used to crawl websites

Stars: ✭ 147 (+345.45%)

Mutual labels: crawling

TimeRecorder

工数管理アプリ

Stars: ✭ 51 (+54.55%)

Mutual labels: wpf-application

Instagram Bot

An Instagram bot developed using the Selenium Framework

Stars: ✭ 138 (+318.18%)

Mutual labels: crawling

Simple.Wpf.DataGrid

An experiment to build a data grid (blotter) in WPF without using any third party libaries

Stars: ✭ 64 (+93.94%)

Mutual labels: wpf-application

Bhban rpa

6개월 치 업무를 하루 만에 끝내는 업무 자동화(생능출판사, 2020)의 예제 코드입니다. 파이썬을 한 번도 배워본 적 없는 분들을 위한 예제이며, 엑셀부터 디자인, 매크로, 크롤링까지 업무 자동화와 관련된 다양한 분야 예제가 제공됩니다.

Stars: ✭ 124 (+275.76%)

Mutual labels: crawling

core

The complete web scraping toolkit for PHP.

Stars: ✭ 1,110 (+3263.64%)

Mutual labels: crawling

Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

Stars: ✭ 125 (+278.79%)

Mutual labels: crawling

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (+57.58%)

Mutual labels: crawling

Scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Stars: ✭ 42,343 (+128212.12%)

Mutual labels: crawling

podcastcrawler

PHP library to find podcasts

Stars: ✭ 40 (+21.21%)

Mutual labels: crawling

Grawler

Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.

Stars: ✭ 98 (+196.97%)

Mutual labels: crawling

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-48.48%)

Mutual labels: crawling

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (+106.06%)

Mutual labels: crawling

puppet-master

Puppeteer as a service hosted on Saasify.