DotnetCrawler is a straightforward, lightweight web crawling/scraping library for .NET Core with Entity Framework Core output. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but can be extended to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

✭ 100

csharp crawler dotnetcore scrapy scraping entity-framework-core webscraping ddd-architecture crawling

Imghash

Perceptual image hashing for Node.js

✭ 98

javascript computer-vision image-processing webscraping

Udemy bot

An automation bot for free Udemy courses

✭ 91

python bot chrome webscraping udemy

Clock

A visual task scheduling system, trimmed down to a single binary file (web visual task scheduler system: yes, just one binary solves all the problems!)

✭ 86

go web scheduler task visual webscraping

Covid 19 jhu data web scrap and cleaning

This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/

✭ 80

python jupyter-notebook pandas webscraping

Instago

Download/access photos, videos, stories, story highlights, postlives, following and followers of Instagram

✭ 59

go golang instagram downloader web-scraping webscraping gopherjs

Keeper Core Api

Nunux Keeper core API

✭ 55

javascript nodejs restful-api content-management webscraping

Fifa Fut Data

Web-scraping script that writes the data of all players from FutHead and FutBin to a CSV file or a DB

✭ 55

python database mysql dataset csv webscraping video-game soccer

Sneakerbot App

App that scrapes the Footlocker website to construct URLs for upcoming sneaker releases and adds the shoe to your cart if it is available. Uses Python and Selenium WebDriver. Chrome and Chromedriver must be installed, and Chromedriver must be on the main path.

✭ 54

python python3 bot chrome bots webscraping

Brokenlinkhijacker

A Fast Broken Link Hijacker Tool written in Python

✭ 45

python scanner reconnaissance bug-bounty webscraping

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

✭ 1,024

python django spider scraper scrapy scraping webscraping

Configs

Public, free-to-use repository with diggers configs for scraping and extracting data from various e-commerce websites and online stores

✭ 37

ecommerce scraping etl webscraping

Redditsfinder

Archive a reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushshift API.

✭ 28

python reddit webscraping

Huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

✭ 33,694

ruby HTML coffeescript shell SCSS Dockerfile automation monitoring twitter notifications scraper rss agent feed webscraping huginn feedgenerator twitter-streaming

Sig To Googlecalendar

A Python script that gets class schedules from UFLA's SIG and converts them to a .CSV file for use in Google Calendar

✭ 14

python webscraping beautifulsoup

Webscrapping

Web crawlers in R; web crawlers in Python; rvest; Rcurl

✭ 9

r webscraping

Datadoubleconfirm

Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.

✭ 24

python r jupyter-notebook data-visualization webscraping statistical-analysis

Mailinglistscraper

A Python web scraper for public email lists.

✭ 19

python spider scraper scrapy webscraping

Gazpacho

🥫 The simple, fast, and modern web scraping library

✭ 525

python hacktoberfest scraping webscraping

Suckit

Suck the InTernet

✭ 429

rust hacktoberfest webscraping

Morph

Take the hassle out of web scraping

✭ 421

ruby docker webscraping

Proxy requests

A class that uses scraped proxies to make HTTP GET/POST requests (Python requests)

✭ 357

python python3 http proxy requests http-proxy proxy-server webscraping recursion proxy-list python-requests

Xidel

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

✭ 335

pascal html cli json web http command-line rest xml scraper curl xpath webscraping data-processing css-selector wget

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

✭ 4,077

python machine-learning automation artificial-intelligence ai crawler scraper scraping web-scraping webscraping scrape webautomation

Rcrawler

An R web crawler and scraper

✭ 274

r crawler scraper webscraping

schedule-tweet

Schedules tweets using TweetDeck

✭ 14

python shell automation twitter scraping selenium webscraping selenium-python

web check

Script for checking changes in webpages

✭ 50

python webscraping graphical-user-interface begginer

Instagram-Scraper-2021

Scrape Instagram content and stories anonymously, using a new technique based on the HAR file (no token and no public API).

✭ 57

Jupyter Notebook python instagram data scraper selenium instagram-feed instagram-scraper instagram-api webscraping instagram-stories browsermob-proxy graphql-api instagram-bot instagram-crawler

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

✭ 68

python Jupyter Notebook Batchfile scraping crawling scrapy webscraping scrapyd webcrawling

amelia 2.0

An Artificial Intelligence Chat Bot and Service Provider written in Python and AIML.

✭ 19

python music weather opencv json camera dictionary chatbot wolfram-alpha imdb aiml artificial-intelligence wikipedia-api webscraping

anikimiapi

A Simple, LightWeight, Statically-Typed Python3 API wrapper for GogoAnime.

✭ 15

python url anime download dub bs4 webscraping sub otaku gogoanime

Utlyz-CLI

Lets you access your FB account from the command line and returns things such as the number of unread notifications, messages, or friend requests you have.