Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.

✭ 202

python deep-learning search-engine download scraping bing

Antch

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

✭ 198

go golang framework crawler scraping crawling web-crawler

Jsonframe Cheerio

simple multi-level scraper json input/output for Cheerio

✭ 196

javascript json scraper scraping selector frame

Juriscraper

An API to scrape American court websites for metadata.

✭ 194

html scraping government

Anime Dl

Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.

✭ 190

python web automation scraper anime scraping

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

✭ 171

typescript nodejs json crawler spider expressjs scraper puppeteer scraping linkedin crawling

Linkedin Learning Downloader

Linkedin Learning videos downloader

✭ 171

python asyncio scraping aiohttp linkedin

Requests Html

Pythonic HTML Parsing for Humans™

✭ 12,268

python Makefile html http scraping requests beautifulsoup kennethreitz lxml css-selectors pyquery

Secret Agent

The web browser that's built for scraping.

✭ 151

typescript proxy browser puppeteer devtools scraping chromium mitm automated mitmproxy

Xquery

Extract data or evaluate value from HTML/XML documents using XPath

✭ 155

go golang html xml scraping xpath

Serpscrap

SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.

✭ 153

python search research scraper screenshot seo scraping

Shadow Useragent

Pick the most common user-agents on the Internet 👻

✭ 147

python scraping useragent

Phpscraper

PHP Scraper - an highly opinionated web-interface for PHP

✭ 148

scraper scraping web-scraping web-scraper

Fantasy Basketball

Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.

✭ 146

jupyter-notebook machine-learning data-science data-visualization optimization data-mining scraping genetic-algorithm

Sqrape

Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)

✭ 144

go reflection scraping web-scraping magic css-selector

Embed

Get info from any web service or page

✭ 1,808

PHP scraping opengraph embeds oembed twitter-cards

Educative.io Downloader

📖 This tool is to download course from educative.io for offline usage. It uses your login credentials and download the course.

✭ 139

typescript hacktoberfest nodejs pdf puppeteer scraping

Search Engine Google

🕷 Google client for SERPS

✭ 138

google search-engine scraping

Udemycoursegrabber

Your will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!

✭ 137

python selenium scraper scraping udemy

Torchbear

🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity

✭ 128

rust lua scripting android hacktoberfest linux database web data-science markdown cryptography crypto scraping development-environment jinja2 embeddable

Scan For Webcams

scan for webcams on the internet

✭ 128

python security scraping webcam shodan

Htmlsql

htmlSQL is a experimental PHP library which allows you to access HTML values by an SQL like syntax.

✭ 120

scraping

Od Database

Distributed crawler, database and web frontend for public directories indexing

✭ 121

python bootstrap elasticsearch scraping

Awesome Puppeteer

A curated list of awesome puppeteer resources.

✭ 1,728

awesome awesome-list automation puppeteer scraping headless-chrome crawling

Souqscraper

Simple scriptes for Level UP your scraping Skills, and source code for Level UP playlist on Youtube

✭ 118

python scraping beautifulsoup

Seleniumcrawler

An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site

✭ 117

python selenium scraper scrapy scraping selenium-webdriver asp-net

Scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

✭ 42,343

python hacktoberfest framework crawler scraping crawling

Webmagic

A scalable web crawler framework for Java.

✭ 10,186

java HTML javascript kotlin ruby groovy framework crawler scraping

Laravel Bank Statements

Laravel package to collect your bank statements history. Currently support for parsing statements history from BCA, Mandiri, BNI, and MUAMALAT e-banking websites.

✭ 105

laravel-package scraping bank

D4n155

OWASP D4N155 - Intelligent and dynamic wordlist using OSINT

✭ 105

shell tool google crawler osint dynamic scraping wordlist duckduckgo

Languagepod101 Scraper

Python scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨

✭ 104

python language download scraping course requests learn japanese podcast beautifulsoup japanese-language

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

✭ 100

csharp crawler dotnetcore scrapy scraping entity-framework-core webscraping ddd-architecture crawling

Grawler

Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.

✭ 98

automation proxy osint curl scraping crawling

Nintendeals

Library with a set of tools for scraping information about Nintendo games and its prices across all regions (NA, EU and JP).

✭ 94

python games scraping reddit nintendo nintendo-switch

Humanoid

Node.js package to bypass CloudFlare's anti-bot JavaScript challenges

✭ 88

javascript bot scraping web-scraping bypass

Pastepwn

Python framework to scrape Pastebin pastes and analyze them

✭ 87

python hacktoberfest framework osint scraping pastebin

Dataengineeringproject

Example end to end data engineering project.

✭ 82

python hacktoberfest redis mongodb elasticsearch kafka big-data s3 django-rest-framework scraping airflow data-engineering kafka-connect minio

Billy

legacy backend for Open States

✭ 85

python scraping

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

✭ 1,246

go crawler spider scraper scraping

Google Covid19 Mobility Reports

Data extraction of Google's COVID-19 Mobility Reports

✭ 82

html dataset scraping

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

✭ 81

python email scraper scrapy scraping emails email-marketing extraction

Detect Cms

PHP Library for detecting CMS

✭ 78

detection scraping web-scraping web-scraper

Viewstate

ASP.NET View State Decoder

✭ 77

python python3 security dotnet scraping asp-net web-security

1-60 of 229 scraping projects

›