DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-31.97%)

Mutual labels: scraping

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+88.44%)

Mutual labels: scraping

Scrapy Cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

Stars: ✭ 921 (+526.53%)

Mutual labels: scraping

schedule-tweet

Schedules tweets using TweetDeck

Stars: ✭ 14 (-90.48%)

Mutual labels: scraping

Fantasy Basketball

Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.

Stars: ✭ 146 (-0.68%)

Mutual labels: scraping

facebook-discussion-tk

A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.

Stars: ✭ 33 (-77.55%)

Mutual labels: scraping

Webhere

HTML scraping for Objective-C.

Stars: ✭ 16 (-89.12%)

Mutual labels: scraping

Nintendeals

Library with a set of tools for scraping information about Nintendo games and its prices across all regions (NA, EU and JP).

Stars: ✭ 94 (-36.05%)

Mutual labels: scraping

Facebook data analyzer

Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more

Stars: ✭ 515 (+250.34%)

Mutual labels: scraping

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-85.03%)

Mutual labels: scraping

Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

Stars: ✭ 789 (+436.73%)

Mutual labels: scraping

flutter ua client hints

Provide User-Agent Client Hints to a Flutter app.

Stars: ✭ 27 (-81.63%)

Mutual labels: useragent

Htmlsql

htmlSQL is a experimental PHP library which allows you to access HTML values by an SQL like syntax.

Stars: ✭ 120 (-18.37%)

Mutual labels: scraping

memes-api

API for scrapping common meme sites

Stars: ✭ 17 (-88.44%)

Mutual labels: scraping

Parsel

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Stars: ✭ 628 (+327.21%)

Mutual labels: scraping

webdext

Intelligent Web Data Extractor

Stars: ✭ 75 (-48.98%)

Mutual labels: scraping

Pastepwn

Python framework to scrape Pastebin pastes and analyze them

Stars: ✭ 87 (-40.82%)

Mutual labels: scraping

PyLex

Perform lexical analysis on words, one word at a time.

Stars: ✭ 60 (-59.18%)

Mutual labels: scraping

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (+296.6%)

Mutual labels: scraping

Zeiver

A Scraper, Downloader, & Recorder for static open directories.

Stars: ✭ 14 (-90.48%)

Mutual labels: scraping

Search Engine Google

🕷 Google client for SERPS

Stars: ✭ 138 (-6.12%)

Mutual labels: scraping

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-89.8%)

Mutual labels: scraping

Tabula

Tabula is a tool for liberating data tables trapped inside PDF files

Stars: ✭ 5,420 (+3587.07%)

Mutual labels: scraping

humanparser

Parse a human name string into salutation, first name, middle name, last name, suffix.

Stars: ✭ 78 (-46.94%)

Mutual labels: scraping

Billy

legacy backend for Open States

Stars: ✭ 85 (-42.18%)

Mutual labels: scraping

dust

Archive web pages with all relevant assets or save as a single file HTML

Stars: ✭ 19 (-87.07%)

Mutual labels: scraping

Browser.php

A PHP Class to detect a user's Browser. This encapsulation provides a breakdown of the browser and the version of the browser using the browser's user-agent string. This is not a guaranteed solution but provides an overall accurate way to detect what browser a user is using.

Stars: ✭ 546 (+271.43%)

Mutual labels: useragent

pomp

Screen scraping and web crawling framework

Stars: ✭ 61 (-58.5%)

Mutual labels: scraping

Awesome Puppeteer

A curated list of awesome puppeteer resources.

Stars: ✭ 1,728 (+1075.51%)

Mutual labels: scraping

Gazpacho

🥫 The simple, fast, and modern web scraping library

Stars: ✭ 525 (+257.14%)

Mutual labels: scraping

Phpscraper

PHP Scraper - an highly opinionated web-interface for PHP

Stars: ✭ 148 (+0.68%)

Mutual labels: scraping

Sqrape

Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)

Stars: ✭ 144 (-2.04%)

Mutual labels: scraping

Udemycoursegrabber

Your will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!

Stars: ✭ 137 (-6.8%)

Mutual labels: scraping

Souqscraper

Simple scriptes for Level UP your scraping Skills, and source code for Level UP playlist on Youtube

Stars: ✭ 118 (-19.73%)

Mutual labels: scraping

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (-44.9%)

Mutual labels: scraping

Facebook Scraper

Scrape Facebook public pages without an API key

Stars: ✭ 499 (+239.46%)

Mutual labels: scraping

61-120 of 251 similar projects

‹

›