This is a project of build knowledge graph course. The project leverages historical stock price, and integrates social media listening from customers to predict market Trend On Dow Jones Industrial Average (DJIA).

Stars: ✭ 57 (+78.13%)

Mutual labels: web-crawler

webdext

Intelligent Web Data Extractor

Stars: ✭ 75 (+134.38%)

Mutual labels: scraping

PrawWallpaperDownloader

Download images from reddit

Stars: ✭ 18 (-43.75%)

Mutual labels: scraping

PyLex

Perform lexical analysis on words, one word at a time.

Stars: ✭ 60 (+87.5%)

Mutual labels: scraping

lgcrawl

python+scrapy+splash 爬取拉勾全站职位信息

Stars: ✭ 22 (-31.25%)

Mutual labels: scrapy

Xquery

Extract data or evaluate value from HTML/XML documents using XPath

Stars: ✭ 155 (+384.38%)

Mutual labels: scraping

Free-Proxy

Hi there will be a lot of proxies here.

Stars: ✭ 135 (+321.88%)

Mutual labels: proxy-list

Serpscrap

SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.

Stars: ✭ 153 (+378.13%)

Mutual labels: scraping

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-53.12%)

Mutual labels: scraping

google-scraper

This class can retrieve search results from Google.

Stars: ✭ 33 (+3.13%)

Mutual labels: scraping

docker-selenium-lambda

The simplest demo of chrome automation by python and selenium in AWS Lambda

Stars: ✭ 172 (+437.5%)

Mutual labels: scraping

dust

Archive web pages with all relevant assets or save as a single file HTML

Stars: ✭ 19 (-40.62%)

Mutual labels: scraping

SmartGW

Domain based VPN Gateway/Proxy for all devices

Stars: ✭ 49 (+53.13%)

Mutual labels: http-proxy

Shadow Useragent

Pick the most common user-agents on the Internet 👻

Stars: ✭ 147 (+359.38%)

Mutual labels: scraping

scrapy-boilerplate

Scrapy project boilerplate done right

Stars: ✭ 30 (-6.25%)

Mutual labels: scrapy

ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

Stars: ✭ 103 (+221.88%)

Mutual labels: scraping

itemadapter

Common interface for data container classes

Stars: ✭ 47 (+46.88%)

Mutual labels: scrapy

Phpscraper

PHP Scraper - an highly opinionated web-interface for PHP

Stars: ✭ 148 (+362.5%)

Mutual labels: scraping

Fantasy Basketball

Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.

Stars: ✭ 146 (+356.25%)

Mutual labels: scraping

scrapy-wayback-machine

A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

Stars: ✭ 92 (+187.5%)

Mutual labels: scrapy

Sqrape

Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)

Stars: ✭ 144 (+350%)

Mutual labels: scraping

Embed

Get info from any web service or page