Geeksforgeeks.pdfTopic wise PDFs of Geeks for Geeks articles. (Last updated in October 2018)
Stars: ✭ 489 (+1428.13%)
cnn-proxySubdomain method that proxies websockets, XMLHttpRequests, and more.
Stars: ✭ 13 (-59.37%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+1275%)
revpReverse HTTP proxy that works on Linux, Windows, and macOS. Made with C++ and Boost.
Stars: ✭ 80 (+150%)
LookylooLookyloo is a web interface that allows users to capture a website page and then display a tree of domains that call each other.
Stars: ✭ 381 (+1090.63%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (+884.38%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+434.38%)
ScrapeBotA Selenium-driven tool for automated website interaction and scraping.
Stars: ✭ 16 (-50%)
KatanaA Python Tool For google Hacking
Stars: ✭ 355 (+1009.38%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+12640.63%)
ProxyCheckerproxy checker to check the status of the ip-port proxy list
Stars: ✭ 24 (-25%)
PythonScrapyBasicSetupBasic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+78.13%)
anime-scraper[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3
Stars: ✭ 21 (-34.37%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-31.25%)
Requests HtmlPythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+38237.5%)
LambdasoupFunctional HTML scraping and rewriting with CSS in OCaml
Stars: ✭ 280 (+775%)
archeAnalyze scraped data
Stars: ✭ 49 (+53.13%)
MechanizeMechanize is a ruby library that makes automated web interaction easy.
Stars: ✭ 4,158 (+12893.75%)
rlbRedirecting Load Balancer
Stars: ✭ 30 (-6.25%)
instagram explorer📷 An app to scrap instagram posts and analyze data.
Stars: ✭ 17 (-46.87%)
devproxyA local development http proxy with hosts spoofing written in Go
Stars: ✭ 35 (+9.38%)
JD Spider👍 京东爬虫(大量注释,对刚入门爬虫者极度友好)
Stars: ✭ 56 (+75%)
ogpParserOpen Graph Protocol Parser for Node.js
Stars: ✭ 43 (+34.38%)
AutohomeUsing Scrapy to crawl Autohome, storage into MonogDB, simple analysis and NLP coming soon
Stars: ✭ 23 (-28.12%)
Secret AgentThe web browser that's built for scraping.
Stars: ✭ 151 (+371.88%)
github-languagesTiny little ruby on rails website that crawls though your public github repos to find out what your favourite languages are.
Stars: ✭ 23 (-28.12%)
copycatA PHP Scraping Class
Stars: ✭ 70 (+118.75%)
Market-Trend-PredictionThis is a project of build knowledge graph course. The project leverages historical stock price, and integrates social media listening from customers to predict market Trend On Dow Jones Industrial Average (DJIA).
Stars: ✭ 57 (+78.13%)
webdextIntelligent Web Data Extractor
Stars: ✭ 75 (+134.38%)
PyLexPerform lexical analysis on words, one word at a time.
Stars: ✭ 60 (+87.5%)
lgcrawlpython+scrapy+splash 爬取拉勾全站职位信息
Stars: ✭ 22 (-31.25%)
XqueryExtract data or evaluate value from HTML/XML documents using XPath
Stars: ✭ 155 (+384.38%)
Free-ProxyHi there will be a lot of proxies here.
Stars: ✭ 135 (+321.88%)
SerpscrapSEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
Stars: ✭ 153 (+378.13%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-53.12%)
google-scraperThis class can retrieve search results from Google.
Stars: ✭ 33 (+3.13%)
docker-selenium-lambdaThe simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+437.5%)
dustArchive web pages with all relevant assets or save as a single file HTML
Stars: ✭ 19 (-40.62%)
SmartGWDomain based VPN Gateway/Proxy for all devices
Stars: ✭ 49 (+53.13%)
Shadow UseragentPick the most common user-agents on the Internet 👻
Stars: ✭ 147 (+359.38%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+221.88%)
itemadapterCommon interface for data container classes
Stars: ✭ 47 (+46.88%)
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+362.5%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+356.25%)
scrapy-wayback-machineA Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 92 (+187.5%)
SqrapeSimple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (+350%)
EmbedGet info from any web service or page
Stars: ✭ 1,808 (+5550%)
ChampA Telegram bot combined with python to serve some basic functions like weather, music charts, cricket score and much more.
Stars: ✭ 22 (-31.25%)
Mimo-CrawlerA web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-31.25%)