City ScrapersScrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (+214.29%)
Html MetadataMetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (+84.29%)
User AgentsA JavaScript library for generating random user agents with data that's updated daily.
Stars: ✭ 485 (+592.86%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+562.86%)
30 Days Of PythonLearn Python for the next 30 (or so) Days.
Stars: ✭ 1,748 (+2397.14%)
SelectolaxPython binding to Modest engine (fast HTML5 parser with CSS selectors).
Stars: ✭ 368 (+425.71%)
Short Jokes DatasetPython scripts for building 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (+207.14%)
AcheACHE is a web crawler for domain-specific search.
Stars: ✭ 320 (+357.14%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+2065.71%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+295.71%)
Php Curl ClassPHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+4047.14%)
Scrapyd Cluster On HerokuSet up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉
Stars: ✭ 106 (+51.43%)
comic-scraper[Python] Scraps comics and manga from various websites and creates cbz files from them
Stars: ✭ 16 (-77.14%)
Trump LiesTutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (+187.14%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-31.43%)
PulsarTurn large Web sites into tables and charts using simple SQLs.
Stars: ✭ 100 (+42.86%)
wayback⏪ Tools to Work with the Various Internet Archive Wayback Machine APIs
Stars: ✭ 52 (-25.71%)
WebmiddleNode.js framework for modular web scraping and data extraction
Stars: ✭ 13 (-81.43%)
Splashr💦 Tools to Work with the 'Splash' JavaScript Rendering Service in R
Stars: ✭ 93 (+32.86%)
PaperScraperA web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
Stars: ✭ 63 (-10%)
Twitter IntelligenceTwitter Intelligence OSINT project performs tracking and analysis of the Twitter
Stars: ✭ 179 (+155.71%)
linkextractorA Docker tutorial using a link extraction application example
Stars: ✭ 41 (-41.43%)
HumanoidNode.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (+25.71%)
halfstaff🇺🇸 Is the US flag at half-staff?
Stars: ✭ 22 (-68.57%)
lopezCrawling and scraping the Web for fun and profit
Stars: ✭ 20 (-71.43%)
investigation-amazon-brandsMaterials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"
Stars: ✭ 56 (-20%)
RvestSimple web scraping for R
Stars: ✭ 1,253 (+1690%)
actor-scraperHouse of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Stars: ✭ 83 (+18.57%)
heroshiHeroshi – open source web crawler.
Stars: ✭ 51 (-27.14%)
ReaderExtract clean(er), readable text from web pages via Mercury Web Parser.
Stars: ✭ 75 (+7.14%)
tableau-scrapingTableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+30%)
Quora ApiAn unofficial API for Quora.
Stars: ✭ 250 (+257.14%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-2.86%)
OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-78.57%)
DecapitatedHeadless 'Chrome' Orchestration in R
Stars: ✭ 65 (-7.14%)
Node-js-functionalitiesThis repository contains very useful restful API's and functionalities in node-js containing many important tutorial code for mastering node-js, all tutorials have been published on medium.com, tutorials link is given below
Stars: ✭ 69 (-1.43%)
HiA Programming language for Web Scraping
Stars: ✭ 14 (-80%)
WaWebSessionHandler(DISCONTINUED) Save WhatsApp Web Sessions as files and open them everywhere!
Stars: ✭ 27 (-61.43%)
InstagoDownload/access photos, videos, stories, story highlights, postlives, following and followers of Instagram
Stars: ✭ 59 (-15.71%)
browser-poolA Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+1.43%)
Web ScrapingDetailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (+118.57%)
iwwAI based web-wrapper for web-content-extraction
Stars: ✭ 61 (-12.86%)
Project TauroA Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-25.71%)
htmlunit🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library
Stars: ✭ 39 (-44.29%)
Wayback Machine ScraperA command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 230 (+228.57%)
2017-summer-workshopExercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (-52.86%)
xharnessC# command line tool for running tests on Android / iOS / tvOS devices and simulators
Stars: ✭ 123 (+75.71%)
jdi-darkPowerful Framework for Backend Automation Testing on Java (Rest, Soap, WebSocket)
Stars: ✭ 36 (-48.57%)
DocbaoCông cụ quét và phân tích từ khoá các trang báo mạng Việt Nam
Stars: ✭ 230 (+228.57%)
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+111.43%)
Letterboxd recommendationsScraping publicly-accessible Letterboxd data and creating a movie recommendation model with it that can generate recommendations when provided with a Letterboxd username
Stars: ✭ 23 (-67.14%)