City ScrapersScrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (-65.57%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-27.39%)
Scrapy CraigslistWeb Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (-91.55%)
CascadiaGo cascadia package command line CSS selector
Stars: ✭ 67 (-89.51%)
OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-97.65%)
Php Curl ClassPHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+354.3%)
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (-76.84%)
Html MetadataMetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (-79.81%)
IMDB-ScraperScrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.
Stars: ✭ 37 (-94.21%)
restaurant-finder-featureReviewsBuild a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-96.71%)
Project TauroA Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-91.86%)
Juno crawlerScrapy crawler to collect data on the back catalog of songs listed for sale.
Stars: ✭ 150 (-76.53%)
Linkedin-ClientWeb scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (-93.43%)
HargoHargo is a Go library and command line utility that parses HAR files, can convert to curl format, and serve as a load test driver.
Stars: ✭ 164 (-74.33%)
Detect CmsPHP Library for detecting CMS
Stars: ✭ 78 (-87.79%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+2.66%)
DaftlistingsA library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.
Stars: ✭ 86 (-86.54%)
Web ScrapingDetailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (-76.06%)
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (-62.6%)
Netflix CloneNetflix like full-stack application with SPA client and backend implemented in service oriented architecture
Stars: ✭ 156 (-75.59%)
Scrapyd Cluster On HerokuSet up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉
Stars: ✭ 106 (-83.41%)
scrapy-wayback-machineA Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 92 (-85.6%)
scraping-ebayScraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (-87.64%)
top-github-scraperScape top GitHub repositories and users based on keywords
Stars: ✭ 40 (-93.74%)
uoft-scrapersPublic web scraping scripts for the University of Toronto.
Stars: ✭ 48 (-92.49%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-89.36%)
1c httpПодсистема 1С для работы с HTTP
Stars: ✭ 48 (-92.49%)
MultiHttpThis is a high performance , very useful multi-curl tool written in php. 一个超级好用的并发CURL工具!!!(httpful,restful, concurrency)
Stars: ✭ 79 (-87.64%)
Scrapy RedisRedis-based components for Scrapy.
Stars: ✭ 4,998 (+682.16%)
Rate.sx💰 curl cryptocurrencies exchange rates
Stars: ✭ 563 (-11.89%)
Haipproxy💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Stars: ✭ 4,993 (+681.38%)
Unifi Api ClientA PHP API client class to interact with Ubiquiti's UniFi Controller API
Stars: ✭ 602 (-5.79%)
AsyncpgA fast PostgreSQL Database Client Library for Python/asyncio.
Stars: ✭ 5,216 (+716.28%)
Httpstatcurl statistics made simple
Stars: ✭ 4,991 (+681.06%)
GfGoFrame is a modular, powerful, high-performance and enterprise-class application development framework of Golang.
Stars: ✭ 6,501 (+917.37%)
EdgedbThe next generation relational database.
Stars: ✭ 5,368 (+740.06%)
Http Status CheckCLI tool to crawl a website and check HTTP status codes
Stars: ✭ 512 (-19.87%)
CoolqlcoolNextjs server to query websites with GraphQL
Stars: ✭ 623 (-2.5%)
UrllibRequest HTTP(s) URLs in a complex world
Stars: ✭ 600 (-6.1%)
LibrdkafkaThe Apache Kafka C/C++ library
Stars: ✭ 5,617 (+779.03%)
Orz a high performance, general purpose data compressor written in rust
Stars: ✭ 509 (-20.34%)
Fast floatFast and exact implementation of the C++ from_chars functions for float and double types: 4x faster than strtod
Stars: ✭ 510 (-20.19%)
Yugabyte DbThe high-performance distributed SQL database for global, internet-scale apps.
Stars: ✭ 5,890 (+821.75%)
Xxl RpcA high performance, distributed RPC framework.(分布式服务框架XXL-RPC)
Stars: ✭ 493 (-22.85%)
Sledthe champagne of beta embedded databases
Stars: ✭ 5,423 (+748.67%)
Colferbinary serialization format
Stars: ✭ 597 (-6.57%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+650.08%)
XnifferA swift network profiler built on top of URLSession.
Stars: ✭ 488 (-23.63%)
AnakinHigh performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.
Stars: ✭ 488 (-23.63%)
User AgentsA JavaScript library for generating random user agents with data that's updated daily.
Stars: ✭ 485 (-24.1%)
IcrawlerA multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (-1.56%)
HttpuThe terminal-first http client
Stars: ✭ 619 (-3.13%)