Awesome Bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (-5.54%)
the-apache-ignite-book: All code samples, scripts and more in-depth examples for The Apache Ignite Book. Covers Apache Ignite 2.6 or above.
Stars: ✭ 65 (-99.41%)
greycat: GreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (-99.06%)
makinage: Stream Processing Made Easy
Stars: ✭ 31 (-99.72%)
Saber: Window-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-99.68%)
Dpark: Python clone of Spark, a MapReduce-like framework in Python
Stars: ✭ 2,668 (-75.95%)
Go Streams: A lightweight stream processing library for Go
Stars: ✭ 615 (-94.46%)
blockchain-etl-streaming: Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-99.49%)
Benthos: Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (-66.6%)
Gsf: Grid Solutions Framework
Stars: ✭ 106 (-99.04%)
Countly Sdk Cordova: Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-99.38%)
Hudi: Upserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (-76.69%)
Gearpump: Lightweight real-time big data streaming engine over Akka
Stars: ✭ 745 (-93.28%)
Nsdb: Natural Series Database
Stars: ✭ 49 (-99.56%)
richflow: A Node.js and JavaScript library for synchronous data pipeline processing, data sharing and stream processing. Actionable and transformable pipeline data processing.
Stars: ✭ 17 (-99.85%)
mxfactorial: A payment application intended for deployment by the United States Treasury
Stars: ✭ 36 (-99.68%)
godsend: A simple and eloquent workflow for streaming messages to micro-services.
Stars: ✭ 15 (-99.86%)
Machine: Machine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-99.3%)
Vectorsql: VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.
Stars: ✭ 171 (-98.46%)
frizzle: The magic message bus
Stars: ✭ 14 (-99.87%)
openPDC: Open Source Phasor Data Concentrator
Stars: ✭ 109 (-99.02%)
csvplus: csvplus extends the standard Go encoding/csv package with a fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-99.4%)
mediapipe plus: The purpose of this project is to apply mediapipe to more AI chips.
Stars: ✭ 38 (-99.66%)
dockerfiles: Multiple Docker container images for the main big data tools (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ...).
Stars: ✭ 29 (-99.74%)
twitter-stream-api: 🐤 Another Twitter stream PHP library to retrieve filtered tweets on hot.
Stars: ✭ 11 (-99.9%)
beaker: A distributed, transactional key-value store.
Stars: ✭ 63 (-99.43%)
StreamBench: Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark
Stars: ✭ 52 (-99.53%)
datart: Datart is a next-generation open platform for data visualization
Stars: ✭ 1,042 (-90.61%)
flink-learn: Learning Flink: Flink CEP, Flink Core, Flink SQL
Stars: ✭ 70 (-99.37%)
product-sp: An open source, cloud-native streaming data integration and analytics product optimized for agile digital businesses
Stars: ✭ 80 (-99.28%)
zdh web: Big data collection and extraction platform
Stars: ✭ 292 (-97.37%)
cdc: A library for performing Content-Defined Chunking (CDC) on data streams.
Stars: ✭ 18 (-99.84%)
go-rivers: Collection of stream processing / multiplexing / networking libs in Go
Stars: ✭ 35 (-99.68%)
nebula-graph: A distributed, fast open-source graph database featuring horizontal scalability and high availability. This is an archived repo for v2.5 only; from 2.6.0 onward, NebulaGraph switched back to https://github.com/vesoft-inc/nebula
Stars: ✭ 833 (-92.49%)
jhdf: A pure Java HDF5 library
Stars: ✭ 83 (-99.25%)
kafka-workers: Kafka Workers is a client library that unifies consuming records from Kafka and processing them with user-defined WorkerTasks.
Stars: ✭ 30 (-99.73%)
columnify: Convert record-oriented data to columnar format.
Stars: ✭ 28 (-99.75%)
Pisces: Pisces is a time series database, desktop application, command line tool, and webapp. Pisces is designed to organize, graph, and analyze natural resource data that varies with time: gauge height, river flow, water temperature, etc.
Stars: ✭ 35 (-99.68%)
datacatalog-tag-manager: Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources; currently supports the CSV file format
Stars: ✭ 17 (-99.85%)
amas: Amas is a recursive acronym for “Amas, monitor alert system”.
Stars: ✭ 77 (-99.31%)
meetups-archivos: Slides, code and videos from the meetups, data science days, video calls and workshops. Data Science Research is a non-profit organization that seeks to spread, decentralize and disseminate knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops and Semilleros …
Stars: ✭ 60 (-99.46%)
kafka-shell: ⚡ A supercharged, interactive Kafka shell built on top of the existing Kafka CLI tools.
Stars: ✭ 107 (-99.04%)
google-sheets-etl: Live-import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-99.86%)
dt-sql-parser: SQL parsers for big data, built with ANTLR4.
Stars: ✭ 135 (-98.78%)
Notes: Learning notes | Java basics, JVM, source code, big data, interview experiences
Stars: ✭ 69 (-99.38%)
Data-Wrangling-with-Python: Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-99.19%)
gretel-python-client: The Gretel Python Client allows you to interact with the Gretel REST API.
Stars: ✭ 28 (-99.75%)
analyzing-reddit-sentiment-with-aws: Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing Reddit comments in real time. 100-200 level tutorial.
Stars: ✭ 40 (-99.64%)