All Projects → awesome-bigdata → Similar Projects or Alternatives

472 Open source projects that are alternatives of or similar to awesome-bigdata

Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (-5.54%)
the-apache-ignite-book
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (-99.41%)
greycat
GreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (-99.06%)
makinage
Stream Processing Made Easy
Stars: ✭ 31 (-99.72%)
Saber
Window-Based Hybrid CPU/GPU Stream Processing Engine
Stars: ✭ 35 (-99.68%)
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (-75.95%)
Mutual labels:  bigdata, stream-processing
Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (-94.46%)
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-99.55%)
Mutual labels:  bigdata, stream-processing
Real Time Sentiment Tracking On Twitter For Brand Improvement And Trend Recognition
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
Stars: ✭ 127 (-98.86%)
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-99.49%)
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (-66.6%)
Gsf
Grid Solutions Framework
Stars: ✭ 106 (-99.04%)
Kafka Streams In Action
Source code for the Kafka Streams in Action Book
Stars: ✭ 167 (-98.49%)
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-99.38%)
Mutual labels:  bigdata, data-analytics
Hudi
Upserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (-76.69%)
Mutual labels:  bigdata, stream-processing
Gearpump
Lightweight real-time big data streaming engine over Akka
Stars: ✭ 745 (-93.28%)
Mutual labels:  bigdata, stream-processing
Nsdb
Natural Series Database
Stars: ✭ 49 (-99.56%)
Mutual labels:  data-analytics, streaming-data
Hudi Resources
汇总Apache Hudi相关资料
Stars: ✭ 79 (-99.29%)
Mutual labels:  bigdata, stream-processing
richflow
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Stars: ✭ 17 (-99.85%)
Mutual labels:  data-stream, streaming-data
mxfactorial
a payment application intended for deployment by the united states treasury
Stars: ✭ 36 (-99.68%)
godsend
A simple and eloquent workflow for streaming messages to micro-services.
Stars: ✭ 15 (-99.86%)
Machine
Machine is a workflow/pipeline library for processing data
Stars: ✭ 78 (-99.3%)
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (-96.42%)
Vectorsql
VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.
Stars: ✭ 171 (-98.46%)
frizzle
The magic message bus
Stars: ✭ 14 (-99.87%)
openPDC
Open Source Phasor Data Concentrator
Stars: ✭ 109 (-99.02%)
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-99.4%)
Mutual labels:  stream-processing
mediapipe plus
The purpose of this project is to apply mediapipe to more AI chips.
Stars: ✭ 38 (-99.66%)
Mutual labels:  stream-processing
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-99.74%)
Mutual labels:  bigdata
twitter-stream-api
🐤 Another Twitter stream PHP library to retrieve filtered tweets on hot.
Stars: ✭ 11 (-99.9%)
Mutual labels:  streaming-data
beaker
A distributed, transactional key-value store.
Stars: ✭ 63 (-99.43%)
Mutual labels:  distributed-database
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-99.72%)
Mutual labels:  data-analytics
StreamBench
Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark
Stars: ✭ 52 (-99.53%)
Mutual labels:  bigdata
bigdata-tech-index
Big Data Technology Index
Stars: ✭ 24 (-99.78%)
Mutual labels:  bigdata
datart
Datart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (-90.61%)
Mutual labels:  data-analytics
flink-connectors
Apache Flink connectors for Pravega.
Stars: ✭ 84 (-99.24%)
Mutual labels:  stream-processing
flink-learn
Learning Flink : Flink CEP,Flink Core,Flink SQL
Stars: ✭ 70 (-99.37%)
Mutual labels:  bigdata
product-sp
An open source, cloud-native streaming data integration and analytics product optimized for agile digital businesses
Stars: ✭ 80 (-99.28%)
Mutual labels:  stream-processing
zdh web
大数据采集,抽取平台
Stars: ✭ 292 (-97.37%)
Mutual labels:  bigdata
cdc
A library for performing Content-Defined Chunking (CDC) on data streams.
Stars: ✭ 18 (-99.84%)
Mutual labels:  data-stream
go-rivers
Collection of stream processing / multiplexing / networking libs in Go
Stars: ✭ 35 (-99.68%)
Mutual labels:  stream-processing
nebula-graph
A distributed, fast open-source graph database featuring horizontal scalability and high availability. This is an archived repo for v2.5 only, from 2.6.0 +, NebulaGraph switched back to https://github.com/vesoft-inc/nebula
Stars: ✭ 833 (-92.49%)
Mutual labels:  distributed-database
jhdf
A pure Java HDF5 library
Stars: ✭ 83 (-99.25%)
Mutual labels:  bigdata
kafka-workers
Kafka Workers is a client library which unifies records consuming from Kafka and processing them by user-defined WorkerTasks.
Stars: ✭ 30 (-99.73%)
Mutual labels:  stream-processing
columnify
Make record oriented data to columnar format.
Stars: ✭ 28 (-99.75%)
Mutual labels:  bigdata
Pisces
Pisces is a time series database, desktop application, command line tool, and webapp. Pisces is designed to organize, graph, and analyze natural resource data that varies with time: gauge height, river flow, water temperature, etc.
Stars: ✭ 35 (-99.68%)
Mutual labels:  series-database
awesome-bigquery-views
Useful SQL queries for Blockchain ETL datasets in BigQuery.
Stars: ✭ 325 (-97.07%)
Mutual labels:  data-analytics
datacatalog-tag-manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Stars: ✭ 17 (-99.85%)
Mutual labels:  bigdata
amas
Amas is recursive acronym for “Amas, monitor alert system”.
Stars: ✭ 77 (-99.31%)
Mutual labels:  bigdata
meetups-archivos
Ppts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (-99.46%)
Mutual labels:  bigdata
kafka-shell
⚡A supercharged, interactive Kafka shell built on top of the existing Kafka CLI tools.
Stars: ✭ 107 (-99.04%)
Mutual labels:  stream-processing
shardingsphere-ui
Distributed database middleware
Stars: ✭ 41 (-99.63%)
Mutual labels:  distributed-database
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-99.86%)
Mutual labels:  data-warehouse
dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (-98.78%)
Mutual labels:  bigdata
Notes
This is a learning note | Java基础,JVM,源码,大数据,面经
Stars: ✭ 69 (-99.38%)
Mutual labels:  bigdata
163-bigdate-note
bigdata note
Stars: ✭ 38 (-99.66%)
Mutual labels:  bigdata
Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-99.19%)
Mutual labels:  data-analytics
gretel-python-client
The Gretel Python Client allows you to interact with the Gretel REST API.
Stars: ✭ 28 (-99.75%)
Mutual labels:  stream-processing
2019 egu workshop jupyter notebooks
Short course on interactive analysis of Big Earth Data with Jupyter Notebooks
Stars: ✭ 29 (-99.74%)
Mutual labels:  bigdata
analyzing-reddit-sentiment-with-aws
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level tutorial.
Stars: ✭ 40 (-99.64%)
Mutual labels:  data-stream
1-60 of 472 similar projects