Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (+565.52%)
Mutual labels: analytics, big-data
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+15696.55%)
Mutual labels: analytics, big-data
HyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+748.28%)
Mutual labels: analytics, big-data
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (+248.28%)
Mutual labels: analytics, big-data
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+18193.1%)
Mutual labels: analytics, big-data
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+417.24%)
Mutual labels: analytics, big-data
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+227.59%)
Mutual labels: big-data, analytics
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+72620.69%)
Mutual labels: analytics, big-data
Beeva Best PracticesBest Practices and Style Guides in BEEVA
Stars: ✭ 335 (+1055.17%)
Mutual labels: analytics, big-data
DeltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+13358.62%)
Mutual labels: analytics, big-data
Dremio OssDremio - the missing link in modern data
Stars: ✭ 862 (+2872.41%)
Mutual labels: analytics, big-data
Sciblog supportSupport content for my blog
Stars: ✭ 694 (+2293.1%)
Mutual labels: analytics, big-data
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+234.48%)
Mutual labels: analytics, big-data
FiliEasily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.
Stars: ✭ 151 (+420.69%)
Mutual labels: analytics, big-data
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (+137.93%)
Mutual labels: analytics, big-data
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (+11120.69%)
Mutual labels: analytics, big-data
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (+2072.41%)
Mutual labels: analytics, big-data
Rakam Api📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)
Stars: ✭ 772 (+2562.07%)
Mutual labels: analytics, big-data
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-51.72%)
Mutual labels: analytics
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-62.07%)
Mutual labels: big-data