KyuubiKyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (+17.1%)
QuillCompile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+544.52%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-51.61%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1377.74%)
EventqlDistributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (+261.61%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+8546.13%)
React Native Firebase🔥 A well-tested feature-rich modular Firebase implementation for React Native. Supports both iOS & Android platforms for all Firebase services.
Stars: ✭ 9,674 (+3020.65%)
AresdbA GPU-powered real-time analytics storage and query engine.
Stars: ✭ 2,814 (+807.74%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+6702.9%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+33.23%)
DeltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+1159.03%)
JaydebeapiJayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.
Stars: ✭ 247 (-20.32%)
TensorbaseTensorBase BE is building a high performance, cloud neutral bigdata warehouse for SMEs fully in Rust.
Stars: ✭ 440 (+41.94%)
AnalyticsSimple, open-source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.
Stars: ✭ 9,469 (+2954.52%)
QuerytreeData reporting and visualization for your app
Stars: ✭ 230 (-25.81%)
LocustdbMassively parallel, high performance analytics database that will rapidly devour all of your data.
Stars: ✭ 1,250 (+303.23%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+10099.35%)
Cube.js📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+3765.48%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-68.71%)
OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-59.03%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+455.16%)
SqlhelperSQL Tools ( Dialect, Pagination, DDL dump, UrlParser, SqlStatementParser, WallFilter, BatchExecutor for Test) based Java. it is easy to integration into any ORM frameworks
Stars: ✭ 242 (-21.94%)
GpdbGreenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Stars: ✭ 4,928 (+1489.68%)
Ormlite JdbcORMLite JDBC functionality that works with JDBC drivers to attach to various database types
Stars: ✭ 184 (-40.65%)
NsdbNatural Series Database
Stars: ✭ 49 (-84.19%)
SkyaltAccessible database and analytics. Organize and learn from data without engineers.
Stars: ✭ 40 (-87.1%)
Shorty🔗 A URL shortening service built using Flask and MySQL
Stars: ✭ 78 (-74.84%)
NormAccess a database in one line of code.
Stars: ✭ 152 (-50.97%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (-58.39%)
DuckdbDuckDB is an in-process SQL OLAP Database Management System
Stars: ✭ 4,014 (+1194.84%)
DatabazelThe analytical and reporting solution for MongoDB
Stars: ✭ 118 (-61.94%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+6399.03%)
HyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (-20.65%)
SparkleHaskell on Apache Spark.
Stars: ✭ 419 (+35.16%)
DoobieFunctional JDBC layer for Scala.
Stars: ✭ 1,910 (+516.13%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-95.48%)
SnappydataProject SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (+220.97%)
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+1678.39%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (+949.68%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+3570.32%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+65.48%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-69.35%)
growthbookOpen Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+655.48%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-30.32%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+649.35%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+693.23%)
Sqliorm sql interface, Criteria, CriteriaBuilder, ResultMapBuilder
Stars: ✭ 1,644 (+430.32%)
Sqlite JdbcSQLite JDBC Driver
Stars: ✭ 1,961 (+532.58%)
OpaqueAn encrypted data analytics platform
Stars: ✭ 129 (-58.39%)
H2databaseH2 is an embeddable RDBMS written in Java.
Stars: ✭ 3,078 (+892.9%)
ConcourseDistributed database warehouse for transactions, search and analytics across time.
Stars: ✭ 310 (+0%)