bandar-logMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-94.48%)
BeamApache Beam is a unified programming model for Batch and Streaming
Stars: ✭ 5,149 (+1322.38%)
HiveApache Hive
Stars: ✭ 4,031 (+1013.54%)
AthenaxSQL-based streaming analytics platform at scale
Stars: ✭ 1,178 (+225.41%)
CalciteApache Calcite
Stars: ✭ 2,816 (+677.9%)
Flink ShadedApache Flink shaded artifacts repository
Stars: ✭ 67 (-81.49%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-51.1%)
Pulsar FlinkElastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (-65.19%)
Presto Go ClientA Presto client for the Go programming language.
Stars: ✭ 183 (-49.45%)
Alchemy给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群
Stars: ✭ 264 (-27.07%)
Kamu CliNext generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-80.94%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+8634.25%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-58.56%)
RegistrySchema Registry
Stars: ✭ 184 (-49.17%)
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+1422.93%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-0.28%)
Fiflowflink-sql 在 flink 上运行 sql 和 构建数据流的平台 基于 apache flink 1.10.0
Stars: ✭ 100 (-72.38%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-72.1%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+3479.28%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-93.09%)
litemall-dw基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-90.06%)
fdp-modelserverAn umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-87.02%)
FlinkApache Flink is an open source project of The Apache Software Foundation (ASF).
The Apache Flink project originated from the Stratosphere research project.
Stars: ✭ 17,781 (+4811.88%)
IgniteApache Ignite
Stars: ✭ 4,027 (+1012.43%)
Flinkstreamsql基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (+364.64%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-40.33%)
Flink Sql CookbookThe Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (-47.79%)
QuicksqlA Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+403.04%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-31.77%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (-58.29%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+412.71%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-94.75%)
PhoenixMirror of Apache Phoenix
Stars: ✭ 867 (+139.5%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+5725.69%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1165.47%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (+798.9%)
Flask AppbuilderSimple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more. Demo (login with guest/welcome) - http://flaskappbuilder.pythonanywhere.com/
Stars: ✭ 3,603 (+895.3%)
Superboot随着技术日新月异,新技术新平台不断出现,对现如今的开发人员来说选择快速高效的框架进行项目开发,既能提高产出,又能节约时间。本框架无需开发即可实现服务注册、服务发现、负载均衡、服务网关、配置中心、API管理、分布式事务、支撑平台、集成框架、数据传输加密等功能,是学习SpringCloud整体业务模式的完整示例,并且可以直接用于生产环境
Stars: ✭ 341 (-5.8%)
Syntax ParserLight and fast 🚀parser! With zero dependents. - Sql Parser Demo added!
Stars: ✭ 317 (-12.43%)
Automigrateversion your SQL schemas with git + automatically migrate them
Stars: ✭ 318 (-12.15%)
SqlalchemyThe Database Toolkit for Python
Stars: ✭ 4,637 (+1180.94%)
Monetdb OldThis is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on github. The regression test results can be found on the MonetDB Testweb http://monetdb.cwi.nl/testweb/web/status.php .For contributions please see: https://www.monetdb.org/Developers
Stars: ✭ 317 (-12.43%)
Mongo SqlAn extensible SQL generation library for JavaScript with a focus on introspectibility
Stars: ✭ 314 (-13.26%)
Go SqlmockSql mock driver for golang to test database interactions
Stars: ✭ 4,003 (+1005.8%)
TezApache Tez
Stars: ✭ 313 (-13.54%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+816.57%)
BigtopMirror of Apache Bigtop
Stars: ✭ 356 (-1.66%)
VespaThe open big data serving engine. https://vespa.ai
Stars: ✭ 3,747 (+935.08%)
OzoneScalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (-8.84%)
Php Sql Query BuilderAn elegant lightweight and efficient SQL Query Builder with fluid interface SQL syntax supporting bindings and complicated query generation.
Stars: ✭ 313 (-13.54%)
Themis数据库审核平台
Stars: ✭ 313 (-13.54%)
Rainbow csv🌈Rainbow CSV - Vim plugin: Highlight columns in CSV and TSV files and run queries in SQL-like language
Stars: ✭ 337 (-6.91%)
Uproot3ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (-13.81%)