kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-98.62%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-98.92%)
bigkubeMinikube for big data with Scala and Spark
Stars: ✭ 16 (-99.31%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-96.08%)
Requeryrequery - modern SQL based query & persistence for Java / Kotlin / Android
Stars: ✭ 3,071 (+32.2%)
ClojureqlClojureQL is superior SQL integration for Clojure
Stars: ✭ 281 (-87.9%)
DeveeldbDeveelDB is a complete SQL database system, primarily developed for .NET/Mono frameworks
Stars: ✭ 80 (-96.56%)
JplusoneTool for automatically detecting and asserting "N+1 SELECT problem" occurrences in JPA-based Spring Boot Java applications, and for finding the origin of JPA-issued SQL statements in general
Stars: ✭ 91 (-96.08%)
FastsqlDatabase rapid development framework for Java.
Stars: ✭ 100 (-95.7%)
HiveApache Hive
Stars: ✭ 4,031 (+73.53%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (-73.53%)
JooqjOOQ is the best way to write SQL in Java
Stars: ✭ 4,695 (+102.11%)
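Bdp DataplatformData platform for big data ecosystem solutions: a big data solution built on big data, data platform, microservices, machine learning, e-commerce, automated operations, DevOps, container deployment platform, data platform collection, data platform storage, data platform computing, data platform development, and data platform application building.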
Stars: ✭ 456 (-80.37%)
RagtimeDatabase-independent migration library
Stars: ✭ 519 (-77.66%)
Presto EthereumPresto Ethereum Connector -- SQL on Ethereum
Stars: ✭ 450 (-80.63%)
Hibernate SpringbootCollection of best practices for Java persistence performance in Spring Boot applications
Stars: ✭ 589 (-74.64%)
SquealyGenerate APIs from SQL Queries
Stars: ✭ 584 (-74.86%)
God Of BigdataFocused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+158.63%)
BigdataguideBig data learning from scratch, including learning videos and interview materials for each stage of big data study
Stars: ✭ 817 (-64.83%)
Mycat2MySQL Proxy using Java NIO based on Sharding SQL, Calcite, simple and fast
Stars: ✭ 750 (-67.71%)
PoliAn easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (-20.36%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-99.48%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-95.35%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-95.35%)
Parquet IndexSpark SQL index for Parquet tables
Stars: ✭ 109 (-95.31%)
Jasync SqlJava & Kotlin Async DataBase Driver for MySQL and PostgreSQL written in Kotlin
Stars: ✭ 1,092 (-52.99%)
Scala Db CodegenScala code/boilerplate generator from a db schema
Stars: ✭ 49 (-97.89%)
OblectoOblecto is a media server, which streams media you already own, and is designed to be at the heart of your entertainment experience. It runs on your home server to index and analyze your media such as Movies and TV Shows and presents them in an interface tailored for your media consumption needs.
Stars: ✭ 67 (-97.12%)
Kamu CliNext generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-97.03%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-96.9%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (-75.42%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-96.47%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-42.4%)
RepositoryPersonal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-96.04%)
ContactsA Flutter project implementing a Contacts app in 4 ways (API, Custom, Preferences and Sqflite).
Stars: ✭ 100 (-95.7%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-57.55%)
Php Thrift SqlA PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-95.39%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-95.48%)
Haproxy Configs80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-95.44%)
Java CourseSelf paced course for Java Engineers
Stars: ✭ 103 (-95.57%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (-29.83%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-30.31%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (-30.69%)
PyhivePython interface to Hive and Presto. 🐝
Stars: ✭ 1,378 (-40.68%)
SpecqlAutomatic PostgreSQL CRUD queries
Stars: ✭ 120 (-94.83%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-98.88%)
Drive☁️ A distributed cloud based lazy drive to files integrated with Dropbox, Google Drive.
Stars: ✭ 36 (-98.45%)
Vs DeployVisual Studio Code extension that provides commands to deploy files of a workspace to a destination.
Stars: ✭ 123 (-94.71%)