All Projects → Drill → Similar Projects or Alternatives

824 Open source projects that are alternatives of or similar to Drill

Apache IoTDB

Stars: ✭ 1,221 (-24.58%)

Mutual labels: big-data

Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.

Stars: ✭ 303 (-81.28%)

Mutual labels: big-data

Cloud Based Sql Engine Using Spark

Cloud-based SQL engine using SPARK where data is accessible as JDBC/ODBC data source via Spark ThriftServer.

Stars: ✭ 30 (-98.15%)

Mutual labels: jdbc

Ecency desktop formerly known as Esteem Surfer - reimagined desktop social wallet, contribute and get rewarded (for Windows, Mac, Linux)

Stars: ✭ 100 (-93.82%)

Mutual labels: hive

Create clusters of VMs on the cloud and configure them with Ansible.

Stars: ✭ 298 (-81.59%)

Mutual labels: hadoop

Data Algorithms Book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Stars: ✭ 949 (-41.38%)

Mutual labels: hadoop

Spring Boot Data Source Decorator

Spring Boot integration with p6spy, datasource-proxy, flexy-pool and spring-cloud-sleuth

Stars: ✭ 295 (-81.78%)

Mutual labels: jdbc

Attic Predictionio Template Recommender

PredictionIO Recommendation Engine Template (Scala-based parallelized engine)

Stars: ✭ 78 (-95.18%)

Mutual labels: big-data

白泽自动化运维系统：配置管理、网络探测、资产管理、业务管理、CMDB、CD、DevOps、作业编排、任务编排等功能,未来将添加监控、报警、日志分析、大数据分析等部分内容

Stars: ✭ 296 (-81.72%)

Mutual labels: big-data

A client interface to the QCArchive Project (read-only image of QCFractal)

Stars: ✭ 29 (-98.21%)

Mutual labels: big-data

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+551.27%)

Mutual labels: big-data

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

Stars: ✭ 286 (-82.33%)

Mutual labels: hadoop

Awesome Scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

Stars: ✭ 36,688 (+2166.09%)

Mutual labels: big-data

Experimental stuff for going fast with Clojure + JDBC & Async SQL

Stars: ✭ 78 (-95.18%)

Mutual labels: jdbc

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Stars: ✭ 5,513 (+240.52%)

Mutual labels: big-data

Interview Questions Collection

按知识领域整理面试题，包括C++、Java、Hadoop、机器学习等

Stars: ✭ 21 (-98.7%)

Mutual labels: hadoop

Read and write Neuroglancer datasets programmatically.

Stars: ✭ 63 (-96.11%)

Mutual labels: big-data

Hibernate Springboot

Collection of best practices for Java persistence performance in Spring Boot applications

Stars: ✭ 589 (-63.62%)

Mutual labels: jdbc

A tool for data sampling, data generation, and data diffing

Stars: ✭ 279 (-82.77%)

Mutual labels: parquet

Springboot Templates

springboot和dubbo、netty的集成，redis mongodb的nosql模板， kafka rocketmq rabbit的MQ模板， solr solrcloud elasticsearch查询引擎

Stars: ✭ 100 (-93.82%)

Mutual labels: hive

Cdc Kafka Hadoop

MySQL to NoSQL real time dataflow

Stars: ✭ 13 (-99.2%)

Mutual labels: hadoop

The Metadata Platform for the Modern Data Stack

Stars: ✭ 4,232 (+161.4%)

Mutual labels: big-data

Train TensorFlow models on YARN in just a few lines of code!

Stars: ✭ 76 (-95.31%)

Mutual labels: hadoop

Create full-fledged APIs for static datasets without writing a single line of code.

Stars: ✭ 253 (-84.37%)

Mutual labels: parquet

Dremio - the missing link in modern data

Stars: ✭ 862 (-46.76%)

Mutual labels: big-data

Two-day workshop that covers how to use R to interact databases and Spark

Stars: ✭ 110 (-93.21%)

Mutual labels: big-data

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

Stars: ✭ 107 (-93.39%)

Mutual labels: big-data

Database Subsetting and Relational Data Browsing Tool.

Stars: ✭ 576 (-64.42%)

Mutual labels: jdbc

Apache Spark 官方文档中文版

Stars: ✭ 1,126 (-30.45%)

Mutual labels: big-data

Alluxio, data orchestration for analytics and machine learning in the cloud

Stars: ✭ 5,379 (+232.24%)

Mutual labels: hadoop

H2 is an embeddable RDBMS written in Java.

Stars: ✭ 3,078 (+90.12%)

Mutual labels: jdbc

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (-47.07%)

Mutual labels: hadoop

R package for statistical tools with big matrices stored on disk.

Stars: ✭ 139 (-91.41%)

Mutual labels: big-data

The Data Engineering Cookbook

Stars: ✭ 9,829 (+507.1%)

Mutual labels: big-data

REST APIs with JSP tags, SQL and much more.

Stars: ✭ 24 (-98.52%)

Mutual labels: jdbc

Distributed Stream and Batch Processing

Stars: ✭ 855 (-47.19%)

Mutual labels: big-data

phData Pulse application log aggregation and monitoring

Stars: ✭ 13 (-99.2%)

Mutual labels: hadoop

Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]

Stars: ✭ 854 (-47.25%)

Mutual labels: big-data

hadoop-docker-lite

Docker build project to setup a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager

Stars: ✭ 24 (-98.52%)

Mutual labels: hadoop

Apache Hadoop docker image

Stars: ✭ 1,190 (-26.5%)

Mutual labels: hadoop

A plugin designed for bukkit servers, aiming to reduce the lag that both the server and players experience.

Stars: ✭ 23 (-98.58%)

Mutual labels: jdbc

A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.

Stars: ✭ 8 (-99.51%)

Mutual labels: hadoop

📊 📋 Dashboards using YAML or JSON files

Stars: ✭ 1,511 (-6.67%)

Mutual labels: big-data

avro-schema-generator

Library for generating avro schema files (.avsc) based on DB tables structure

Stars: ✭ 38 (-97.65%)

Mutual labels: jdbc

A facebook for data

Stars: ✭ 26 (-98.39%)

Mutual labels: hive

dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.

Stars: ✭ 30 (-98.15%)

Mutual labels: parquet

Research on distributed system

Stars: ✭ 73 (-95.49%)

Mutual labels: big-data

Postgresql JDBC Driver

Stars: ✭ 925 (-42.87%)

Mutual labels: jdbc

Mirror of Apache Giraph

Stars: ✭ 569 (-64.85%)

Mutual labels: big-data

Convert and analyze large data sets at light speed, on Mac and iOS.

Stars: ✭ 62 (-96.17%)

Mutual labels: big-data

Efficient video analysis at scale

Stars: ✭ 569 (-64.85%)

Mutual labels: big-data

定期更新Hadoop生态圈中常用大数据组件文档重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图印象笔记 Scala版本简单demo 常用工具类去敏后的train code 持续更新!!!)

Stars: ✭ 567 (-64.98%)

Mutual labels: hadoop

An extremely fast Non-crypto-safe AES Based Hash algorithm for Big Data

Stars: ✭ 62 (-96.17%)

Mutual labels: big-data

Reproducible Data Science at Scale!

Stars: ✭ 5,305 (+227.67%)

Mutual labels: big-data

Workflows and interfaces for neuroimaging packages

Stars: ✭ 557 (-65.6%)

Mutual labels: big-data

Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.

Stars: ✭ 60 (-96.29%)

Mutual labels: hadoop

Seamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability

Stars: ✭ 5,166 (+219.09%)

Mutual labels: big-data

Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++

Stars: ✭ 528 (-67.39%)

Mutual labels: big-data

repo for code published on pythondata.com

Stars: ✭ 113 (-93.02%)

Mutual labels: big-data

Mysql perf analyzer

MySQL performance monitoring and analysis.

Stars: ✭ 1,423 (-12.11%)

Mutual labels: big-data

301-360 of 824 similar projects