hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-71.82%)
Hive Jdbc Uber JarHive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (+70.91%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (+3.64%)
HiveApache Hive
Stars: ✭ 4,031 (+3564.55%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-74.55%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+238.18%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+257.27%)
Docs4dev后端开发常用框架文档及中文翻译,包含 Spring 系列文档(Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session),大数据(Apache Hive, HBase, Apache Flume),日志(Log4j2, Logback),Http Server(NGINX,Apache),Python,数据库(OpenTSDB,MySQL,PostgreSQL)等最新官方文档以及对应的中文翻译。
Stars: ✭ 974 (+785.45%)
CamusMirror of Linkedin's Camus
Stars: ✭ 81 (-26.36%)
Pyetlpython ETL framework
Stars: ✭ 33 (-70%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-11.82%)
AkkeeperAn easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-72.73%)
Storm Camel ExampleReal-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-74.55%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (-29.09%)
AvrocadoAvrocado is a convenience library to handle Avro in Golang
Stars: ✭ 21 (-80.91%)
DamprPython Data Processing library
Stars: ✭ 102 (-7.27%)
BitalarmAn app to keep track of different cryptocurrencies, written in dart + flutter
Stars: ✭ 94 (-14.55%)
ChukwaMirror of Apache Chukwa
Stars: ✭ 77 (-30%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (-30.91%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-90%)
Dockerfiles50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+670%)
Hadoop PotA scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
Stars: ✭ 8 (-92.73%)
DatabookA facebook for data
Stars: ✭ 26 (-76.36%)
Docker HadoopApache Hadoop docker image
Stars: ✭ 1,190 (+981.82%)
AvscAvro for JavaScript ⚡️
Stars: ✭ 930 (+745.45%)
PyhivePython interface to Hive and Presto. 🐝
Stars: ✭ 1,378 (+1152.73%)
Stormtweetssentimentd3vizComputes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.
Stars: ✭ 25 (-77.27%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+744.55%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+732.73%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-26.36%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-34.55%)
Aptos☀️ Avro, Protobuf, Thrift on Swagger
Stars: ✭ 17 (-84.55%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-8.18%)
AtsdAxibase Time Series Database Documentation
Stars: ✭ 68 (-38.18%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-95.45%)
Kafka Storm StarterCode examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+561.82%)
Eyerissf An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"
Stars: ✭ 68 (-38.18%)
ScriptisScriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+532.73%)
MapreduceMapReduce by examples
Stars: ✭ 91 (-17.27%)
Pmacctpmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Stars: ✭ 677 (+515.45%)
Winutilswinutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows
Stars: ✭ 657 (+497.27%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-41.82%)
Corral🐎 A serverless MapReduce framework written for AWS Lambda
Stars: ✭ 648 (+489.09%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (+1397.27%)
Php Thrift SqlA PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-2.73%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-45.45%)
TonyTonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Stars: ✭ 626 (+469.09%)
Javapdf🍣100本 Java电子书 技术书籍PDF(以下载阅读为荣,以点赞收藏为耻)
Stars: ✭ 609 (+453.64%)