Simple Java Framework,designed for easily develop Spring based java program.Support Bigdata And metadata management.A common elasticsearch comm query tool and so on.

Stars: ✭ 16 (-85.45%)

Mutual labels: hadoop

Floating Elephants

Docker containers for Hadoop.

Stars: ✭ 19 (-82.73%)

Mutual labels: hadoop

orion

Management and automation platform for Stateful Distributed Systems

Stars: ✭ 77 (-30%)

Mutual labels: hadoop

AvroConvert

Apache Avro serializer for .NET

Stars: ✭ 44 (-60%)

Mutual labels: avro

hadoop-ansible

Install hadoop cluster with ansible

Stars: ✭ 35 (-68.18%)

Mutual labels: hadoop

Magnolify

A collection of Magnolia add-on modules

Stars: ✭ 81 (-26.36%)

Mutual labels: avro

avro-serde-php

Avro Serialisation/Deserialisation (SerDe) library for PHP 7.3+ & 8.0 with a Symfony Serializer integration

Stars: ✭ 43 (-60.91%)

Mutual labels: avro

ambari-hdp-docker

Dockerfiles and Docker Compose for HDP 2.6 with Blueprints

Stars: ✭ 23 (-79.09%)

Mutual labels: hadoop

Yandex Big Data Engineering

Stars: ✭ 17 (-84.55%)

Mutual labels: mapreduce

kafka-connect-fs

Kafka Connect FileSystem Connector

Stars: ✭ 107 (-2.73%)

Mutual labels: hadoop

dotnet-avro

An Avro implementation for .NET

Stars: ✭ 60 (-45.45%)

Mutual labels: avro

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-35.45%)

Mutual labels: mapreduce

Orc

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

Stars: ✭ 389 (+253.64%)

Mutual labels: hadoop

UBA

UEBA Solution for Insider Security. This repo is archived. Thanks!

Stars: ✭ 36 (-67.27%)

Mutual labels: hadoop

bigdatatutorial

Stars: ✭ 34 (-69.09%)

Mutual labels: hadoop

spark-acid

ACID Data Source for Apache Spark based on Hive ACID

Stars: ✭ 91 (-17.27%)

Mutual labels: hive

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-95.45%)

Mutual labels: hadoop

Luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Stars: ✭ 15,226 (+13741.82%)

Mutual labels: hadoop

avro-typescript

TypeScript Code Generator for Apache Avro Schema Types

Stars: ✭ 19 (-82.73%)

Mutual labels: avro

Maha

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Stars: ✭ 101 (-8.18%)

Mutual labels: hive

Docker Hadoop Cluster

Multiple node cluster on Docker for self development.

Stars: ✭ 82 (-25.45%)

Mutual labels: hadoop

Weblogsanalysissystem

A big data platform for analyzing web access logs

Stars: ✭ 37 (-66.36%)

Mutual labels: hadoop

Ignite

Apache Ignite

Stars: ✭ 4,027 (+3560.91%)

Mutual labels: hadoop

implyr

SQL backend to dplyr for Impala

Stars: ✭ 74 (-32.73%)

Mutual labels: hadoop

docker-hive

Docker image for Apache Hive Metastore

Stars: ✭ 42 (-61.82%)

Mutual labels: hive

Javaorbigdata Interview

Java开发者或者大数据开发者面试知识点整理

Stars: ✭ 203 (+84.55%)

Mutual labels: hadoop

Kafka Storm Starter

Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

Stars: ✭ 728 (+561.82%)

Mutual labels: avro

Awesome Learning

实践源码库：https://github.com/jast90/bigdata 。微信搜索Jast关注公众号，获取最新技术分享😯。

Stars: ✭ 197 (+79.09%)

Mutual labels: hadoop

yuzhouwan

Code Library for My Blog

Stars: ✭ 39 (-64.55%)

Mutual labels: hadoop

hadoop-crypto

Library for per-file client-side encyption in Hadoop FileSystems such as HDFS or S3.

Stars: ✭ 38 (-65.45%)

Mutual labels: hadoop

Bigdl

Building Large-Scale AI Applications for Distributed Big Data

Stars: ✭ 3,813 (+3366.36%)

Mutual labels: hadoop

datasqueeze

Hadoop utility to compact small files

Stars: ✭ 18 (-83.64%)

Mutual labels: hadoop

hive-cube

Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org

Stars: ✭ 34 (-69.09%)

Mutual labels: hive

Learning Spark

零基础学习spark，大数据学习

Stars: ✭ 37 (-66.36%)

Mutual labels: hadoop

Choetl

ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)

Stars: ✭ 372 (+238.18%)

Mutual labels: avro

wasp

WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.

Stars: ✭ 19 (-82.73%)

Mutual labels: hadoop

ooso

Java library for running Serverless MapReduce jobs

Stars: ✭ 25 (-77.27%)

Mutual labels: mapreduce

presto

Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data

Stars: ✭ 91 (-17.27%)

Mutual labels: hadoop

301-360 of 422 similar projects

first

‹

›