Simple Java Framework,designed for easily develop Spring based java program.Support Bigdata And metadata management.A common elasticsearch comm query tool and so on.

Stars: ✭ 16 (-57.89%)

Mutual labels: hadoop

hadoop-ansible

Install hadoop cluster with ansible

Stars: ✭ 35 (-7.89%)

Mutual labels: hadoop

asana mailer

A script that uses Asana's RESTful API to generate plaintext and HTML emails.

Stars: ✭ 12 (-68.42%)

Mutual labels: octo-correct-managed

disq

A library for manipulating bioinformatics sequencing formats in Apache Spark

Stars: ✭ 29 (-23.68%)

Mutual labels: hadoop

smart-data-lake

Smart Automation Tool for building modern Data Lakes and Data Pipelines

Stars: ✭ 79 (+107.89%)

Mutual labels: hadoop

hadoopoffice

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)

Stars: ✭ 56 (+47.37%)

Mutual labels: hadoop

webhdfs

Node.js WebHDFS REST API client

Stars: ✭ 88 (+131.58%)

Mutual labels: hadoop

disk

基于hadoop+hbase+springboot实现分布式网盘系统

Stars: ✭ 53 (+39.47%)

Mutual labels: hadoop

gomrjob

gomrjob - a Go Framework for Hadoop Map Reduce Jobs

Stars: ✭ 39 (+2.63%)

Mutual labels: hadoop

liquibase-impala

Liquibase extension to add Impala Database support

Stars: ✭ 23 (-39.47%)

Mutual labels: hadoop

orion

Management and automation platform for Stateful Distributed Systems

Stars: ✭ 77 (+102.63%)

Mutual labels: hadoop

qs-hadoop

大数据生态圈学习

Stars: ✭ 18 (-52.63%)

Mutual labels: hadoop

oci-cloudera

Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)

Stars: ✭ 20 (-47.37%)

Mutual labels: hadoop

conjure-rust

Conjure support for Rust

Stars: ✭ 14 (-63.16%)

Mutual labels: octo-correct-managed

dockerfiles

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Stars: ✭ 29 (-23.68%)

Mutual labels: hadoop

RecommendationEngine

Source code and dataset for paper "CBMR: An optimized MapReduce for item‐based collaborative filtering recommendation algorithm with empirical analysis"

Stars: ✭ 43 (+13.16%)

Mutual labels: hadoop

ambari-hdp-docker

Dockerfiles and Docker Compose for HDP 2.6 with Blueprints

Stars: ✭ 23 (-39.47%)

Mutual labels: hadoop

iis

Information Inference Service of the OpenAIRE system

Stars: ✭ 16 (-57.89%)

Mutual labels: hadoop

xxhadoop

Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !

Stars: ✭ 37 (-2.63%)

Mutual labels: hadoop

sparkucx

A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer

Stars: ✭ 32 (-15.79%)

Mutual labels: hadoop

the-apache-ignite-book

All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above

Stars: ✭ 65 (+71.05%)

Mutual labels: hadoop

rust-zipkin

A library for logging and propagating Zipkin trace information in Rust

Stars: ✭ 50 (+31.58%)

Mutual labels: octo-correct-managed

phishcatch

A browser extension and API server for detecting corporate password use on external websites

Stars: ✭ 75 (+97.37%)

Mutual labels: octo-correct-managed

corc

An ORC File Scheme for the Cascading data processing platform.

Stars: ✭ 14 (-63.16%)

Mutual labels: hadoop

learning-hadoop-and-spark

Companion to Learning Hadoop and Learning Spark courses on Linked In Learning

Stars: ✭ 146 (+284.21%)

Mutual labels: hadoop

hadoop-ecosystem

Visualizations of the Hadoop Ecosystem

Stars: ✭ 20 (-47.37%)

Mutual labels: hadoop

openPDC

Open Source Phasor Data Concentrator

Stars: ✭ 109 (+186.84%)

Mutual labels: hadoop

pyspark-ML-in-Colab

Pyspark in Google Colab: A simple machine learning (Linear Regression) model

Stars: ✭ 32 (-15.79%)

Mutual labels: hadoop

dpkb

大数据相关内容汇总，包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词：Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse

Stars: ✭ 123 (+223.68%)

Mutual labels: hadoop

rastercube

rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)

Stars: ✭ 15 (-60.53%)

Mutual labels: hadoop

yarn-prometheus-exporter

Export Hadoop YARN (resource-manager) metrics in prometheus format

Stars: ✭ 44 (+15.79%)

Mutual labels: hadoop

big-data-exploration

[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product

Stars: ✭ 43 (+13.16%)

Mutual labels: hadoop

teraslice

Scalable data processing pipelines in JavaScript

Stars: ✭ 48 (+26.32%)

Mutual labels: hadoop

wasp

WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.

Stars: ✭ 19 (-50%)

Mutual labels: hadoop

beanszoo

Distributed Java micro-services using ZooKeeper

Stars: ✭ 12 (-68.42%)

Mutual labels: hadoop

datalake-etl-pipeline

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations

Stars: ✭ 39 (+2.63%)

Mutual labels: hadoop

gradle-consistent-versions

Compact, constraint-friendly lockfiles for your dependencies

Stars: ✭ 92 (+142.11%)

Mutual labels: octo-correct-managed

go-baseapp

A lightweight starting point for Go web servers

Stars: ✭ 61 (+60.53%)

Mutual labels: octo-correct-managed

palantir-java-format

A modern, lambda-friendly, 120 character Java formatter.