oci-cloudera: Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Stars: ✭ 20 (-71.43%)
hadoop-deployment-bash: Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
Stars: ✭ 31 (-55.71%)
cmux: A set of commands for managing CDH clusters using the Cloudera Manager REST API.
Stars: ✭ 34 (-51.43%)
GoInstaller: GoInstaller is an installer for CodeIgniter with a user interface (UI).
Stars: ✭ 31 (-55.71%)
presto: Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data
Stars: ✭ 91 (+30%)
memex-gate: General Architecture for Text Engineering
Stars: ✭ 47 (-32.86%)
ros hadoop: Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop, Spark, and other HDFS-compatible systems.
Stars: ✭ 92 (+31.43%)
text-sdk-php: PHP SDK to send messages with CM.com
Stars: ✭ 18 (-74.29%)
odoo-helper-scripts: The easiest way to install and manage development Odoo instances / projects.
Stars: ✭ 34 (-51.43%)
ProgramUpdater: PUF - Program Updater Framework. A library to ease the task of program updating
Stars: ✭ 14 (-80%)
liquibase-impala: Liquibase extension to add Impala Database support
Stars: ✭ 23 (-67.14%)
flokkr: Documentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (-57.14%)
hadoopoffice: HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-20%)
gitpack: Git-based package manager written in POSIX shell
Stars: ✭ 72 (+2.86%)
Singularis: My System Configuration ⚙️
Stars: ✭ 27 (-61.43%)
darwin: Avro Schema Evolution made easy
Stars: ✭ 26 (-62.86%)
archdi-pkg: Arch Linux Desktop Installer Packages
Stars: ✭ 46 (-34.29%)
xxhadoop: Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, just star!
Stars: ✭ 37 (-47.14%)
entrepot: A list of free GitHub.com hosted WordPress plugins, themes & blocks
Stars: ✭ 29 (-58.57%)
MLHadoop: This repository contains Machine-Learning MapReduce code for Hadoop, written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
Stars: ✭ 50 (-28.57%)
hive-jdbc-driver: An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-55.71%)
corc: An ORC File Scheme for the Cascading data processing platform.
Stars: ✭ 14 (-80%)
wasp: WASP is a framework to build complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture, mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-72.86%)
UEFI MULTI: UEFI_MULTI - Make Multi-Boot USB-Drive
Stars: ✭ 33 (-52.86%)
clusterdock: clusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-62.86%)
PackageProject.cmake: 🏛️ Help other developers use your project. A CMake script for packaging C/C++ projects for simple project installation while employing best practices for maximum compatibility.
Stars: ✭ 48 (-31.43%)
hadoop-etl-udfs: The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-75.71%)
Installomator: Installation script to deploy standard software on Macs
Stars: ✭ 472 (+574.29%)
sparkucx: A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing the UCX communication layer
Stars: ✭ 32 (-54.29%)
DaFlow: Apache Spark based Data Flow (ETL) framework which supports multiple read and write destinations of different types and also supports multiple categories of transformation rules.
Stars: ✭ 24 (-65.71%)
rastercube: rastercube is a Python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-78.57%)
fsbrowser: Fast desktop client for the Hadoop Distributed File System
Stars: ✭ 27 (-61.43%)
pyspark-ML-in-Colab: PySpark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-54.29%)
hadoop-crypto: Library for per-file client-side encryption in Hadoop FileSystems such as HDFS or S3.
Stars: ✭ 38 (-45.71%)
big-data-exploration: [Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-38.57%)
aaocp: A big data project for analyzing user behavior logs
Stars: ✭ 53 (-24.29%)
skein: A tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+82.86%)
nvim: ❤️ A Neovim config repo.
Stars: ✭ 33 (-52.86%)
UBA: UEBA Solution for Insider Security. This repo is archived. Thanks!
Stars: ✭ 36 (-48.57%)
disq: A library for manipulating bioinformatics sequencing formats in Apache Spark
Stars: ✭ 29 (-58.57%)
implyr: SQL backend to dplyr for Impala
Stars: ✭ 74 (+5.71%)
disk: A distributed network disk system built on Hadoop + HBase + Spring Boot
Stars: ✭ 53 (-24.29%)
clickhouse hadoop: Import data from ClickHouse to Hadoop with pure SQL
Stars: ✭ 26 (-62.86%)
lsst: Configures the environment for LSST software (newinstall.sh)
Stars: ✭ 14 (-80%)
platys-modern-data-platform: Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
Stars: ✭ 35 (-50%)
cobra-policytool: Manage Apache Atlas and Ranger configuration for your Hadoop environment.
Stars: ✭ 16 (-77.14%)
gtni: Install all your npm dependencies recursively with gtni while you are doing git clone, fetch or pull
Stars: ✭ 17 (-75.71%)
datasqueeze: Hadoop utility to compact small files
Stars: ✭ 18 (-74.29%)