All Projects → Facebook Hive Udfs → Similar Projects or Alternatives

304 Open source projects that are alternatives of or similar to Facebook Hive Udfs

Parquet4s

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

Stars: ✭ 125 (-41.31%)

Mutual labels: hadoop

Docker Hadoop

Apache Hadoop docker image

Stars: ✭ 1,190 (+458.69%)

Mutual labels: hadoop

Docker Hadoop Cluster

Multiple node cluster on Docker for self development.

Stars: ✭ 82 (-61.5%)

Mutual labels: hadoop

Spydra

Ephemeral Hadoop clusters using Google Compute Platform

Stars: ✭ 128 (-39.91%)

Mutual labels: hadoop

Bigdata Playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-16.9%)

Mutual labels: hadoop

Chukwa

Mirror of Apache Chukwa

Stars: ✭ 77 (-63.85%)

Mutual labels: hadoop

Dynamometer

A tool for scale and performance testing of HDFS with a specific focus on the NameNode.

Stars: ✭ 122 (-42.72%)

Mutual labels: hadoop

Xsql

Unified SQL Analytics Engine Based on SparkSQL

Stars: ✭ 176 (-17.37%)

Mutual labels: hive

Hive Third Functions

Some useful custom hive udf functions, especial array, json, math, string functions.

Stars: ✭ 151 (-29.11%)

Mutual labels: hive

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-45.07%)

Mutual labels: hadoop

Atsd

Axibase Time Series Database Documentation

Stars: ✭ 68 (-68.08%)

Mutual labels: hadoop

Hadoop Hdfs

Mirror of Apache Hadoop HDFS

Stars: ✭ 152 (-28.64%)

Mutual labels: hadoop

Luigi Warehouse

A luigi powered analytics / warehouse stack

Stars: ✭ 72 (-66.2%)

Mutual labels: hive

Awesome Learning

实践源码库：https://github.com/jast90/bigdata 。微信搜索Jast关注公众号，获取最新技术分享😯。

Stars: ✭ 197 (-7.51%)

Mutual labels: hadoop

Eyerissf

An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"

Stars: ✭ 68 (-68.08%)

Mutual labels: hive

Src

A light-weight distributed stream computing framework for Golang

Stars: ✭ 67 (-68.54%)

Mutual labels: hadoop

Jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

Stars: ✭ 64 (-69.95%)

Mutual labels: hadoop

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-29.58%)

Mutual labels: hadoop

Ibis

A pandas-like deferred expression system, with first-class SQL support

Stars: ✭ 1,630 (+665.26%)

Mutual labels: hadoop

Waimak

Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.

Stars: ✭ 60 (-71.83%)

Mutual labels: hadoop

Likelike

An implementation of locality sensitive hashing with Hadoop

Stars: ✭ 58 (-72.77%)

Mutual labels: hadoop

Cube.js

📊 Cube — Open-Source Analytics API for Building Data Apps

Stars: ✭ 11,983 (+5525.82%)

Mutual labels: hive

Docker Spark Cluster

A Spark cluster setup running on Docker containers

Stars: ✭ 57 (-73.24%)

Mutual labels: hadoop

Deeplearning4j

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…

Stars: ✭ 12,277 (+5663.85%)

Mutual labels: hadoop

Hadoop

Apache Hadoop

Stars: ✭ 12,177 (+5616.9%)

Mutual labels: hadoop

Docker Hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

Stars: ✭ 54 (-74.65%)

Mutual labels: hadoop

Hadoop Solr

Code to index HDFS to Solr using MapReduce

Stars: ✭ 51 (-76.06%)

Mutual labels: hadoop

Base

https://www.researchgate.net/profile/Rajah_Iyer

Stars: ✭ 48 (-77.46%)

Mutual labels: hadoop

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-46.48%)

Mutual labels: hadoop

Moosefs

MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)

Stars: ✭ 1,025 (+381.22%)

Mutual labels: hadoop

Nagios Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Stars: ✭ 1,000 (+369.48%)

Mutual labels: hadoop

Parquet Rs

Apache Parquet implementation in Rust

Stars: ✭ 144 (-32.39%)

Mutual labels: hadoop

Tensorflowonyarn

Support TensorFlow on YARN

Stars: ✭ 114 (-46.48%)

Mutual labels: hadoop

Weblogsanalysissystem

A big data platform for analyzing web access logs

Stars: ✭ 37 (-82.63%)

Mutual labels: hadoop

Learning Spark

零基础学习spark，大数据学习

Stars: ✭ 37 (-82.63%)

Mutual labels: hadoop

Parquet Go

Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.

Stars: ✭ 114 (-46.48%)

Mutual labels: hadoop

Jsr203 Hadoop

A Java NIO file system provider for HDFS

Stars: ✭ 35 (-83.57%)

Mutual labels: hadoop

Docs4dev

后端开发常用框架文档及中文翻译，包含 Spring 系列文档（Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session），大数据（Apache Hive, HBase, Apache Flume），日志（Log4j2, Logback），Http Server（NGINX，Apache），Python，数据库（OpenTSDB，MySQL，PostgreSQL）等最新官方文档以及对应的中文翻译。

Stars: ✭ 974 (+357.28%)

Mutual labels: hive

Javaorbigdata Interview

Java开发者或者大数据开发者面试知识点整理

Stars: ✭ 203 (-4.69%)

Mutual labels: hadoop

Nutch

Apache Nutch is an extensible and scalable web crawler

Stars: ✭ 2,277 (+969.01%)

Mutual labels: hadoop

Bigdata practice

大数据分析可视化实践

Stars: ✭ 166 (-22.07%)

Mutual labels: hive

Hive

Fast. Scalable. Powerful. The Blockchain for Web 3.0

Stars: ✭ 142 (-33.33%)

Mutual labels: hive

Xlearning Xdml

extremely distributed machine learning

Stars: ✭ 113 (-46.95%)

Mutual labels: hadoop

Pyetl

python ETL framework

Stars: ✭ 33 (-84.51%)

Mutual labels: hive

Akkeeper

An easy way to deploy your Akka services to a distributed environment.

Stars: ✭ 30 (-85.92%)

Mutual labels: hadoop

Data Algorithms Book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Stars: ✭ 949 (+345.54%)

Mutual labels: hadoop

Storm Camel Example

Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.

Stars: ✭ 28 (-86.85%)

Mutual labels: hadoop

Spark Authorizer

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark

Stars: ✭ 141 (-33.8%)

Mutual labels: hive

Introtohadoopandmr udacity course

🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"

Stars: ✭ 110 (-48.36%)

Mutual labels: hadoop

Interview Questions Collection

按知识领域整理面试题，包括C++、Java、Hadoop、机器学习等

Stars: ✭ 21 (-90.14%)

Mutual labels: hadoop

Cdc Kafka Hadoop

MySQL to NoSQL real time dataflow

Stars: ✭ 13 (-93.9%)

Mutual labels: hadoop

Waterdrop

Production Ready Data Integration Product, documentation：

Stars: ✭ 1,856 (+771.36%)

Mutual labels: hadoop

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (+302.35%)

Mutual labels: hadoop

Big Whale

Spark、Flink等离线任务的调度以及实时任务的监控

Stars: ✭ 163 (-23.47%)

Mutual labels: hadoop

Dockerfiles

50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu

Stars: ✭ 847 (+297.65%)

Mutual labels: hadoop

Hadoop Pot

A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.

Stars: ✭ 8 (-96.24%)

Mutual labels: hadoop

Databook

A facebook for data

Stars: ✭ 26 (-87.79%)

Mutual labels: hive

Php Thrift Sql

A PHP library for connecting to Hive or Impala over Thrift

Stars: ✭ 107 (-49.77%)

Mutual labels: hive

Stormtweetssentimentd3viz

Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.

Stars: ✭ 25 (-88.26%)

Mutual labels: hadoop

Kylo

Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.

Stars: ✭ 916 (+330.05%)

Mutual labels: hadoop

61-120 of 304 similar projects

‹

›