All Projects → Facebook Hive Udfs → Similar Projects or Alternatives

304 Open source projects that are alternatives of or similar to Facebook Hive Udfs

Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-41.31%)
Mutual labels:  hadoop
Docker Hadoop
Apache Hadoop docker image
Stars: ✭ 1,190 (+458.69%)
Mutual labels:  hadoop
Docker Hadoop Cluster
Multiple node cluster on Docker for self development.
Stars: ✭ 82 (-61.5%)
Mutual labels:  hadoop
Spydra
Ephemeral Hadoop clusters using Google Compute Platform
Stars: ✭ 128 (-39.91%)
Mutual labels:  hadoop
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-16.9%)
Mutual labels:  hadoop
Chukwa
Mirror of Apache Chukwa
Stars: ✭ 77 (-63.85%)
Mutual labels:  hadoop
Dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Stars: ✭ 122 (-42.72%)
Mutual labels:  hadoop
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-17.37%)
Mutual labels:  hive
Hive Third Functions
Some useful custom hive udf functions, especial array, json, math, string functions.
Stars: ✭ 151 (-29.11%)
Mutual labels:  hive
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-45.07%)
Mutual labels:  hadoop
Atsd
Axibase Time Series Database Documentation
Stars: ✭ 68 (-68.08%)
Mutual labels:  hadoop
Hadoop Hdfs
Mirror of Apache Hadoop HDFS
Stars: ✭ 152 (-28.64%)
Mutual labels:  hadoop
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (-66.2%)
Mutual labels:  hive
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (-7.51%)
Mutual labels:  hadoop
Eyerissf
An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"
Stars: ✭ 68 (-68.08%)
Mutual labels:  hive
Src
A light-weight distributed stream computing framework for Golang
Stars: ✭ 67 (-68.54%)
Mutual labels:  hadoop
Jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-69.95%)
Mutual labels:  hadoop
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-29.58%)
Mutual labels:  hadoop
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+665.26%)
Mutual labels:  hadoop
Waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-71.83%)
Mutual labels:  hadoop
Likelike
An implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-72.77%)
Mutual labels:  hadoop
Cube.js
📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+5525.82%)
Mutual labels:  hive
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-73.24%)
Mutual labels:  hadoop
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5663.85%)
Mutual labels:  hadoop
Hadoop
Apache Hadoop
Stars: ✭ 12,177 (+5616.9%)
Mutual labels:  hadoop
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-74.65%)
Mutual labels:  hadoop
Hadoop Solr
Code to index HDFS to Solr using MapReduce
Stars: ✭ 51 (-76.06%)
Mutual labels:  hadoop
Base
https://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-77.46%)
Mutual labels:  hadoop
Asakusafw
Asakusa Framework
Stars: ✭ 114 (-46.48%)
Mutual labels:  hadoop
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+381.22%)
Mutual labels:  hadoop
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+369.48%)
Mutual labels:  hadoop
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (-32.39%)
Mutual labels:  hadoop
Tensorflowonyarn
Support TensorFlow on YARN
Stars: ✭ 114 (-46.48%)
Mutual labels:  hadoop
Weblogsanalysissystem
A big data platform for analyzing web access logs
Stars: ✭ 37 (-82.63%)
Mutual labels:  hadoop
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-82.63%)
Mutual labels:  hadoop
Parquet Go
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (-46.48%)
Mutual labels:  hadoop
Jsr203 Hadoop
A Java NIO file system provider for HDFS
Stars: ✭ 35 (-83.57%)
Mutual labels:  hadoop
Docs4dev
后端开发常用框架文档及中文翻译,包含 Spring 系列文档(Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session),大数据(Apache Hive, HBase, Apache Flume),日志(Log4j2, Logback),Http Server(NGINX,Apache),Python,数据库(OpenTSDB,MySQL,PostgreSQL)等最新官方文档以及对应的中文翻译。
Stars: ✭ 974 (+357.28%)
Mutual labels:  hive
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (-4.69%)
Mutual labels:  hadoop
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+969.01%)
Mutual labels:  hadoop
Bigdata practice
大数据分析可视化实践
Stars: ✭ 166 (-22.07%)
Mutual labels:  hive
Hive
Fast. Scalable. Powerful. The Blockchain for Web 3.0
Stars: ✭ 142 (-33.33%)
Mutual labels:  hive
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-46.95%)
Mutual labels:  hadoop
Pyetl
python ETL framework
Stars: ✭ 33 (-84.51%)
Mutual labels:  hive
Akkeeper
An easy way to deploy your Akka services to a distributed environment.
Stars: ✭ 30 (-85.92%)
Mutual labels:  hadoop
Data Algorithms Book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+345.54%)
Mutual labels:  hadoop
Storm Camel Example
Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-86.85%)
Mutual labels:  hadoop
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-33.8%)
Mutual labels:  hive
Introtohadoopandmr udacity course
🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"
Stars: ✭ 110 (-48.36%)
Mutual labels:  hadoop
Interview Questions Collection
按知识领域整理面试题,包括C++、Java、Hadoop、机器学习等
Stars: ✭ 21 (-90.14%)
Mutual labels:  hadoop
Cdc Kafka Hadoop
MySQL to NoSQL real time dataflow
Stars: ✭ 13 (-93.9%)
Mutual labels:  hadoop
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+771.36%)
Mutual labels:  hadoop
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+302.35%)
Mutual labels:  hadoop
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-23.47%)
Mutual labels:  hadoop
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+297.65%)
Mutual labels:  hadoop
Hadoop Pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
Stars: ✭ 8 (-96.24%)
Mutual labels:  hadoop
Databook
A facebook for data
Stars: ✭ 26 (-87.79%)
Mutual labels:  hive
Php Thrift Sql
A PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-49.77%)
Mutual labels:  hive
Stormtweetssentimentd3viz
Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.
Stars: ✭ 25 (-88.26%)
Mutual labels:  hadoop
Kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+330.05%)
Mutual labels:  hadoop
61-120 of 304 similar projects