All Projects → Eel Sdk → Similar Projects or Alternatives

1215 Open source projects that are alternatives of or similar to Eel Sdk

hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
Stars: ✭ 16 (-88.57%)
Mutual labels:  hive, hadoop
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (-88.57%)
Mutual labels:  big-data, hadoop
rastercube
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-89.29%)
Mutual labels:  big-data, hadoop
hive to es
同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-85%)
Mutual labels:  hive, hadoop
Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (+339.29%)
Mutual labels:  kafka, etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-87.86%)
Mutual labels:  big-data, etl
hive-jdbc-driver
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-77.86%)
Mutual labels:  hive, hadoop
Parquet Format
Apache Parquet
Stars: ✭ 800 (+471.43%)
Mutual labels:  big-data, parquet
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-96.43%)
Mutual labels:  big-data, hadoop
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-16.43%)
Mutual labels:  big-data, hadoop
cobra-policytool
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
Stars: ✭ 16 (-88.57%)
Mutual labels:  hive, hadoop
Hazelcast Jet
Distributed Stream and Batch Processing
Stars: ✭ 855 (+510.71%)
Mutual labels:  kafka, big-data
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (-43.57%)
Mutual labels:  hive, hadoop
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-75.71%)
Mutual labels:  big-data, hadoop
TIL
Today I Learned
Stars: ✭ 43 (-69.29%)
Mutual labels:  hive, hadoop
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+15648.57%)
Mutual labels:  big-data, hadoop
Cdc Kafka Hadoop
MySQL to NoSQL real time dataflow
Stars: ✭ 13 (-90.71%)
Mutual labels:  kafka, hadoop
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+512.14%)
Mutual labels:  kafka, hadoop
Pyetl
python ETL framework
Stars: ✭ 33 (-76.43%)
Mutual labels:  etl, hive
big-data-lite
Samples to the Oracle Big Data Lite VM
Stars: ✭ 41 (-70.71%)
Mutual labels:  big-data, hadoop
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-90.71%)
Mutual labels:  big-data, hadoop
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-90%)
Mutual labels:  big-data, hadoop
GooglePlay-Web-Crawler
Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Stars: ✭ 18 (-87.14%)
Mutual labels:  hive, hadoop
Amazon S3 Find And Forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-17.86%)
Mutual labels:  big-data, parquet
bandar-log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-85.71%)
Mutual labels:  big-data, etl
basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-82.14%)
Mutual labels:  hadoop, etl
Cloudbreak
A tool for provisioning and managing Apache Hadoop clusters in the cloud. Cloudbreak, as part of the Hortonworks Data Platform, makes it easy to provision, configure and elastically grow HDP clusters on cloud infrastructure. Cloudbreak can be used to provision Hadoop across cloud infrastructure providers including AWS, Azure, GCP and OpenStack.
Stars: ✭ 301 (+115%)
Mutual labels:  big-data, hadoop
Smooks
An extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+109.29%)
Mutual labels:  big-data, etl
Gather Deployment
Gathers scalable tensorflow and infrastructure deployment
Stars: ✭ 326 (+132.86%)
Mutual labels:  kafka, hadoop
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-73.57%)
Mutual labels:  hive, hadoop
Choetl
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+165.71%)
Mutual labels:  etl, parquet
Ignite
Apache Ignite
Stars: ✭ 4,027 (+2776.43%)
Mutual labels:  big-data, hadoop
Hive Funnel Udf
Hive UDFs for funnel analysis
Stars: ✭ 72 (-48.57%)
Mutual labels:  hadoop, hive
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-47.14%)
Mutual labels:  hadoop, hive
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-43.57%)
Mutual labels:  big-data, etl
Iceberg
Iceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+180.71%)
Mutual labels:  hadoop, parquet
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+157.86%)
Mutual labels:  big-data, etl
Kafka Streams
equivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+337.86%)
Mutual labels:  kafka, big-data
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3940%)
Mutual labels:  big-data, hadoop
Parquet Mr
Apache Parquet
Stars: ✭ 1,278 (+812.86%)
Mutual labels:  big-data, parquet
Bigdata
💎🔥大数据学习笔记
Stars: ✭ 488 (+248.57%)
Mutual labels:  hadoop, hive
Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak, TeamCity and DevOps tools built on the major Linux distros: Alpine, CentOS, Debian, Fedora, Ubuntu
Stars: ✭ 847 (+505%)
Mutual labels:  kafka, hadoop
Camus
Mirror of Linkedin's Camus
Stars: ✭ 81 (-42.14%)
Mutual labels:  kafka, hadoop
Wifi
基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (-33.57%)
Mutual labels:  hadoop, hive
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-31.43%)
Mutual labels:  kafka, big-data
Parquet Cpp
Apache Parquet
Stars: ✭ 339 (+142.14%)
Mutual labels:  big-data, parquet
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-24.29%)
Mutual labels:  hadoop, hive
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (-48.57%)
Mutual labels:  etl, hive
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-59.29%)
Mutual labels:  big-data, hadoop
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-41.43%)
Mutual labels:  kafka, big-data
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-41.43%)
Mutual labels:  hadoop, hive
Springboot Templates
springboot和dubbo、netty的集成,redis mongodb的nosql模板, kafka rocketmq rabbit的MQ模板, solr solrcloud elasticsearch查询引擎
Stars: ✭ 100 (-28.57%)
Mutual labels:  kafka, hive
Calcite Avatica
Mirror of Apache Calcite - Avatica
Stars: ✭ 130 (-7.14%)
Mutual labels:  big-data, hadoop
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+632.14%)
Mutual labels:  big-data, hadoop
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-27.86%)
Mutual labels:  big-data, hive
DataX-src
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-85%)
Mutual labels:  hive, etl
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (-12.14%)
Mutual labels:  hive, hadoop
Ozone
Scalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (+135.71%)
Mutual labels:  big-data, hadoop
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+614.29%)
Mutual labels:  kafka, hadoop
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-30.71%)
Mutual labels:  kafka, big-data
61-120 of 1215 similar projects