All Projects → Hadoop Connectors → Similar Projects or Alternatives

323 Open source projects that are alternatives of or similar to Hadoop Connectors

ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ

Stars: ✭ 174 (-20.18%)

Mutual labels: bigquery

Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. What datasets do you want to be added to Ethereum ETL? Vote here: https://blockchain-etl.convas.io.

Stars: ✭ 89 (-59.17%)

Mutual labels: bigquery

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-40.37%)

Mutual labels: hadoop

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-62.39%)

Mutual labels: hadoop

Awesome Learning

实践源码库：https://github.com/jast90/bigdata 。微信搜索Jast关注公众号，获取最新技术分享😯。

Stars: ✭ 197 (-9.63%)

Mutual labels: hadoop

Camus

Mirror of Linkedin's Camus

Stars: ✭ 81 (-62.84%)

Mutual labels: hadoop

Airflow Pipeline

An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR

Stars: ✭ 128 (-41.28%)

Mutual labels: hadoop

Docker Spark

🚢 Docker image for Apache Spark

Stars: ✭ 78 (-64.22%)

Mutual labels: hadoop

Big Whale

Spark、Flink等离线任务的调度以及实时任务的监控

Stars: ✭ 163 (-25.23%)

Mutual labels: hadoop

Tf Yarn

Train TensorFlow models on YARN in just a few lines of code!

Stars: ✭ 76 (-65.14%)

Mutual labels: hadoop

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (-41.28%)

Mutual labels: hadoop

Docker Hadoop

Apache Hadoop docker image

Stars: ✭ 1,190 (+445.87%)

Mutual labels: hadoop

Mprove

Open source Business Intelligence tool 🎉

Stars: ✭ 212 (-2.75%)

Mutual labels: bigquery

Hive Funnel Udf

Hive UDFs for funnel analysis

Stars: ✭ 72 (-66.97%)

Mutual labels: hadoop

Parquet4s

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

Stars: ✭ 125 (-42.66%)

Mutual labels: hadoop

Sql Runner

Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake

Stars: ✭ 68 (-68.81%)

Mutual labels: bigquery

Gpt2 Bert Reddit Bot

a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models

Stars: ✭ 158 (-27.52%)

Mutual labels: bigquery

Src

A light-weight distributed stream computing framework for Golang

Stars: ✭ 67 (-69.27%)

Mutual labels: hadoop

Mais

Universalizando o acesso a dados no Brasil. Docs: https://basedosdados.github.io/mais/

Stars: ✭ 122 (-44.04%)

Mutual labels: bigquery

Jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

Stars: ✭ 64 (-70.64%)

Mutual labels: hadoop

Nutch

Apache Nutch is an extensible and scalable web crawler

Stars: ✭ 2,277 (+944.5%)

Mutual labels: hadoop

Likelike

An implementation of locality sensitive hashing with Hadoop

Stars: ✭ 58 (-73.39%)

Mutual labels: hadoop

Professional Services

Common solutions and tools developed by Google Cloud's Professional Services team

Stars: ✭ 1,923 (+782.11%)

Mutual labels: bigquery

Docker Hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

Stars: ✭ 54 (-75.23%)

Mutual labels: hadoop

Hadoop Common

Mirror of Apache Hadoop common

Stars: ✭ 155 (-28.9%)

Mutual labels: hadoop

Hadoop Solr

Code to index HDFS to Solr using MapReduce

Stars: ✭ 51 (-76.61%)

Mutual labels: hadoop

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-46.33%)

Mutual labels: hadoop

Moosefs

MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)

Stars: ✭ 1,025 (+370.18%)

Mutual labels: hadoop

Calcite

Apache Calcite

Stars: ✭ 2,816 (+1191.74%)

Mutual labels: hadoop

Nagios Plugins

450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Stars: ✭ 1,000 (+358.72%)

Mutual labels: hadoop

Ibis

A pandas-like deferred expression system, with first-class SQL support

Stars: ✭ 1,630 (+647.71%)

Mutual labels: hadoop

Learning Spark

零基础学习spark，大数据学习

Stars: ✭ 37 (-83.03%)

Mutual labels: hadoop

Movie recommend

基于Spark的电影推荐系统，包含爬虫项目、web网站、后台管理系统以及spark推荐系统

Stars: ✭ 2,092 (+859.63%)

Mutual labels: hadoop

Ethereum Etl

Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ

Stars: ✭ 956 (+338.53%)

Mutual labels: bigquery

Datax

DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server

Stars: ✭ 116 (-46.79%)

Mutual labels: hadoop

Pg2bq

Export PostgreSQL tables to Google BigQuery

Stars: ✭ 30 (-86.24%)

Mutual labels: bigquery

Bigquery Grafana

Google BigQuery Datasource Plugin for Grafana.

Stars: ✭ 188 (-13.76%)

Mutual labels: bigquery

Storm Camel Example

Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.

Stars: ✭ 28 (-87.16%)

Mutual labels: hadoop

Tensorflowonyarn

Support TensorFlow on YARN

Stars: ✭ 114 (-47.71%)

Mutual labels: hadoop

Interview Questions Collection

按知识领域整理面试题，包括C++、Java、Hadoop、机器学习等

Stars: ✭ 21 (-90.37%)

Mutual labels: hadoop

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-31.19%)

Mutual labels: hadoop

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (+293.12%)

Mutual labels: hadoop

Xlearning Xdml

extremely distributed machine learning

Stars: ✭ 113 (-48.17%)

Mutual labels: hadoop

Hadoop Pot

A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.

Stars: ✭ 8 (-96.33%)

Mutual labels: hadoop

Javaorbigdata Interview

Java开发者或者大数据开发者面试知识点整理

Stars: ✭ 203 (-6.88%)

Mutual labels: hadoop

Kylo

Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.

Stars: ✭ 916 (+320.18%)

Mutual labels: hadoop

Introtohadoopandmr udacity course

🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"

Stars: ✭ 110 (-49.54%)

Mutual labels: hadoop

Dataflow Tutorial

Cloud Dataflow Tutorial for Beginners

Stars: ✭ 17 (-92.2%)

Mutual labels: bigquery

Parquet Rs

Apache Parquet implementation in Rust

Stars: ✭ 144 (-33.94%)

Mutual labels: hadoop

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-97.71%)

Mutual labels: hadoop

Haproxy Configs

80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.

Stars: ✭ 106 (-51.38%)

Mutual labels: hadoop

Winutils

winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows

Stars: ✭ 657 (+201.38%)

Mutual labels: hadoop

Scio

A Scala API for Apache Beam and Google Cloud Dataflow.

Stars: ✭ 2,247 (+930.73%)

Mutual labels: bigquery

Embulk Output Bigquery

Embulk output plugin to load/insert data into Google BigQuery

Stars: ✭ 99 (-54.59%)

Mutual labels: bigquery

Gcp Variant Transforms

GCP Variant Transforms

Stars: ✭ 100 (-54.13%)

Mutual labels: bigquery

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (-1.38%)

Mutual labels: hadoop

Facebook Hive Udfs

Facebook's Hive UDFs

Stars: ✭ 213 (-2.29%)

Mutual labels: hadoop

Recommendsys

推荐项目（实时推荐和离线推荐）

Stars: ✭ 198 (-9.17%)

Mutual labels: hadoop

Bigdata Playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-18.81%)

Mutual labels: hadoop

Hbaseclient

HBase客户端数据管理软件

Stars: ✭ 135 (-38.07%)

Mutual labels: hadoop

61-120 of 323 similar projects

‹

›