All Projects → Hadoop Connectors → Similar Projects or Alternatives

323 Open source projects that are alternatives of or similar to Hadoop Connectors

Bitcoin Etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (-20.18%)
Mutual labels:  bigquery
Ethereum Etl Airflow
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. What datasets do you want to be added to Ethereum ETL? Vote here: https://blockchain-etl.convas.io.
Stars: ✭ 89 (-59.17%)
Mutual labels:  bigquery
Calcite Avatica
Mirror of Apache Calcite - Avatica
Stars: ✭ 130 (-40.37%)
Mutual labels:  hadoop
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-62.39%)
Mutual labels:  hadoop
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (-9.63%)
Mutual labels:  hadoop
Camus
Mirror of Linkedin's Camus
Stars: ✭ 81 (-62.84%)
Mutual labels:  hadoop
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-41.28%)
Mutual labels:  hadoop
Docker Spark
🚢 Docker image for Apache Spark
Stars: ✭ 78 (-64.22%)
Mutual labels:  hadoop
Big Whale
Spark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-25.23%)
Mutual labels:  hadoop
Tf Yarn
Train TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (-65.14%)
Mutual labels:  hadoop
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-41.28%)
Mutual labels:  hadoop
Docker Hadoop
Apache Hadoop docker image
Stars: ✭ 1,190 (+445.87%)
Mutual labels:  hadoop
Mprove
Open source Business Intelligence tool 🎉
Stars: ✭ 212 (-2.75%)
Mutual labels:  bigquery
Hive Funnel Udf
Hive UDFs for funnel analysis
Stars: ✭ 72 (-66.97%)
Mutual labels:  hadoop
Parquet4s
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-42.66%)
Mutual labels:  hadoop
Sql Runner
Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake
Stars: ✭ 68 (-68.81%)
Mutual labels:  bigquery
Gpt2 Bert Reddit Bot
a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models
Stars: ✭ 158 (-27.52%)
Mutual labels:  bigquery
Src
A light-weight distributed stream computing framework for Golang
Stars: ✭ 67 (-69.27%)
Mutual labels:  hadoop
Mais
Universalizando o acesso a dados no Brasil. Docs: https://basedosdados.github.io/mais/
Stars: ✭ 122 (-44.04%)
Mutual labels:  bigquery
Jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-70.64%)
Mutual labels:  hadoop
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+944.5%)
Mutual labels:  hadoop
Likelike
An implementation of locality sensitive hashing with Hadoop
Stars: ✭ 58 (-73.39%)
Mutual labels:  hadoop
Professional Services
Common solutions and tools developed by Google Cloud's Professional Services team
Stars: ✭ 1,923 (+782.11%)
Mutual labels:  bigquery
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-75.23%)
Mutual labels:  hadoop
Hadoop Common
Mirror of Apache Hadoop common
Stars: ✭ 155 (-28.9%)
Mutual labels:  hadoop
Hadoop Solr
Code to index HDFS to Solr using MapReduce
Stars: ✭ 51 (-76.61%)
Mutual labels:  hadoop
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-46.33%)
Mutual labels:  hadoop
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+370.18%)
Mutual labels:  hadoop
Calcite
Apache Calcite
Stars: ✭ 2,816 (+1191.74%)
Mutual labels:  hadoop
Nagios Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+358.72%)
Mutual labels:  hadoop
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+647.71%)
Mutual labels:  hadoop
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-83.03%)
Mutual labels:  hadoop
Movie recommend
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+859.63%)
Mutual labels:  hadoop
Ethereum Etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 956 (+338.53%)
Mutual labels:  bigquery
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-46.79%)
Mutual labels:  hadoop
Pg2bq
Export PostgreSQL tables to Google BigQuery
Stars: ✭ 30 (-86.24%)
Mutual labels:  bigquery
Bigquery Grafana
Google BigQuery Datasource Plugin for Grafana.
Stars: ✭ 188 (-13.76%)
Mutual labels:  bigquery
Storm Camel Example
Real-time analysis and visualization with Storm-AMQ-Camel-Websockets-Highcharts integration.
Stars: ✭ 28 (-87.16%)
Mutual labels:  hadoop
Tensorflowonyarn
Support TensorFlow on YARN
Stars: ✭ 114 (-47.71%)
Mutual labels:  hadoop
Interview Questions Collection
按知识领域整理面试题,包括C++、Java、Hadoop、机器学习等
Stars: ✭ 21 (-90.37%)
Mutual labels:  hadoop
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-31.19%)
Mutual labels:  hadoop
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+293.12%)
Mutual labels:  hadoop
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-48.17%)
Mutual labels:  hadoop
Hadoop Pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
Stars: ✭ 8 (-96.33%)
Mutual labels:  hadoop
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (-6.88%)
Mutual labels:  hadoop
Kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+320.18%)
Mutual labels:  hadoop
Introtohadoopandmr udacity course
🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"
Stars: ✭ 110 (-49.54%)
Mutual labels:  hadoop
Dataflow Tutorial
Cloud Dataflow Tutorial for Beginners
Stars: ✭ 17 (-92.2%)
Mutual labels:  bigquery
Parquet Rs
Apache Parquet implementation in Rust
Stars: ✭ 144 (-33.94%)
Mutual labels:  hadoop
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-97.71%)
Mutual labels:  hadoop
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-51.38%)
Mutual labels:  hadoop
Winutils
winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows
Stars: ✭ 657 (+201.38%)
Mutual labels:  hadoop
Scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+930.73%)
Mutual labels:  bigquery
Embulk Output Bigquery
Embulk output plugin to load/insert data into Google BigQuery
Stars: ✭ 99 (-54.59%)
Mutual labels:  bigquery
Gcp Variant Transforms
GCP Variant Transforms
Stars: ✭ 100 (-54.13%)
Mutual labels:  bigquery
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-1.38%)
Mutual labels:  hadoop
Facebook Hive Udfs
Facebook's Hive UDFs
Stars: ✭ 213 (-2.29%)
Mutual labels:  hadoop
Recommendsys
推荐项目(实时推荐和离线推荐)
Stars: ✭ 198 (-9.17%)
Mutual labels:  hadoop
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-18.81%)
Mutual labels:  hadoop
Hbaseclient
HBase客户端数据管理软件
Stars: ✭ 135 (-38.07%)
Mutual labels:  hadoop
61-120 of 323 similar projects