All Projects → Hops Examples → Similar Projects or Alternatives

6452 Open source projects that are alternatives of or similar to Hops Examples

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (+9.52%)

Mutual labels: spark, flink, hive

深圳地铁大数据客流分析系统🚇🚄🌟

Stars: ✭ 826 (+883.33%)

Mutual labels: spark, flink, hive

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (+50%)

Mutual labels: spark, flink, hive

专注大数据学习面试，大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Stars: ✭ 6,008 (+7052.38%)

Mutual labels: spark, flink, hive

Bdp Dataplatform

大数据生态解决方案数据平台：基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。

Stars: ✭ 456 (+442.86%)

Mutual labels: spark, flink, hive

Big Data Ecosystem Docker

Stars: ✭ 161 (+91.67%)

Mutual labels: jupyter-notebook, spark, hive

大数据学习，从零开始学习大数据，包含大数据学习各阶段学习视频、面试资料

Stars: ✭ 817 (+872.62%)

Mutual labels: spark, flink, hive

A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

Stars: ✭ 1,821 (+2067.86%)

Mutual labels: spark, flink, hive

Dataspherestudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+1322.62%)

Mutual labels: spark, flink, hive

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)

Stars: ✭ 56 (-33.33%)

Mutual labels: hive, flink

基于Spark2.2新闻网大数据实时系统项目

Stars: ✭ 36 (-57.14%)

Mutual labels: spark, hive

Spark Jupyter Aws

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

Stars: ✭ 259 (+208.33%)

Mutual labels: jupyter-notebook, spark

大数据学习笔记，学习路线，技术案例整理。

Stars: ✭ 37 (-55.95%)

Mutual labels: hive, flink

Easy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Pig, Flink, Beam, Storm, Drill, ...

Stars: ✭ 139 (+65.48%)

Mutual labels: hive, flink

ACID Data Source for Apache Spark based on Hive ACID

Stars: ✭ 91 (+8.33%)

Mutual labels: spark, hive

基于开源Litemall电商项目的大数据项目，包含前端埋点(openresty+lua)、后端埋点；数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化)，同时也包含了Azkaban的workflow。

Stars: ✭ 36 (-57.14%)

Mutual labels: hive, flink

WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!

Stars: ✭ 372 (+342.86%)

Mutual labels: spark, hive

Enterprise gateway

A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.

Stars: ✭ 412 (+390.48%)

Mutual labels: jupyter-notebook, spark

Agile data code 2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition

Stars: ✭ 413 (+391.67%)

Mutual labels: jupyter-notebook, spark

Web UI for Trino, Presto, Hive, Elasticsearch, SparkSQL

Stars: ✭ 424 (+404.76%)

Mutual labels: spark, hive

The Hunting ELK

Stars: ✭ 3,097 (+3586.9%)

Mutual labels: jupyter-notebook, spark

A Scala feature transformation library for data science and machine learning

Stars: ✭ 420 (+400%)

Mutual labels: spark, flink

Yet Another UserAgent Analyzer

Stars: ✭ 472 (+461.9%)

Mutual labels: flink, hive

Justenoughscalaforspark

A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.

Stars: ✭ 538 (+540.48%)

Mutual labels: jupyter-notebook, spark

Spark Movie Lens

An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset

Stars: ✭ 745 (+786.9%)

Mutual labels: jupyter-notebook, spark

Yandex Big Data Engineering

Stars: ✭ 17 (-79.76%)

Mutual labels: jupyter-notebook, spark

Spark Scala Tutorial

A free tutorial for Apache Spark.

Stars: ✭ 907 (+979.76%)

Mutual labels: jupyter-notebook, spark

TiDB connectors for Flink/Hive/Presto

Stars: ✭ 192 (+128.57%)

Mutual labels: hive, flink

大数据相关内容汇总，包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词：Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse

Stars: ✭ 123 (+46.43%)

Mutual labels: hive, flink

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Stars: ✭ 29 (-65.48%)

Mutual labels: hive, flink

Installations mac ubuntu windows

Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).

Stars: ✭ 231 (+175%)

Mutual labels: jupyter-notebook, spark

Open-source distribute workflow schedule tools, also support streaming task.

Stars: ✭ 35 (-58.33%)

Mutual labels: spark, hive

fastdata-cluster

Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)

Stars: ✭ 20 (-76.19%)

Mutual labels: spark, flink

incubator-linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,459 (+2827.38%)

Mutual labels: spark, hive

Mydatascienceportfolio

Applying Data Science and Machine Learning to Solve Real World Business Problems

Stars: ✭ 227 (+170.24%)

Mutual labels: jupyter-notebook, spark

Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark

Stars: ✭ 363 (+332.14%)

Mutual labels: spark, hive

Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark

Stars: ✭ 303 (+260.71%)

Mutual labels: jupyter-notebook, spark

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-2.38%)

Mutual labels: spark, hive

Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.

Stars: ✭ 278 (+230.95%)

Mutual labels: spark, flink

Moonbox is a DVtaaS (Data Virtualization as a Service) Platform

Stars: ✭ 424 (+404.76%)

Mutual labels: spark, hive

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (+138.1%)

Mutual labels: jupyter-notebook, spark

Jupyter magics and kernels for working with remote Spark clusters

Stars: ✭ 954 (+1035.71%)

Mutual labels: jupyter-notebook, spark

Elasticsearch Spark Recommender

Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch

Stars: ✭ 707 (+741.67%)

Mutual labels: jupyter-notebook, spark

Spark Tdd Example

A simple Spark TDD example

Stars: ✭ 23 (-72.62%)

Mutual labels: jupyter-notebook, spark

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+1073.81%)

Mutual labels: jupyter-notebook, spark

Python Helper library for Jupyter Notebooks

Stars: ✭ 998 (+1088.1%)

Mutual labels: jupyter-notebook, spark

Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

Stars: ✭ 696 (+728.57%)

Mutual labels: spark, hive

Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark

Stars: ✭ 14 (-83.33%)

Mutual labels: jupyter-notebook, spark

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (+920.24%)

Mutual labels: spark, flink

Data Ingestion Platform

Stars: ✭ 39 (-53.57%)

Mutual labels: spark, flink

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Stars: ✭ 5,656 (+6633.33%)

Mutual labels: jupyter-notebook, spark

Data Science Cookbook

🎓 Jupyter notebooks from UFC data science course

Stars: ✭ 60 (-28.57%)

Mutual labels: jupyter-notebook, spark

Pyspark Examples

Code examples on Apache Spark using python

Stars: ✭ 58 (-30.95%)

Mutual labels: jupyter-notebook, spark

Pysparkgeoanalysis

🌐 Interactive Workshop on GeoAnalysis using PySpark

Stars: ✭ 63 (-25%)

Mutual labels: jupyter-notebook, spark

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-15.48%)

Mutual labels: jupyter-notebook, spark

Model Serving Tutorial

Code and presentation for Strata Model Serving tutorial

Stars: ✭ 57 (-32.14%)

Mutual labels: spark, flink

Word2Vec models with Twitter data using Spark. Blog:

Stars: ✭ 64 (-23.81%)

Mutual labels: jupyter-notebook, spark

Luigi Warehouse

A luigi powered analytics / warehouse stack

Stars: ✭ 72 (-14.29%)

Mutual labels: spark, hive

Azure Cosmosdb Spark

Apache Spark Connector for Azure Cosmos DB

Stars: ✭ 165 (+96.43%)

Mutual labels: jupyter-notebook, spark

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Stars: ✭ 5,513 (+6463.1%)

Mutual labels: spark, flink

1-60 of 6452 similar projects