All Projects → Mobius → Similar Projects or Alternatives

1522 Open source projects that are alternatives of or similar to Mobius

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (-84.93%)

Mutual labels: spark, bigdata, apache-spark, spark-streaming, streaming

Spark

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+85.25%)

Mutual labels: spark, bigdata, apache-spark, spark-streaming, streaming

Data Accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Stars: ✭ 247 (-73.41%)

Mutual labels: spark, apache-spark, spark-streaming, streaming

Cdap

An open source framework for building data analytic applications.

Stars: ✭ 509 (-45.21%)

Mutual labels: dataset, spark, spark-streaming, mapreduce

Dpark

Python clone of Spark, a MapReduce alike framework in Python

Stars: ✭ 2,668 (+187.19%)

Mutual labels: spark, bigdata, mapreduce

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (-76.86%)

Mutual labels: spark, bigdata, apache-spark

Real Time Stream Processing Engine

This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.

Stars: ✭ 37 (-96.02%)

Mutual labels: spark, apache-spark, spark-streaming

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (-88.05%)

Mutual labels: spark, apache-spark, dataframe

Spark Streaming Monitoring With Lightning

Plot live-stats as graph from ApacheSpark application using Lightning-viz

Stars: ✭ 15 (-98.39%)

Mutual labels: bigdata, apache-spark, spark-streaming

qs-hadoop

大数据生态圈学习

Stars: ✭ 18 (-98.06%)

Mutual labels: bigdata, spark-streaming, mapreduce

Bigdata Notes

大数据入门指南 ⭐

Stars: ✭ 10,991 (+1083.1%)

Mutual labels: spark, bigdata, mapreduce

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (-7.75%)

Mutual labels: spark, bigdata, mapreduce

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-92.36%)

Mutual labels: spark, bigdata, mapreduce

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-83.85%)

Mutual labels: dataframe, spark, apache-spark

Coolplayspark

酷玩 Spark: Spark 源代码解析、Spark 类库等

Stars: ✭ 3,318 (+257.16%)

Mutual labels: spark, apache-spark, spark-streaming

leaflet heatmap

简单的可视化湖州通话数据假设数据量很大，没法用浏览器直接绘制热力图，把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后，再使用Apache Spark绘制热力图，然后用leafletjs加载OpenStreetMap图层和热力图图层，以达到良好的交互效果。现在使用Apache Spark实现绘制，可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法，并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .

Stars: ✭ 13 (-98.6%)

Mutual labels: spark, apache-spark, bigdata

Spark States

Custom state store providers for Apache Spark

Stars: ✭ 83 (-91.07%)

Mutual labels: spark, apache-spark, spark-streaming

Bigdata Notebook

Stars: ✭ 100 (-89.24%)

Mutual labels: spark, bigdata, streaming

Sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

Stars: ✭ 513 (-44.78%)

Mutual labels: spark, spark-streaming, streaming

Splash

Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange

Stars: ✭ 105 (-88.7%)

Mutual labels: spark, bigdata, apache-spark

Whylogs Java

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 164 (-82.35%)

Mutual labels: dataset, spark, apache-spark

Streaming Readings

Streaming System 相关的论文读物

Stars: ✭ 554 (-40.37%)

Mutual labels: apache-spark, spark-streaming, streaming

SparkTwitterAnalysis

An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.

Stars: ✭ 29 (-96.88%)

Mutual labels: apache-spark, bigdata

spark-utils

Basic framework utilities to quickly start writing production ready Apache Spark applications

Stars: ✭ 25 (-97.31%)

Mutual labels: apache-spark, spark-streaming

Goodreads etl pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Stars: ✭ 793 (-14.64%)

Mutual labels: spark, apache-spark

Spark Redis

A connector for Spark that allows reading and writing to/from Redis cluster

Stars: ✭ 773 (-16.79%)

Mutual labels: dataframe, spark

gan deeplearning4j

Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.

Stars: ✭ 19 (-97.95%)

Mutual labels: apache-spark, bigdata

pulsar-adapters

Apache Pulsar Adapters

Stars: ✭ 18 (-98.06%)

Mutual labels: streaming, apache-spark

big data

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (-96.34%)

Mutual labels: bigdata, mapreduce

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-94.62%)

Mutual labels: spark, bigdata

Yandex Big Data Engineering

Stars: ✭ 17 (-98.17%)

Mutual labels: spark, mapreduce

interview-refresh-java-bigdata

a one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.

Stars: ✭ 25 (-97.31%)

Mutual labels: spark-streaming, mapreduce

pulsar-user-group-loc-cn

Workspace for China local user group.

Stars: ✭ 19 (-97.95%)

Mutual labels: streaming, bigdata

SparkProgrammingInScala

Apache Spark Course Material

Stars: ✭ 57 (-93.86%)

Mutual labels: apache-spark, bigdata

Sparklyr

R interface for Apache Spark

Stars: ✭ 775 (-16.58%)

Mutual labels: spark, apache-spark

pyspark-algorithms

PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2

Stars: ✭ 72 (-92.25%)

Mutual labels: mapreduce, dataframe

yuzhouwan

Code Library for My Blog

Stars: ✭ 39 (-95.8%)

Mutual labels: spark, bigdata

Spark-and-Kafka IoT-Data-Processing-and-Analytics

Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time

Stars: ✭ 42 (-95.48%)

Mutual labels: bigdata, spark-streaming

spark-gradle-template

Apache Spark in your IDE with gradle

Stars: ✭ 39 (-95.8%)

Mutual labels: spark, apache-spark

Angel

A Flexible and Powerful Parameter Server for large-scale machine learning

Stars: ✭ 6,458 (+595.16%)

Mutual labels: spark, spark-streaming

data-algorithms-with-spark

O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Stars: ✭ 34 (-96.34%)

Mutual labels: spark, mapreduce

Big Data Rosetta Code

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Stars: ✭ 254 (-72.66%)

Mutual labels: spark, bigdata

spark-structured-streaming-examples

Spark structured streaming examples with using of version 3.0.0

Stars: ✭ 23 (-97.52%)

Mutual labels: spark, apache-spark

Coding Now

学习记录的一些笔记，以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等

Stars: ✭ 750 (-19.27%)

Mutual labels: spark, bigdata

Docker Spark Cluster

A simple spark standalone cluster for your testing environment purposses

Stars: ✭ 261 (-71.91%)

Mutual labels: spark, bigdata

lectures-hse-spark

Масштабируемое машинное обучение и анализ больших данных с Apache Spark

Stars: ✭ 20 (-97.85%)

Mutual labels: bigdata, mapreduce

connected-component

Map Reduce Implementation of Connected Component on Apache Spark

Stars: ✭ 68 (-92.68%)

Mutual labels: apache-spark, mapreduce

kafka-spark-streaming-zeppelin-docker

One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)

Stars: ✭ 82 (-91.17%)

Mutual labels: streaming, spark

Spark Jupyter Aws

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

Stars: ✭ 259 (-72.12%)

Mutual labels: spark, apache-spark

Spark Movie Lens

An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset

Stars: ✭ 745 (-19.81%)

Mutual labels: spark, bigdata

Learningsparkv2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Stars: ✭ 307 (-66.95%)

Mutual labels: spark, apache-spark

Wirbelsturm

Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.

Stars: ✭ 332 (-64.26%)

Mutual labels: spark, apache-spark

Sparkmeasure

This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.

Stars: ✭ 368 (-60.39%)

Mutual labels: spark, apache-spark

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (-67.06%)

Mutual labels: spark, bigdata

Sidekick

High Performance HTTP Sidecar Load Balancer

Stars: ✭ 366 (-60.6%)

Mutual labels: spark, bigdata

Spark Structured Streaming Book

The Internals of Spark Structured Streaming

Stars: ✭ 371 (-60.06%)

Mutual labels: spark, apache-spark

Sparkle

Haskell on Apache Spark.

Stars: ✭ 419 (-54.9%)

Mutual labels: spark, apache-spark

Agile data code 2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition

Stars: ✭ 413 (-55.54%)

Mutual labels: spark, apache-spark

Kafka Storm Starter

Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

Stars: ✭ 728 (-21.64%)

Mutual labels: spark, apache-spark

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

Stars: ✭ 427 (-54.04%)

Mutual labels: dataset, streaming

1-60 of 1522 similar projects

›

next*5