1522 Open source projects that are alternatives of or similar to Mobius

data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-94.62%)
Mutual labels:  spark, bigdata
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-92.03%)
Mutual labels:  spark, bigdata
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-94.08%)
Mutual labels:  spark, apache-spark
Repository
Personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-90.1%)
Mutual labels:  spark, mapreduce
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-90.74%)
Mutual labels:  spark, apache-spark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+91.6%)
Mutual labels:  spark, apache-spark
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-88.7%)
Mutual labels:  spark, bigdata
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-88.05%)
Mutual labels:  spark, bigdata
Spark-and-Kafka IoT-Data-Processing-and-Analytics
Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S. nationwide temperature from IoT sensors in real time
Stars: ✭ 42 (-95.48%)
Mutual labels:  bigdata, spark-streaming
Teddy
Spark Streaming monitoring platform with support for job deployment, alerting, and automatic startup
Stars: ✭ 120 (-87.08%)
Mutual labels:  spark, streaming
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-87.08%)
Mutual labels:  spark, spark-streaming
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (-96.34%)
Mutual labels:  spark, mapreduce
Kafka Storm Starter
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (-21.64%)
Mutual labels:  spark, apache-spark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-82.24%)
Mutual labels:  spark, apache-spark
Pyspark Learning
Updated repository
Stars: ✭ 147 (-84.18%)
Mutual labels:  spark, spark-streaming
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+212.06%)
Mutual labels:  spark, apache-spark
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-72.12%)
Mutual labels:  spark, apache-spark
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-76.75%)
Mutual labels:  spark, spark-streaming
connected-component
Map Reduce Implementation of Connected Component on Apache Spark
Stars: ✭ 68 (-92.68%)
Mutual labels:  apache-spark, mapreduce
kafka-spark-streaming-zeppelin-docker
One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)
Stars: ✭ 82 (-91.17%)
Mutual labels:  streaming, spark
Docker Spark Cluster
A simple Spark standalone cluster for your testing environment
Stars: ✭ 261 (-71.91%)
Mutual labels:  spark, bigdata
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+231.65%)
Mutual labels:  spark, apache-spark
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (-74.81%)
Mutual labels:  spark, apache-spark
Ecommercerecommendsystem
Real-time big data product recommendation system. Frontend: Vue + TypeScript + ElementUI; backend: Spring + Spark
Stars: ✭ 139 (-85.04%)
Mutual labels:  spark, bigdata
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-98.49%)
Mutual labels:  dataset, spark
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-73.2%)
Mutual labels:  spark, bigdata
Ballista
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+144.78%)
Mutual labels:  dataframe, spark
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-83.64%)
Mutual labels:  dataframe, spark
Sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (-60.39%)
Mutual labels:  spark, apache-spark
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-92.25%)
Mutual labels:  mapreduce, dataframe
lectures-hse-spark
Scalable machine learning and big data analysis with Apache Spark
Stars: ✭ 20 (-97.85%)
Mutual labels:  bigdata, mapreduce
learning-hadoop-and-spark
Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning
Stars: ✭ 146 (-84.28%)
Mutual labels:  apache-spark, mapreduce
Spark Structured Streaming Book
The Internals of Spark Structured Streaming
Stars: ✭ 371 (-60.06%)
Mutual labels:  spark, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-55.54%)
Mutual labels:  spark, apache-spark
pulsar-adapters
Apache Pulsar Adapters
Stars: ✭ 18 (-98.06%)
Mutual labels:  streaming, apache-spark
spark-utils
Basic framework utilities to quickly start writing production ready Apache Spark applications
Stars: ✭ 25 (-97.31%)
Mutual labels:  apache-spark, spark-streaming
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-96.34%)
Mutual labels:  bigdata, mapreduce
bigdata-doc
Big data study notes, learning roadmap, and a collection of technical case studies.
Stars: ✭ 37 (-96.02%)
Mutual labels:  bigdata, mapreduce
yuzhouwan
Code Library for My Blog
Stars: ✭ 39 (-95.8%)
Mutual labels:  spark, bigdata
spark-gradle-template
Apache Spark in your IDE with gradle
Stars: ✭ 39 (-95.8%)
Mutual labels:  spark, apache-spark
Vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+631.22%)
Mutual labels:  dataframe, bigdata
Big Data Rosetta Code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (-72.66%)
Mutual labels:  spark, bigdata
spark-structured-streaming-examples
Spark Structured Streaming examples using version 3.0.0
Stars: ✭ 23 (-97.52%)
Mutual labels:  spark, apache-spark
Sparkflow
Easy to use library to bring Tensorflow on Apache Spark
Stars: ✭ 282 (-69.64%)
Mutual labels:  dataframe, apache-spark
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-96.99%)
Mutual labels:  apache-spark, dataframe
Sidekick
High Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (-60.6%)
Mutual labels:  spark, bigdata
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-64.26%)
Mutual labels:  spark, apache-spark
Big data architect skills
Skills a big data architect should master
Stars: ✭ 400 (-56.94%)
Mutual labels:  spark, bigdata
Sparkle
Haskell on Apache Spark.
Stars: ✭ 419 (-54.9%)
Mutual labels:  spark, apache-spark
God Of Bigdata
Focused on big data learning and interviews; start your path to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+546.72%)
Mutual labels:  spark, bigdata
Bigdataguide
Big data learning from scratch, including study videos for each stage of learning and interview materials
Stars: ✭ 817 (-12.06%)
Mutual labels:  spark, bigdata
Io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
Stars: ✭ 427 (-54.04%)
Mutual labels:  dataset, streaming
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (-54.68%)
Mutual labels:  spark, spark-streaming
Bigdataie
A collection of big data blog posts, written test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (-52.1%)
Mutual labels:  spark, bigdata
Learningsparkv2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (-66.95%)
Mutual labels:  spark, apache-spark
Bdp Dataplatform
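Data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computation, development, and application building.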
Stars: ✭ 456 (-50.91%)
Mutual labels:  spark, mapreduce
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-49.52%)
Mutual labels:  bigdata, mapreduce
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-95.16%)
Mutual labels:  spark, apache-spark
Spark As Service Using Embedded Server
This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server
Stars: ✭ 46 (-95.05%)
Mutual labels:  spark, apache-spark
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-96.34%)
Mutual labels:  bigdata, spark-streaming