spotify / Big Data Rosetta Code
Licence: apache-2.0
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254
Programming Languages
scala
5932 projects
Projects that are alternatives of or similar to Big Data Rosetta Code
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-50.39%)
Mutual labels: spark, bigdata
Kotlin Spark Api
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-27.95%)
Mutual labels: spark, bigdata
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+577.56%)
Mutual labels: spark, bigdata
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-58.66%)
Mutual labels: spark, bigdata
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-1.97%)
Mutual labels: spark, bigdata
Sparktutorial
Source code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-58.66%)
Mutual labels: spark, bigdata
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-44.88%)
Mutual labels: spark, bigdata
Cleanframes
type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-70.47%)
Mutual labels: spark, bigdata
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+950.39%)
Mutual labels: spark, bigdata
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-15.35%)
Mutual labels: spark, bigdata
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-56.3%)
Mutual labels: spark, bigdata
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+426.77%)
Mutual labels: spark, bigdata
Ecommercerecommendsystem
商品大数据实时推荐系统。前端:Vue + TypeScript + ElementUI,后端 Spring + Spark
Stars: ✭ 139 (-45.28%)
Mutual labels: spark, bigdata
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-72.05%)
Mutual labels: spark, bigdata
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-70.87%)
Mutual labels: spark, bigdata
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-80.31%)
Mutual labels: spark, bigdata
big-data-rosetta-code
Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.
For examples rended side by side with comments see:
http://spotify.github.io/big-data-rosetta-code/
Currently the following are covered:
Topics
- src/main/scala/com/spotify/bdrc/scala Scala tricks for data processing
- src/main/scala/com/spotify/bdrc/pipeline Data pipeline snippets
- src/test/scala/com/spotify/bdrc/testing Examples for pipeline testing
License
Copyright 2016 Spotify AB.
Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].