1035 Open source projects that are alternatives of or similar to Sparkrdma

Awesome Pulsar
A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-73.49%)
Mutual labels:  spark, apache-spark
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-67.91%)
Mutual labels:  big-data, bigdata
Labs
Research on distributed systems
Stars: ✭ 73 (-66.05%)
Mutual labels:  spark, big-data
Ecommercerecommendsystem
Real-time big data product recommendation system. Frontend: Vue + TypeScript + ElementUI; backend: Spring + Spark
Stars: ✭ 139 (-35.35%)
Mutual labels:  spark, bigdata
Presto
The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+5926.51%)
Mutual labels:  big-data, hadoop
v6.dooring.public
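A large-screen data visualization solution that provides a visual editing engine, helping individuals and companies easily build their own large-screen visualization applications.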
Stars: ✭ 323 (+50.23%)
Mutual labels:  big-data, bigdata
swordfish
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-83.72%)
Mutual labels:  spark, hadoop
fastdata-cluster
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-90.7%)
Mutual labels:  spark, hadoop
hadoop-data-ingestion-tool
OLAP and ETL of Big Data
Stars: ✭ 17 (-92.09%)
Mutual labels:  big-data, hadoop
Sparkling Graph
SparklingGraph provides an easy-to-use set of features that let you process large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-35.35%)
Mutual labels:  spark, big-data
Docker Spark
🚢 Docker image for Apache Spark
Stars: ✭ 78 (-63.72%)
Mutual labels:  spark, hadoop
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-63.26%)
Mutual labels:  spark, big-data
spark-util
low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-92.56%)
Mutual labels:  spark, hadoop
flokkr
Documentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (-86.05%)
Mutual labels:  hadoop, bigdata
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-61.86%)
Mutual labels:  spark, hadoop
Spark Website
Apache Spark Website
Stars: ✭ 75 (-65.12%)
Mutual labels:  spark, big-data
Uproot4
ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-62.79%)
Mutual labels:  big-data, bigdata
Kotlin Spark Api
This project provides Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-14.88%)
Mutual labels:  spark, bigdata
Succinct
Enabling queries on compressed data.
Stars: ✭ 257 (+19.53%)
Mutual labels:  spark, big-data
Big Data Rosetta Code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (+18.14%)
Mutual labels:  spark, bigdata
Parquet Dotnet
🏐 Apache Parquet for modern .NET
Stars: ✭ 276 (+28.37%)
Mutual labels:  big-data, apache-spark
spark-structured-streaming-examples
Spark Structured Streaming examples using version 3.0.0
Stars: ✭ 23 (-89.3%)
Mutual labels:  spark, apache-spark
Elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (+38.6%)
Mutual labels:  spark, hadoop
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+1333.02%)
Mutual labels:  spark, apache-spark
Uproot3
ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+45.12%)
Mutual labels:  big-data, bigdata
Learningsparkv2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (+42.79%)
Mutual labels:  spark, apache-spark
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (+54.42%)
Mutual labels:  spark, apache-spark
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+1715.35%)
Mutual labels:  spark, big-data
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+67.91%)
Mutual labels:  spark, big-data
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+68.37%)
Mutual labels:  spark, big-data
Spark Structured Streaming Book
The Internals of Spark Structured Streaming
Stars: ✭ 371 (+72.56%)
Mutual labels:  spark, apache-spark
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+1460.47%)
Mutual labels:  big-data, apache-spark
Iceberg
Iceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+82.79%)
Mutual labels:  spark, hadoop
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-60%)
Mutual labels:  spark, apache-spark
Repository
Personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-57.21%)
Mutual labels:  spark, hadoop
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+88.84%)
Mutual labels:  spark, hadoop
Orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Stars: ✭ 389 (+80.93%)
Mutual labels:  big-data, hadoop
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (+95.35%)
Mutual labels:  spark, big-data
Sparkle
Haskell on Apache Spark.
Stars: ✭ 419 (+94.88%)
Mutual labels:  spark, apache-spark
Cleanframes
type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-65.12%)
Mutual labels:  spark, bigdata
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-61.4%)
Mutual labels:  spark, apache-spark
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-54.88%)
Mutual labels:  spark, big-data
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (+135.81%)
Mutual labels:  spark, big-data
Pdf
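Programming e-books, e-books, programming books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories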
Stars: ✭ 12,009 (+5485.58%)
Mutual labels:  spark, hadoop
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+653.02%)
Mutual labels:  big-data, hadoop
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+185.12%)
Mutual labels:  hadoop, apache-spark
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+727.91%)
Mutual labels:  spark, apache-spark
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-51.16%)
Mutual labels:  spark, bigdata
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-50.23%)
Mutual labels:  big-data, bigdata
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+268.84%)
Mutual labels:  spark, apache-spark
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (-38.6%)
Mutual labels:  spark, hadoop
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-49.3%)
Mutual labels:  big-data, bigdata
Bigdataclass
Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-48.84%)
Mutual labels:  spark, big-data
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+618.14%)
Mutual labels:  big-data, bigdata
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-48.37%)
Mutual labels:  spark, bigdata
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (-47.44%)
Mutual labels:  spark, hadoop
Movies-Analytics-in-Spark-and-Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Stars: ✭ 47 (-78.14%)
Mutual labels:  big-data, hadoop
DaFlow
Apache Spark-based Data Flow (ETL) framework that supports multiple read and write destinations of different types and also supports multiple categories of transformation rules.
Stars: ✭ 24 (-88.84%)
Mutual labels:  apache-spark, hadoop
Data Algorithms Book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+341.4%)
Mutual labels:  spark, hadoop
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+455.81%)
Mutual labels:  spark, hadoop