1206 open source projects that are alternatives to or similar to Distributed Dataset

Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (+617.59%)
Mutual labels:  spark, distributed
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-53.7%)
Mutual labels:  spark, data-processing
Ruby Spark
Ruby wrapper for Apache Spark
Stars: ✭ 221 (+104.63%)
Mutual labels:  spark, distributed
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (+26.85%)
Mutual labels:  aws-lambda, spark
Js Spark
Realtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (+73.15%)
Mutual labels:  spark, distributed
Ytk Learn
Ytk-learn is a distributed machine learning library that implements most popular machine learning algorithms (GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (+212.04%)
Mutual labels:  spark, distributed
Xlearning Xdml
extremely distributed machine learning
Stars: ✭ 113 (+4.63%)
Mutual labels:  spark, distributed
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-50%)
Mutual labels:  spark, data-processing
Ballista
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+2005.56%)
Mutual labels:  spark, distributed
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+5137.04%)
Mutual labels:  spark, distributed
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-49.07%)
Mutual labels:  spark, data-processing
Hark Lang
Build stateful and portable serverless applications without thinking about infrastructure.
Stars: ✭ 103 (-4.63%)
Mutual labels:  aws-lambda
Serverless Chat
A serverless web chat built using AWS Lambda, AWS IoT (for WebSockets) and Amazon DynamoDB
Stars: ✭ 99 (-8.33%)
Mutual labels:  aws-lambda
Awslambdaface
Perform deep neural network based face detection and recognition in the cloud (via AWS lambda) with zero model configuration or tuning.
Stars: ✭ 98 (-9.26%)
Mutual labels:  aws-lambda
Node Athena
A simple Node.js client for AWS Athena
Stars: ✭ 97 (-10.19%)
Mutual labels:  aws-lambda
Ipfs.ink
PROJECT HAS BEEN SHUT DOWN - Publish and render markdown essays to and from IPFS
Stars: ✭ 106 (-1.85%)
Mutual labels:  distributed
Serverless Sharp
Serverless image optimizer for S3, Lambda, and Cloudfront
Stars: ✭ 102 (-5.56%)
Mutual labels:  aws-lambda
Turms
The world's most advanced open source instant messaging engine for 100K~10M concurrent users https://turms-im.github.io/docs
Stars: ✭ 97 (-10.19%)
Mutual labels:  distributed
Telegram Bot
Telegram Bot using AWS API Gateway and AWS Lambda
Stars: ✭ 96 (-11.11%)
Mutual labels:  aws-lambda
Bojack
🐴 The unreliable key-value store
Stars: ✭ 101 (-6.48%)
Mutual labels:  distributed
Scaleable Crawler With Docker Cluster
A scalable and efficient crawler with a Docker cluster; crawls a million pages in 2 hours on a single machine
Stars: ✭ 96 (-11.11%)
Mutual labels:  distributed
Kinesis Streams Fan Out Kinesis Analytics
Amazon Kinesis Streams fan-out via Kinesis Analytics (powered by the Serverless Framework)
Stars: ✭ 95 (-12.04%)
Mutual labels:  aws-lambda
Micro
Micro is a distributed cloud operating system
Stars: ✭ 10,778 (+9879.63%)
Mutual labels:  distributed
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1548.15%)
Mutual labels:  spark
Bigdata Notebook
Stars: ✭ 100 (-7.41%)
Mutual labels:  spark
Machine Learning For Solar Energy Prediction
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Stars: ✭ 94 (-12.96%)
Mutual labels:  data-processing
Bash Oneliner
A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
Stars: ✭ 1,359 (+1158.33%)
Mutual labels:  data-processing
Smart Security Camera
A Pi Zero and Motion based webcamera that forwards images to Amazon Web Services for Image Processing
Stars: ✭ 103 (-4.63%)
Mutual labels:  aws-lambda
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+1153.7%)
Mutual labels:  spark
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-0.93%)
Mutual labels:  spark
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-10.19%)
Mutual labels:  spark
Cloud Game
Web-based Cloud Gaming service for Retro Game
Stars: ✭ 1,374 (+1172.22%)
Mutual labels:  distributed
Schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-10.19%)
Mutual labels:  spark
Serverless
⚡ Serverless Framework – Build web, mobile and IoT applications with serverless architectures using AWS Lambda, Azure Functions, Google CloudFunctions & more!
Stars: ✭ 41,584 (+38403.7%)
Mutual labels:  aws-lambda
Lambroll
lambroll is a minimal deployment tool for AWS Lambda.
Stars: ✭ 97 (-10.19%)
Mutual labels:  aws-lambda
Spark Terasort
Spark Terasort
Stars: ✭ 101 (-6.48%)
Mutual labels:  spark
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-11.11%)
Mutual labels:  spark
Serverless Image Processor
AWS Lambda image processor
Stars: ✭ 106 (-1.85%)
Mutual labels:  aws-lambda
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1138.89%)
Mutual labels:  spark
Spark Ffm
FFM (Field-aware Factorization Machine) on Spark
Stars: ✭ 101 (-6.48%)
Mutual labels:  spark
Toydb
Distributed SQL database in Rust, written as a learning project
Stars: ✭ 1,329 (+1130.56%)
Mutual labels:  distributed
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (+0%)
Mutual labels:  spark
Dandelion
a diaspora* client for Android
Stars: ✭ 100 (-7.41%)
Mutual labels:  distributed
Repository
A personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-14.81%)
Mutual labels:  spark
Lambstatus
[Maintenance mode] Serverless Status Page System
Stars: ✭ 1,323 (+1125%)
Mutual labels:  aws-lambda
Raspberry Pi Dramble
Raspberry Pi Kubernetes cluster that runs HA/HP Drupal 8
Stars: ✭ 1,317 (+1119.44%)
Mutual labels:  distributed
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-2.78%)
Mutual labels:  spark
Foundatio
Pluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+1163.89%)
Mutual labels:  distributed
Jrestless
Run JAX-RS applications on AWS Lambda using Jersey. Supports Spring 4.x. The serverless framework can be used for deployment.
Stars: ✭ 93 (-13.89%)
Mutual labels:  aws-lambda
Lambda Phantom Scraper
PhantomJS/Node.js web scraper for AWS Lambda
Stars: ✭ 93 (-13.89%)
Mutual labels:  aws-lambda
Lambdauth
A sample authentication service implemented with a server-less architecture, using AWS Lambda to host and execute the code and Amazon DynamoDB as persistent storage. This provides a cost-efficient solution that is scalable and highly available and can be used with Amazon Cognito for Developer Authenticated Identities.
Stars: ✭ 1,365 (+1163.89%)
Mutual labels:  aws-lambda
React Apig Lambda
Render React.js on-demand with CDN caching
Stars: ✭ 93 (-13.89%)
Mutual labels:  aws-lambda
Spark Summit 2017 Sanfrancisco
spark summit 2017 SanFrancisco
Stars: ✭ 93 (-13.89%)
Mutual labels:  spark
Flink Learning
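Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization".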
Stars: ✭ 11,378 (+10435.19%)
Mutual labels:  spark
Bonobo
Extract Transform Load for Python 3.5+
Stars: ✭ 1,475 (+1265.74%)
Mutual labels:  data-processing
Ask Cli
Alexa Skills Kit Command Line Interface
Stars: ✭ 100 (-7.41%)
Mutual labels:  aws-lambda
Big Data
🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-13.89%)
Mutual labels:  spark
Broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Stars: ✭ 1,310 (+1112.96%)
Mutual labels:  data-processing
Fas
C Pixels-based graphical audio synthesizer implemented as a WebSocket server
Stars: ✭ 100 (-7.41%)
Mutual labels:  distributed
Hazelcast Python Client
Hazelcast IMDG Python Client
Stars: ✭ 92 (-14.81%)
Mutual labels:  distributed