SparklyrR interface for Apache Spark
Stars: ✭ 775 (+617.59%)
Ruby SparkRuby wrapper for Apache Spark
Stars: ✭ 221 (+104.63%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (+73.15%)
Ytk LearnYtk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (+212.04%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (+4.63%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-50%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+2005.56%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+5137.04%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-49.07%)
Hark LangBuild stateful and portable serverless applications without thinking about infrastructure.
Stars: ✭ 103 (-4.63%)
Serverless ChatA serverless web chat built using AWS Lambda, AWS IoT (for WebSockets) and Amazon DynamoDB
Stars: ✭ 99 (-8.33%)
AwslambdafacePerform deep neural network based face detection and recognition in the cloud (via AWS lambda) with zero model configuration or tuning.
Stars: ✭ 98 (-9.26%)
Node Athenaa nodejs simple aws athena client
Stars: ✭ 97 (-10.19%)
Ipfs.inkPROJECT HAS BEEN SHUTDOWN - Publish and render markdown essays to and from ipfs
Stars: ✭ 106 (-1.85%)
Serverless SharpServerless image optimizer for S3, Lambda, and Cloudfront
Stars: ✭ 102 (-5.56%)
TurmsThe world's most advanced open source instant messaging engine for 100K~10M concurrent users https://turms-im.github.io/docs
Stars: ✭ 97 (-10.19%)
Telegram BotTelegram Bot using AWS API Gateway and AWS Lambda
Stars: ✭ 96 (-11.11%)
Bojack🐴 The unreliable key-value store
Stars: ✭ 101 (-6.48%)
MicroMicro is a distributed cloud operating system
Stars: ✭ 10,778 (+9879.63%)
Spark On K8s OperatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1548.15%)
Bash OnelinerA collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
Stars: ✭ 1,359 (+1158.33%)
Smart Security CameraA Pi Zero and Motion based webcamera that forwards images to Amazon Web Services for Image Processing
Stars: ✭ 103 (-4.63%)
AlmondA Scala kernel for Jupyter
Stars: ✭ 1,354 (+1153.7%)
LogigskA Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-0.93%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-10.19%)
Cloud GameWeb-based Cloud Gaming service for Retro Game
Stars: ✭ 1,374 (+1172.22%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-10.19%)
Serverless⚡ Serverless Framework – Build web, mobile and IoT applications with serverless architectures using AWS Lambda, Azure Functions, Google CloudFunctions & more! –
Stars: ✭ 41,584 (+38403.7%)
Lambrolllambroll is a minimal deployment tool for AWS Lambda.
Stars: ✭ 97 (-10.19%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1138.89%)
Spark FfmFFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-6.48%)
ToydbDistributed SQL database in Rust, written as a learning project
Stars: ✭ 1,329 (+1130.56%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (+0%)
Dandeliona diaspora* client for Android
Stars: ✭ 100 (-7.41%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-14.81%)
Lambstatus[Maintenance mode] Serverless Status Page System
Stars: ✭ 1,323 (+1125%)
Raspberry Pi DrambleRaspberry Pi Kubernetes cluster that runs HA/HP Drupal 8
Stars: ✭ 1,317 (+1119.44%)
SparktutorialSource code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-2.78%)
FoundatioPluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+1163.89%)
JrestlessRun JAX-RS applications on AWS Lambda using Jersey. Supports Spring 4.x. The serverless framework can be used for deployment.
Stars: ✭ 93 (-13.89%)
LambdauthA sample authentication service implemented with a server-less architecture, using AWS Lambda to host and execute the code and Amazon DynamoDB as persistent storage. This provides a cost-efficient solution that is scalable and highly available and can be used with Amazon Cognito for Developer Authenticated Identities.
Stars: ✭ 1,365 (+1163.89%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+10435.19%)
BonoboExtract Transform Load for Python 3.5+
Stars: ✭ 1,475 (+1265.74%)
Ask CliAlexa Skills Kit Command Line Interface
Stars: ✭ 100 (-7.41%)
Big Data🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-13.89%)
BroadwayConcurrent and multi-stage data ingestion and data processing with Elixir
Stars: ✭ 1,310 (+1112.96%)
FasC Pixels-based graphical audio synthesizer implemented as a WebSocket server
Stars: ✭ 100 (-7.41%)