Analytics ZooDistributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+992.86%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-32.14%)
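Technology TalkA collection of knowledge on the Java ecosystem: common technical frameworks, open-source middleware, system architecture, databases, architecture case studies from large companies, common third-party libraries, project management, production troubleshooting, personal growth, and reflections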
Stars: ✭ 12,136 (+5317.86%)
AlbedoA recommender system for discovering GitHub repos, built with Apache Spark
Stars: ✭ 149 (-33.48%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+915.18%)
Kraps RpcAn RPC framework leveraging the Spark RPC module
Stars: ✭ 175 (-21.87%)
ParquetviewerSimple Windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-35.27%)
Spark ExcelA Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-3.57%)
ScannsA scalable nearest neighbor search library in Apache Spark
Stars: ✭ 190 (-15.18%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+696.88%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1024.11%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-35.71%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+677.68%)
React WorkshopA step-by-step workshop for learning React fundamentals while building an app
Stars: ✭ 171 (-23.66%)
Org Mode WorkshopWorkshop for Org-mode with focus on todo-, project- and workflow-management
Stars: ✭ 141 (-37.05%)
Scalable Data ScienceScalable Data Science: course sets in big data using Apache Spark on Databricks, and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (-36.61%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-8.48%)
Container.trainingSlides and code samples for training, tutorials, and workshops about Docker, containers, and Kubernetes.
Stars: ✭ 2,377 (+961.16%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+5380.8%)
Spark AuthorizerA Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-37.05%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-36.61%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+830.36%)
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-37.95%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (-16.52%)
Sparkling GraphSparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-37.95%)
Isolation ForestA Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-37.95%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-3.57%)
AzuredatabricksbestpracticesVersion 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (-16.96%)
GeopysparkGeoTrellis for PySpark
Stars: ✭ 167 (-25.45%)
QuicksqlA Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+712.95%)
Dl Keras Tfrstudio::conf(2020) deep learning workshop
Stars: ✭ 137 (-38.84%)
Apache Spark NodeNode.js bindings for Apache Spark DataFrame APIs
Stars: ✭ 136 (-39.29%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+5231.7%)
Big WhaleScheduling of offline jobs such as Spark and Flink, and monitoring of real-time jobs
Stars: ✭ 163 (-27.23%)
Bcs workshop apr 20Workshop on basic machine learning, computational modeling, psychophysics, basic data analysis and experiment design
Stars: ✭ 134 (-40.18%)
React WorkshopThe course material for our React Hooks workshop
Stars: ✭ 184 (-17.86%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-41.96%)
OpaqueAn encrypted data analytics platform
Stars: ✭ 129 (-42.41%)
WorkshopDocker, Kubernetes and Gravity Trainings by Gravitational
Stars: ✭ 1,963 (+776.34%)
Spylon KernelJupyter kernel for Scala and Spark
Stars: ✭ 129 (-42.41%)
Kotlin Spark ApiThis project provides Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x.
Stars: ✭ 183 (-18.3%)
Spark Atlas ConnectorA Spark Atlas connector to track data lineage in Apache Atlas
Stars: ✭ 160 (-28.57%)