All Projects → dlsa → Similar Projects or Alternatives

266 Open source projects that are alternatives of or similar to dlsa

data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (+36%)
Mutual labels:  pyspark, spark-ml
ai-deployment
关注AI模型上线、模型部署
Stars: ✭ 149 (+496%)
Mutual labels:  pyspark, spark-ml
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (+12%)
Mutual labels:  pyspark, spark-ml
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+9972%)
Mutual labels:  pyspark, spark-ml
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-32%)
Mutual labels:  pyspark, spark-ml
Tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Stars: ✭ 274 (+996%)
Mutual labels:  distributed-computing, pyspark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+500%)
Mutual labels:  distributed-computing, pyspark
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+188%)
Mutual labels:  distributed-computing, pyspark
jessica
Jessica - Jessie (secure distributed Javascript) Compiler Architecture
Stars: ✭ 27 (+8%)
Mutual labels:  distributed-computing
hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
Stars: ✭ 48 (+92%)
Mutual labels:  distributed-computing
high-assurance-legacy
Legacy code connected to the high-assurance implementation of the Ouroboros protocol family
Stars: ✭ 81 (+224%)
Mutual labels:  distributed-computing
lazycluster
🎛 Distributed machine learning made simple.
Stars: ✭ 43 (+72%)
Mutual labels:  distributed-computing
Spark-for-data-engineers
Apache Spark for data engineers
Stars: ✭ 22 (-12%)
Mutual labels:  pyspark
dask-pytorch-ddp
dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
Stars: ✭ 50 (+100%)
Mutual labels:  distributed-computing
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+108%)
Mutual labels:  pyspark
zmq
ZeroMQ based distributed patterns
Stars: ✭ 27 (+8%)
Mutual labels:  distributed-computing
pre-lt-raster-frames
Spark DataFrames for earth observation data
Stars: ✭ 19 (-24%)
Mutual labels:  spark-ml
Archived-SANSA-Query
SANSA Query Layer
Stars: ✭ 31 (+24%)
Mutual labels:  distributed-computing
plinycompute
A system for development of high-performance, data-intensive, distributed computing, applications, tools, and libraries.
Stars: ✭ 27 (+8%)
Mutual labels:  distributed-computing
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+1796%)
Mutual labels:  pyspark
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-32%)
Mutual labels:  pyspark
Spark-Scala-EKS
Spark Scala docker container sample for AWS testing - EKS & S3
Stars: ✭ 23 (-8%)
Mutual labels:  spark-ml
dcf
Yet another distributed compute framework
Stars: ✭ 48 (+92%)
Mutual labels:  distributed-computing
hydra-hpp
Hydra Hot Potato Player (game)
Stars: ✭ 12 (-52%)
Mutual labels:  distributed-computing
pat-helland-and-me
Materials related to my talk "Pat Helland and Me"
Stars: ✭ 14 (-44%)
Mutual labels:  distributed-computing
gordo
An API-first distributed deployment system of deep learning models using timeseries data to predict the behaviour of systems
Stars: ✭ 25 (+0%)
Mutual labels:  distributed-computing
Federated-Learning-and-Split-Learning-with-raspberry-pi
SRDS 2020: End-to-End Evaluation of Federated Learning and Split Learning for Internet of Things
Stars: ✭ 54 (+116%)
Mutual labels:  distributed-computing
protoactor-go
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+16452%)
Mutual labels:  distributed-computing
good-karma-kit
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Stars: ✭ 238 (+852%)
Mutual labels:  distributed-computing
distex
Distributed process pool for Python
Stars: ✭ 101 (+304%)
Mutual labels:  distributed-computing
pyspark-k8s-boilerplate
Boilerplate for PySpark on Cloud Kubernetes
Stars: ✭ 24 (-4%)
Mutual labels:  pyspark
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (+16%)
Mutual labels:  pyspark
marsjs
Label images from Unsplash in browser - using MobileNet on Tensorflow.Js
Stars: ✭ 53 (+112%)
Mutual labels:  distributed-computing
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+88%)
Mutual labels:  pyspark
nebula
A distributed block-based data storage and compute engine
Stars: ✭ 127 (+408%)
Mutual labels:  distributed-computing
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+13320%)
Mutual labels:  pyspark
databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-24%)
Mutual labels:  pyspark
IoTPy
Python for streams
Stars: ✭ 24 (-4%)
Mutual labels:  distributed-computing
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (+0%)
Mutual labels:  pyspark
asyncoro
Python framework for asynchronous, concurrent, distributed, network programming with coroutines
Stars: ✭ 50 (+100%)
Mutual labels:  distributed-computing
tutorial
Tutorials to help you build your first Swim app
Stars: ✭ 27 (+8%)
Mutual labels:  distributed-computing
ParallelUtilities.jl
Fast and easy parallel mapreduce on HPC clusters
Stars: ✭ 28 (+12%)
Mutual labels:  distributed-computing
pycondor
Build and submit workflows to HTCondor in Python
Stars: ✭ 23 (-8%)
Mutual labels:  distributed-computing
JOLI.jl
Julia Operators LIbrary
Stars: ✭ 14 (-44%)
Mutual labels:  distributed-computing
machinaris
An easy-to-use WebUI for crypto plotting and farming. Offers Plotman, MadMax, Chiadog, Bladebit, Farmr, and Forktools in a Docker container. Supports Chia, MMX, Chives, Flax, HDDCoin, and BPX among others.
Stars: ✭ 324 (+1196%)
Mutual labels:  distributed-computing
Prime95
Prime95 source code from GIMPS to find Mersenne Prime.
Stars: ✭ 25 (+0%)
Mutual labels:  distributed-computing
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-4%)
Mutual labels:  pyspark
ripple
Simple shared surface streaming application
Stars: ✭ 17 (-32%)
Mutual labels:  distributed-computing
Distributed-Data-Structures
[GSoC] Distributed Data Structures - Collections Framework for Chapel language
Stars: ✭ 13 (-48%)
Mutual labels:  distributed-computing
raven-distribution-framework
Decentralized Computing Backend for Artificial Intelligence, Web3, Metaverse, and Gaming Application
Stars: ✭ 31 (+24%)
Mutual labels:  distributed-computing
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+0%)
Mutual labels:  pyspark
cre
common runtime environment for distributed programming languages
Stars: ✭ 20 (-20%)
Mutual labels:  distributed-computing
pyspark-ML-in-Colab
Pyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (+28%)
Mutual labels:  pyspark
rce
Distributed, workflow-driven integration environment
Stars: ✭ 42 (+68%)
Mutual labels:  distributed-computing
Archived-SANSA-ML
SANSA Machine Learning Layer
Stars: ✭ 39 (+56%)
Mutual labels:  distributed-computing
pyspark-for-data-processing
Code for my presentation: Using PySpark to Process Boat Loads of Data
Stars: ✭ 20 (-20%)
Mutual labels:  pyspark
Sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+104%)
Mutual labels:  pyspark
nsmc-zeppelin-notebook
Movie review dataset Word2Vec & sentiment classification Zeppelin notebook
Stars: ✭ 26 (+4%)
Mutual labels:  spark-ml
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+212%)
Mutual labels:  pyspark
ShadowClone
Unleash the power of cloud
Stars: ✭ 224 (+796%)
Mutual labels:  distributed-computing
1-60 of 266 similar projects