HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+20142.37%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+511.86%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+157.63%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+37269.49%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+340.68%)
Mpi OperatorKubernetes Operator for Allreduce-style Distributed Training
Stars: ✭ 190 (+222.03%)
Theano-MPIMPI Parallel framework for training deep learning models built in Theano
Stars: ✭ 55 (-6.78%)
Easylambdadistributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (+705.08%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+413.56%)
MleapMLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+1988.14%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (+216.95%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+154.24%)
Sparkit LearnPySpark + Scikit-learn = Sparkit-learn
Stars: ✭ 1,073 (+1718.64%)
ElephasDistributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+2477.97%)
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+1508.47%)
mloperatorMachine Learning Operator & Controller for Kubernetes
Stars: ✭ 85 (+44.07%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-72.88%)
Kaio-machine-learning-human-face-detectionMachine Learning project a case study focused on the interaction with digital characters, using a character called "Kaio", which, based on the automatic detection of facial expressions and classification of emotions, interacts with humans by classifying emotions and imitating expressions
Stars: ✭ 18 (-69.49%)
polystoresA library for performing hyperparameter optimization
Stars: ✭ 48 (-18.64%)
docker-sparkApache Spark docker container image (Standalone mode)
Stars: ✭ 34 (-42.37%)
DataSciPyData Science with Python
Stars: ✭ 15 (-74.58%)
realtimemap-dotnetA showcase for Proto.Actor - an ultra-fast distributed actors solution for Go, C#, and Java/Kotlin.
Stars: ✭ 47 (-20.34%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+54.24%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-54.24%)
optimism-v2ARCHIVE of monorepo implementing Boba, an L2 Compute solution built on Optimistic Ethereum - active repo is at https://github.com/bobanetwork/boba
Stars: ✭ 34 (-42.37%)
cramTool to run many small MPI jobs inside of one large MPI job.
Stars: ✭ 23 (-61.02%)
easyFLAn experimental platform to quickly realize and compare with popular centralized federated learning algorithms. A realization of federated learning algorithm on fairness (FedFV, Federated Learning with Fair Averaging, https://fanxlxmu.github.io/publication/ijcai2021/) was accepted by IJCAI-21 (https://www.ijcai.org/proceedings/2021/223).
Stars: ✭ 104 (+76.27%)
blockchain-reading-listA reading list on blockchain and related technologies, targeted at technical people who want a deep understanding of those topics.
Stars: ✭ 93 (+57.63%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-77.97%)
spark-word2vecA parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-59.32%)
DevOpsDevOps code to deploy eScience services
Stars: ✭ 19 (-67.8%)
model-deployment-flask'Deploying machine learning models with a Flask API' tutorial, written for HyperionDev
Stars: ✭ 64 (+8.47%)
Awesome-ScriptsA collection of awesome scripts from developers around the globe.
Stars: ✭ 135 (+128.81%)
skippaSciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-44.07%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+61.02%)
Machine-LearningThe projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Stars: ✭ 54 (-8.47%)
kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-45.76%)
sentry-sparkApache Spark Sentry Integration
Stars: ✭ 14 (-76.27%)
shamashAutoscaling for Google Cloud Dataproc
Stars: ✭ 31 (-47.46%)
spark-druid-olapSparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Stars: ✭ 286 (+384.75%)
future.batchtools🚀 R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
Stars: ✭ 77 (+30.51%)
PFL-Non-IIDThe origin of the Non-IID phenomenon is the personalization of users, who generate the Non-IID data. With Non-IID (Not Independent and Identically Distributed) issues existing in the federated learning setting, a myriad of approaches has been proposed to crack this hard nut. In contrast, the personalized federated learning may take the advantage…
Stars: ✭ 58 (-1.69%)
projection-pursuitAn implementation of multivariate projection pursuit regression and univariate classification
Stars: ✭ 24 (-59.32%)
handson-ml2핸즈온 머신러닝 2/E의 주피터 노트북
Stars: ✭ 393 (+566.1%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (+284.75%)
reachLoad embeddings and featurize your sentences.
Stars: ✭ 17 (-71.19%)
neworderA dynamic microsimulation framework for python
Stars: ✭ 15 (-74.58%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-55.93%)