spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+54.24%)
tbslasA parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on semi-Lagrangian/Volume-Integral method.
Stars: ✭ 21 (-64.41%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-54.24%)
wxparaverwxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (-61.02%)
optimism-v2ARCHIVE of monorepo implementing Boba, an L2 Compute solution built on Optimistic Ethereum - active repo is at https://github.com/bobanetwork/boba
Stars: ✭ 34 (-42.37%)
dlsaDistributed least squares approximation (dlsa) implemented with Apache Spark
Stars: ✭ 25 (-57.63%)
cramTool to run many small MPI jobs inside of one large MPI job.
Stars: ✭ 23 (-61.02%)
swarm-learningA simplified library for decentralized, privacy preserving machine learning
Stars: ✭ 142 (+140.68%)
easyFLAn experimental platform to quickly realize and compare with popular centralized federated learning algorithms. A realization of federated learning algorithm on fairness (FedFV, Federated Learning with Fair Averaging, https://fanxlxmu.github.io/publication/ijcai2021/) was accepted by IJCAI-21 (https://www.ijcai.org/proceedings/2021/223).
Stars: ✭ 104 (+76.27%)
textlyticsText processing library for sentiment analysis and related tasks
Stars: ✭ 25 (-57.63%)
blockchain-reading-listA reading list on blockchain and related technologies, targeted at technical people who want a deep understanding of those topics.
Stars: ✭ 93 (+57.63%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-77.97%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (+10.17%)
DevOpsDevOps code to deploy eScience services
Stars: ✭ 19 (-67.8%)
spark-word2vecA parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-59.32%)
QCFractalA distributed compute and database platform for quantum chemistry.
Stars: ✭ 107 (+81.36%)
model-deployment-flask'Deploying machine learning models with a Flask API' tutorial, written for HyperionDev
Stars: ✭ 64 (+8.47%)
distexDistributed process pool for Python
Stars: ✭ 101 (+71.19%)
Awesome-ScriptsA collection of awesome scripts from developers around the globe.
Stars: ✭ 135 (+128.81%)
kdd99-scikitSolutions to kdd99 dataset with Decision tree and Neural network by scikit-learn
Stars: ✭ 50 (-15.25%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+61.02%)
vector space modellingNLP in python Vector Space Modelling and document classification NLP
Stars: ✭ 16 (-72.88%)
IoTPyPython for streams
Stars: ✭ 24 (-59.32%)
Machine-LearningThe projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Stars: ✭ 54 (-8.47%)
Prime95Prime95 source code from GIMPS to find Mersenne Prime.
Stars: ✭ 25 (-57.63%)
future.batchtools🚀 R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
Stars: ✭ 77 (+30.51%)
booksA collection of online books for data science, computer science and coding!
Stars: ✭ 29 (-50.85%)
PFL-Non-IIDThe origin of the Non-IID phenomenon is the personalization of users, who generate the Non-IID data. With Non-IID (Not Independent and Identically Distributed) issues existing in the federated learning setting, a myriad of approaches has been proposed to crack this hard nut. In contrast, the personalized federated learning may take the advantage…
Stars: ✭ 58 (-1.69%)
abessFast Best-Subset Selection Library
Stars: ✭ 266 (+350.85%)
handson-ml2핸즈온 머신러닝 2/E의 주피터 노트북
Stars: ✭ 393 (+566.1%)
osprey🦅Hyperparameter optimization for machine learning pipelines 🦅
Stars: ✭ 71 (+20.34%)
reachLoad embeddings and featurize your sentences.
Stars: ✭ 17 (-71.19%)
playgroundA Streamlit application to play with machine learning models directly from the browser
Stars: ✭ 48 (-18.64%)
protoactor-goProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+6913.56%)
doubleml-for-pyDoubleML - Double Machine Learning in Python
Stars: ✭ 129 (+118.64%)
neworderA dynamic microsimulation framework for python
Stars: ✭ 15 (-74.58%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-55.93%)
axisemAxiSEM is a parallel spectral-element method to solve 3D wave propagation in a sphere with axisymmetric or spherically symmetric visco-elastic, acoustic, anisotropic structures.
Stars: ✭ 34 (-42.37%)
pystellaA code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-69.49%)
swordfishOpen-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-40.68%)
hyperqueueScheduler for sub-node tasks for HPC systems with batch scheduling
Stars: ✭ 48 (-18.64%)
Football Prediction ProjectThis project will pull past game data from api-football, and use these statistics to predict the outcome of future premier league matches through machine learning.
Stars: ✭ 44 (-25.42%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-67.8%)
mpifxModern Fortran wrappers around MPI routines
Stars: ✭ 25 (-57.63%)
Search Ads Web ServiceOnline search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (-49.15%)
sparkar-voltsAn extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-74.58%)
iot-master物联大师是开源免费的物联网智能网关系统,集成了标准Modbus和主流PLC等多种协议,支持数据采集、公式计算、定时控制、自动控制、异常报警、流量监控、Web组态、远程调试等功能,适用于大部分物联网和工业互联网应用场景。
Stars: ✭ 119 (+101.69%)
ray tutorialAn introductory tutorial about leveraging Ray core features for distributed patterns.
Stars: ✭ 67 (+13.56%)
dicodileExperiments for "Distributed Convolutional Dictionary Learning (DiCoDiLe): Pattern Discovery in Large Images and Signals"
Stars: ✭ 15 (-74.58%)
kaggle-titanicTitanic assignment on Kaggle competition
Stars: ✭ 30 (-49.15%)
machine-learning-capstone-projectThis is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-52.54%)
scikit-minescikit-mine : pattern mining in Python
Stars: ✭ 45 (-23.73%)
Spark-ArResources for Spark AR
Stars: ✭ 43 (-27.12%)
five-minute-midasPredicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-30.51%)