BitA tool for component-driven application development.
Stars: ✭ 14,443 (+7422.4%)
Dvc🦉Data Version Control | Git for Data & Models | ML Experiments Management
Stars: ✭ 9,004 (+4589.58%)
DrakeAn R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+577.6%)
Drake ExamplesExample workflows for the drake R package
Stars: ✭ 57 (-70.31%)
TargetsFunction-oriented Make-like declarative workflows for R
Stars: ✭ 293 (+52.6%)
Server☁️ Nextcloud server, a safe home for all your data
Stars: ✭ 17,723 (+9130.73%)
PlzSay the magic word 😸
Stars: ✭ 31 (-83.85%)
Steppy ToolkitCurated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-89.06%)
RayAn open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Stars: ✭ 18,547 (+9559.9%)
FluidframeworkLibrary for building distributed, real-time collaborative web applications
Stars: ✭ 3,592 (+1770.83%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1984.9%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+2034.9%)
lightning-hydra-templatePyTorch Lightning + Hydra. A very user-friendly template for rapid and reproducible ML experimentation with best practices. ⚡🔥⚡
Stars: ✭ 1,905 (+892.19%)
MazeMaze Applied Reinforcement Learning Framework
Stars: ✭ 85 (-55.73%)
Production Data ScienceProduction Data Science: a workflow for collaborative data science aimed at production
Stars: ✭ 388 (+102.08%)
WdlWorkflow Description Language - Specification and Implementations
Stars: ✭ 438 (+128.13%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+546.88%)
PowerjobEnterprise job scheduling middleware with distributed computing ability.
Stars: ✭ 3,231 (+1582.81%)
PolyaxonMachine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+1444.79%)
H1stThe AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery mfg, avionics, automotive cybersecurity, and more.
Stars: ✭ 697 (+263.02%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+2845.83%)
TitanoboaTitanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (+309.9%)
DatmoOpen source production model management tool for data scientists
Stars: ✭ 334 (+73.96%)
VdsVerteego Data Suite
Stars: ✭ 9 (-95.31%)
AttacaRobust, distributed version control for large files.
Stars: ✭ 41 (-78.65%)
VistrailsVisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains detailed history information about the steps followed and data derived in the course of an exploratory task: VisTrails maintains provenance of data products, of the computational processes that derive these products and their executions.
Stars: ✭ 94 (-51.04%)
PloomberA convention over configuration workflow orchestrator. Develop locally (Jupyter or your favorite editor), deploy to Airflow or Kubernetes.
Stars: ✭ 221 (+15.1%)
NniAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+5471.88%)
Cape PythonCollaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-34.9%)
PrefectThe easiest way to automate your data
Stars: ✭ 7,956 (+4043.75%)
CkCollective Knowledge framework (CK) helps to organize black-box research software as a database of reusable components and micro-services with common APIs, automation actions and extensible meta descriptions. See real-world use cases from Arm, General Motors, ACM, Raspberry Pi foundation and others:
Stars: ✭ 395 (+105.73%)
SteppyLightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (-38.02%)
RenkuThe Renku Project provides a platform and tools for reproducible and collaborative data analysis.
Stars: ✭ 141 (-26.56%)
MlboxMLBox is a powerful Automated Machine Learning python library.
Stars: ✭ 1,199 (+524.48%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-28.65%)
BatchflowBatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Stars: ✭ 156 (-18.75%)
HyperactiveA hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (-5.21%)
GradioCreate UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+2169.79%)
Pca MagicPCA that iteratively replaces missing data
Stars: ✭ 185 (-3.65%)
BlindpadCollaborative text editor (like Google Docs or CoderPad) with integrated semi-anonymizing voice chat intended to help reduce bias in technical communication.
Stars: ✭ 191 (-0.52%)
ArewedistributedyetWebsite + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-1.56%)
Xamarin PlaygroundRandom cool stuff I play around using Xamarin.. :3 Some of these cool projects I feature them on my blog, with step by step explanation. :) Don't forget to check it out. Go to: theconfuzedsourcecode.wordpress.com
Stars: ✭ 183 (-4.69%)
HomlrSupplementary material for Hands-On Machine Learning with R, an applied book covering the fundamentals of machine learning with R.
Stars: ✭ 185 (-3.65%)
CollapseAdvanced and Fast Data Transformation in R
Stars: ✭ 184 (-4.17%)
PywarmA cleaner way to build neural networks for PyTorch.
Stars: ✭ 184 (-4.17%)
Uci Ml ApiSimple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-1.04%)
DelbotIt understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Stars: ✭ 191 (-0.52%)
Git WorkflowThe git workflow for contributing to open source repositories.
Stars: ✭ 188 (-2.08%)
BastionHighly-available Distributed Fault-tolerant Runtime
Stars: ✭ 2,333 (+1115.1%)
Awesome R Learning ResourcesA curated collection of free resources to help deepen your understanding of the R programming language. Updated regularly. Contributions encouraged via pull request (see contributing.md).
Stars: ✭ 181 (-5.73%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-2.08%)
Imbalanced AlgorithmsPython-based implementations of algorithms for learning on imbalanced data.
Stars: ✭ 180 (-6.25%)
Zi5bookbook.zi5.me全站kindle电子书籍爬取,按照作者书籍名分类,每本书有mobi和equb两种格式,采用分布式进行全站爬取
Stars: ✭ 191 (-0.52%)
Tmt WorkflowA web developer workflow used by WeChat team based on Gulp, with cross-platform supported and solutions prepared.
Stars: ✭ 2,167 (+1028.65%)
Fast Ide🕺Fast Integrated Development Environment 😻
Stars: ✭ 181 (-5.73%)
Lets Plot KotlinKotlin API for Lets-Plot - an open-source plotting library for statistical data.
Stars: ✭ 181 (-5.73%)