All Projects → MBKraus → incremental_training

MBKraus / incremental_training

Licence: other
Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'

Programming Languages

python
139335 projects - #7 most used programming language
Dockerfile
14818 projects

Projects that are alternatives of or similar to incremental training

k3ai
A lightweight tool to get an AI Infrastructure Stack up in minutes not days. K3ai will take care of setup K8s for You, deploy the AI tool of your choice and even run your code on it.
Stars: ✭ 105 (-4.55%)
Mutual labels:  airflow, mlflow
airflow-site
Apache Airflow Website
Stars: ✭ 95 (-13.64%)
Mutual labels:  airflow
Paperboy
A web frontend for scheduling Jupyter notebook reports
Stars: ✭ 221 (+100.91%)
Mutual labels:  airflow
kedro-airflow-k8s
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
Stars: ✭ 22 (-80%)
Mutual labels:  airflow
airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+0.91%)
Mutual labels:  airflow
fab-oidc
Flask-AppBuilder SecurityManager for OpenIDConnect
Stars: ✭ 28 (-74.55%)
Mutual labels:  airflow
Airflow Scheduler Failover Controller
A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability
Stars: ✭ 204 (+85.45%)
Mutual labels:  airflow
comet-for-mlflow
Comet-For-MLFlow Extension
Stars: ✭ 48 (-56.36%)
Mutual labels:  mlflow
scene-recognition-pytorch1.x
Evaluate wandb, tensorboard, neptune, mlflow, etc
Stars: ✭ 37 (-66.36%)
Mutual labels:  mlflow
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-81.82%)
Mutual labels:  airflow
aircan
💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher and Xloader for loading data to DataStore.
Stars: ✭ 24 (-78.18%)
Mutual labels:  airflow
event extract master
支持百度竞赛数据的中文事件抽取,支持ace2005数据的英文事件抽取,本人将苏神的三元组抽取算法中的DGCNN改成了事件抽取任务,并将karas改成了本人习惯使用的pytorch,在数据加载处考虑了各种语言的扩展
Stars: ✭ 43 (-60.91%)
Mutual labels:  mlflow
kedro-airflow
Kedro-Airflow makes it easy to deploy Kedro projects to Airflow.
Stars: ✭ 121 (+10%)
Mutual labels:  airflow
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+120.91%)
Mutual labels:  airflow
fuseml
FuseML aims to provide an MLOps framework as the medium dynamically integrating together the AI/ML tools of your choice. It's an extensible tool built through collaboration, where Data Engineers and DevOps Engineers can come together and contribute with reusable integration code.
Stars: ✭ 73 (-33.64%)
Mutual labels:  mlflow
Awesome Apache Airflow
Curated list of resources about Apache Airflow
Stars: ✭ 2,755 (+2404.55%)
Mutual labels:  airflow
pipeline
PipelineAI Kubeflow Distribution
Stars: ✭ 4,154 (+3676.36%)
Mutual labels:  airflow
domino-research
Projects developed by Domino's R&D team
Stars: ✭ 74 (-32.73%)
Mutual labels:  mlflow
tornado
The Tornado 🌪️ framework, designed and implemented for adaptive online learning and data stream mining in Python.
Stars: ✭ 110 (+0%)
Mutual labels:  incremental-learning
mlflow-gocd
GoCD plugins to work with MLFlow as model repository in a CD flow
Stars: ✭ 26 (-76.36%)
Mutual labels:  mlflow

Keeping your ML model in shape with Kafka, Airflow and MLFlow

How to incrementally update your ML model in an automated way as new training data becomes available

Fitting and serving your machine learning (ML) model is one thing, but what about keeping it in shape over time?

Let's say we got a ML model that has been put in production and is actively serving predictions. Simultaneously, we got new training data that becomes available in a streaming way while users use the model. Incrementally updating the model with new data can improve the model, whilst it also might reduce model drift. However, it often comes with additional overhead. Luckily, there are tools that allow you to automate many parts of this process. 

This repository takes on the topic of incrementally updating a ML model as new data becomes available. It mainly leans on three nifty tools, being Kafka, Airflow, and MLFlow.

The corresponding walkthrough/post on Medium lays out the workings of this repo step-by-step.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].