All Projects β†’ ForestFlow β†’ ForestFlow

ForestFlow / ForestFlow

Licence: Apache-2.0 License
ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.

Programming Languages

scala
5932 projects
java
68154 projects - #9 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to ForestFlow

Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+294.55%)
Mutual labels:  h2o
gallery
BentoML Example Projects 🎨
Stars: ✭ 120 (+118.18%)
Mutual labels:  model-deployment
allie
πŸ€– A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+69.09%)
Mutual labels:  model-deployment
forecastVeg
A Machine Learning Approach to Forecasting Remotely Sensed Vegetation Health in Python
Stars: ✭ 44 (-20%)
Mutual labels:  h2o
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-58.18%)
Mutual labels:  h2o
telco-customer-churn-in-r-and-h2o
Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples)
Stars: ✭ 59 (+7.27%)
Mutual labels:  h2o
Benchm Ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+3236.36%)
Mutual labels:  h2o
sldm4-h2o
Statistical Learning & Data Mining IV - H2O Presenation & Tutorial
Stars: ✭ 26 (-52.73%)
Mutual labels:  h2o
skutil
NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for python). See more here: https://tgsmith61591.github.io/skutil
Stars: ✭ 29 (-47.27%)
Mutual labels:  h2o
pyh2o
Python binding for the H2O HTTP server
Stars: ✭ 25 (-54.55%)
Mutual labels:  h2o
serving-pytorch-models
Serving PyTorch models with TorchServe πŸ”₯
Stars: ✭ 91 (+65.45%)
Mutual labels:  model-deployment
nih-chest-xray
Identifying diseases in chest X-rays using convolutional neural networks
Stars: ✭ 83 (+50.91%)
Mutual labels:  h2o
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+6000%)
Mutual labels:  model-deployment
ModelDeployment
CRAN Task View: Model Deployment with R
Stars: ✭ 19 (-65.45%)
Mutual labels:  model-deployment
steam
DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.
Stars: ✭ 59 (+7.27%)
Mutual labels:  h2o
Remixautoml
R package for automation of machine learning, forecasting, feature engineering, model evaluation, model interpretation, data generation, and recommenders.
Stars: ✭ 159 (+189.09%)
Mutual labels:  h2o
mercury-ml
Mercury-ML is an open source Machine Learning workflow management library. Its core contributors are employees of Alexander Thamm GmbH
Stars: ✭ 37 (-32.73%)
Mutual labels:  h2o
model-deployment
Examples showcasing model deployment
Stars: ✭ 17 (-69.09%)
Mutual labels:  model-deployment
model-deployment-flask
'Deploying machine learning models with a Flask API' tutorial, written for HyperionDev
Stars: ✭ 64 (+16.36%)
Mutual labels:  model-deployment
serving-tensorflow-models
Serving TensorFlow models with TensorFlow Serving πŸ“™
Stars: ✭ 41 (-25.45%)
Mutual labels:  model-deployment

ForestFlow

ForestFlow is a scalable policy-based cloud-native machine learning model server. ForestFlow strives to strike a balance between the flexibility it offers data scientists and the adoption of standards while reducing friction between Data Science, Engineering and Operations teams.

ForestFlow is policy-based because we believe automation for Machine Learning/Deep Learning operations is critical to scaling human resources. ForestFlow lends itself well to workflows based on automatic retraining, version control, A/B testing, Canary Model deployments, Shadow testing, automatic time or performance-based model deprecation and time or performance-based model routing in real-time.

Our aim with ForestFlow is to provide data scientists a simple means to deploy models to a production system with minimal friction accelerating the development to production value proposition.

To achieve these goals, ForestFlow looks to address the proliferation of model serving formats and standards for inference API specifications by adopting, what we believe, are currently, or are becoming widely adopted open source frameworks, formats, and API specifications. We do this in a pluggable format such that we can continue to evolve ForestFlow as the industry and space matures and we see a need for additional support.

Contents

Overview

Why ForestFlow?

Continuous deployment and lifecycle management of Machine Learning/Deep Learning models is currently widely accepted as a primary bottleneck for gaining value out of ML projects.

We first set out to find a solution to deploy our own models. The model server implementations we found were either proprietary, closed-source solutions or had too many limitations in what we wanted to achieve. The main concerns for creating ForestFlow can be summarized as:

  • We wanted to reduce friction between our data science, engineering and operations teams
  • We wanted to give data scientists the flexibility to use the tools they wanted (H2O, TensorFlow, Spark export to PFA etc..)
  • We wanted to automate certain lifecycle management aspects of model deployments like automatic performance or time-based routing and retirement of stale models
  • We wanted a model server that allows easy A/B testing, Shadow (listen only) deployments and and Canary deployments. This allows our Data Scientists to experiment with real production data without impacting production and using the same tooling they would when deployment to production.
  • We wanted something that was easy to deploy and scale for different deployment scenarios (on-prem local data center single instance, cluster of instances, Kubernetes managed, Cloud native etc..)
  • We wanted the ability to treat inference requests as a stream and log predictions as a stream. This allows us to test new models against a stream of older infer requests.
  • We wanted to avoid the "super-hero" data scientist that knows how to dockerize an application, apply the science, build an API and deploy to production. This does not scale well and is difficult to support and maintain.
  • Most of all, we wanted repeatability. We didn't want to re-invent the wheel once we had support for a specific framework.

Model Deployment

For model deployment, ForestFlow supports models described via MLfLow Model format which allows for different flavors i..e, frameworks & storage formats.

ForestFlow also supports a BASIC REST API for model deployment as well that mimics the MLflow Model format but does not require it.

Inference

For inference, we’ve adopted a similar approach. ForestFlow provides 2 interfaces for maximum flexibility; a BASIC REST API in addition to standardizing on the GraphPipe API specification.

Relying on standards, for example using GraphPipe’s specification means immediate availability of client libraries in a variety of languages that already support working with ForestFlow; see GraphPipe clients.

Please visit the quickstart guide to get a quick overview of setting up ForestFlow and an example on inference. Also please visit the Inference documentation for a deeper dive.

Currently Supported model formats

  • H2O - Mojo Model
  • TensorFlow & Keras - Planned
  • PFA - Planned
  • Spark ML Models and Pipelines via Aardpfark and PFA - Planned

Go to the Quick Start Guide to get started then dive a little deeper and learn about ForestFlow Concepts and how you can tailor it to fit your own use-cases.

Contributing

While ForestFlow has already delivered tremendous value for us in production, it's still in early phases of development as there are plenty of features we have planned and this continues to evolve at a rapid pace. We appreciate and consistently, make use of and, contribute open source projects back to the community. We realize the problems we're facing aren't unique to us so we welcome feedback, ideas and contributions from the community to help develop our roadmap and implementation of ForestFlow.

Check out Contribution Guide for more details on contributing to ForestFlow.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].