All Projects → banditml → Banditml

banditml / Banditml

Licence: gpl-3.0
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Banditml

Keras Rl
Deep Reinforcement Learning for Keras.
Stars: ✭ 5,166 (+3967.72%)
Mutual labels:  reinforcement-learning, neural-networks
Udacity Deep Learning Nanodegree
This is just a collection of projects that made during my DEEPLEARNING NANODEGREE by UDACITY
Stars: ✭ 15 (-88.19%)
Mutual labels:  reinforcement-learning, neural-networks
Tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Stars: ✭ 6,136 (+4731.5%)
Mutual labels:  reinforcement-learning, neural-networks
Dissecting Reinforcement Learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Stars: ✭ 512 (+303.15%)
Mutual labels:  reinforcement-learning, neural-networks
Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-64.57%)
Mutual labels:  reinforcement-learning, neural-networks
Machine Learning Articles
Monthly Series - Top 10 Machine Learning Articles
Stars: ✭ 516 (+306.3%)
Mutual labels:  reinforcement-learning, neural-networks
Basic reinforcement learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (+550.39%)
Mutual labels:  reinforcement-learning, neural-networks
Sharpneat
SharpNEAT - Evolution of Neural Networks. A C# .NET Framework.
Stars: ✭ 273 (+114.96%)
Mutual labels:  reinforcement-learning, neural-networks
Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-66.93%)
Mutual labels:  reinforcement-learning, neural-networks
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-67.72%)
Mutual labels:  reinforcement-learning, neural-networks
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+3059.06%)
Mutual labels:  reinforcement-learning, neural-networks
Outlace.github.io
Machine learning and data science blog.
Stars: ✭ 65 (-48.82%)
Mutual labels:  reinforcement-learning, neural-networks
Tensorlayer Tricks
How to use TensorLayer
Stars: ✭ 357 (+181.1%)
Mutual labels:  reinforcement-learning, neural-networks
Tensorflow Value Iteration Networks
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper
Stars: ✭ 549 (+332.28%)
Mutual labels:  reinforcement-learning, neural-networks
Gdrl
Grokking Deep Reinforcement Learning
Stars: ✭ 304 (+139.37%)
Mutual labels:  reinforcement-learning, neural-networks
Tensorflow Tutorial
TensorFlow and Deep Learning Tutorials
Stars: ✭ 748 (+488.98%)
Mutual labels:  reinforcement-learning, neural-networks
Nn
🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Stars: ✭ 5,720 (+4403.94%)
Mutual labels:  reinforcement-learning, neural-networks
Rlgraph
RLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (+114.17%)
Mutual labels:  reinforcement-learning, neural-networks
Artificialintelligenceengines
Computer code collated for use with Artificial Intelligence Engines book by JV Stone
Stars: ✭ 35 (-72.44%)
Mutual labels:  reinforcement-learning, neural-networks
Mujocounity
Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Stars: ✭ 47 (-62.99%)
Mutual labels:  reinforcement-learning, neural-networks

Bandit ML

PyPI version

What's banditml?

banditml is a lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. This library is developed by Bandit ML and ex-authors of Facebook's applied reinforcement learning platform, Reagent.

Specifically, this repo contains:

  • Feature engineering & preprocessing
  • Model implementations
  • Model training workflows
  • Model serving code for Python services

Supported models

Models supported:

4 feature types supported:

  • Numeric: standard floating point features
    • e.g. {totalCartValue: 39.99}
  • Categorical: low-cardinality discrete features
    • e.g. {currentlyViewingCategory: "men's jeans"}
  • ID list: high-cardinality discrete features
    • e.g. {productsInCart: ["productId022", "productId109"...]}
    • Handled via. learned embedding tables
  • "Dense" ID list: high-cardinality discrete features, manually mapped to dense feature vectors
    • e.g {productId022: [0.5, 1.3, ...], productId109: [1.9, 0.1, ...], ...}

Docs

pip install banditml

Get started

License

GNU General Public License v3.0 or later

See COPYING to see the full text.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].