VowpalWabbit / Vowpal_wabbit

Licence: other
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Vowpal Wabbit

This is the Vowpal Wabbit fast online learning code.

Why Vowpal Wabbit?

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning. There is a specific focus on reinforcement learning: several contextual bandit algorithms are implemented, and the system's online nature lends itself well to the problem. Vowpal Wabbit is a destination for implementing and maturing state-of-the-art algorithms with performance in mind.

  • Input Format. The input format for the learning algorithm is substantially more flexible than might be expected. Examples can have features consisting of free form text, which is interpreted in a bag-of-words way. There can even be multiple sets of free form text in different namespaces.
  • Speed. The learning algorithm is fast -- similar to the few other online algorithm implementations out there. There are several optimization algorithms available with the baseline being sparse gradient descent (GD) on a loss function.
  • Scalability. This is not the same as fast. Instead, the important characteristic here is that the memory footprint of the program is bounded independent of the data. This means the training set is not loaded into main memory before learning starts. In addition, the hashing trick bounds the size of the feature set independent of the amount of training data.
  • Feature Interaction. Subsets of features can be internally paired so that the algorithm is linear in the cross-product of the subsets. This is useful for ranking problems. The alternative of explicitly expanding the features before feeding them into the learning algorithm can be both computation and space intensive, depending on how it's handled.
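
The ideas in the list above (namespaced bag-of-words input, sparse gradient descent, the hashing trick, and paired feature subsets) can be illustrated with a minimal Python sketch. This is not Vowpal Wabbit's implementation or API: the names `hashed_index`, `featurize`, and `OnlineLinearLearner` are hypothetical, the CRC32 hash and the table size of 2**18 are assumptions chosen for illustration, and the loss is plain squared error.

```python
import zlib
from array import array

NUM_BITS = 18                  # assumed table size (2**18 weights), for illustration only
TABLE_SIZE = 1 << NUM_BITS


def hashed_index(namespace, token):
    """Hashing trick: map any (namespace, token) pair into a fixed-size table.

    CRC32 is a stand-in hash for this sketch, not Vowpal Wabbit's hash function.
    """
    return zlib.crc32(f"{namespace}^{token}".encode("utf-8")) % TABLE_SIZE


def featurize(namespaces, interactions=()):
    """Turn free-form text per namespace into a sparse {index: value} dict.

    namespaces:   dict mapping namespace name -> free-form text (bag of words)
    interactions: pairs of namespace names whose tokens are crossed on the fly,
                  so the full cross-product is never stored explicitly
    """
    features = {}
    for name, text in namespaces.items():
        for token in text.split():
            idx = hashed_index(name, token)
            features[idx] = features.get(idx, 0.0) + 1.0
    for left, right in interactions:
        for a in namespaces.get(left, "").split():
            for b in namespaces.get(right, "").split():
                idx = hashed_index(f"{left}*{right}", f"{a}*{b}")
                features[idx] = features.get(idx, 0.0) + 1.0
    return features


class OnlineLinearLearner:
    """Linear model trained one example at a time with sparse gradient descent."""

    def __init__(self, learning_rate=0.05):
        # Memory is fixed up front and does not grow with the amount of data seen.
        self.weights = array("d", [0.0]) * TABLE_SIZE
        self.learning_rate = learning_rate

    def predict(self, features):
        return sum(self.weights[i] * v for i, v in features.items())

    def learn(self, features, label):
        """One squared-loss gradient step; touches only the non-zero features."""
        prediction = self.predict(features)
        error = prediction - label
        for i, v in features.items():
            self.weights[i] -= self.learning_rate * error * v
        return prediction


# Usage: stream examples one at a time; nothing is held in memory between steps.
learner = OnlineLinearLearner()
stream = [
    ({"query": "fast learner", "doc": "online learning"}, 1.0),
    ({"query": "fast learner", "doc": "slow batch"}, 0.0),
]
for _ in range(200):
    for namespaces, label in stream:
        learner.learn(featurize(namespaces, [("query", "doc")]), label)
```

Because every feature, including each crossed pair, is reduced to an index below `TABLE_SIZE`, the weight table stays the same size however many distinct words appear in the stream. The cost is that unrelated features can occasionally collide in the same slot.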

Visit the wiki to learn more.

Getting Started

For the most up-to-date instructions for getting started on Windows, macOS, or Linux, please see the wiki.
