503 open source projects that are alternatives to or similar to Tf_chatbot_seq2seq_antilm

SDLM-pytorch
Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
Stars: ✭ 27 (-92.68%)
Mutual labels:  language-model
Reward Learning Rl
[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
Stars: ✭ 310 (-15.99%)
ai-n-queens
Solving and GUI demonstration of traditional N-Queens Problem using Hill Climbing, Simulated Annealing, Local Beam Search, and Genetic Algorithm.
Stars: ✭ 30 (-91.87%)
Mutual labels:  beam-search
Pytorch Ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
Stars: ✭ 272 (-26.29%)
minicons
Utility for analyzing Transformer based representations of language.
Stars: ✭ 28 (-92.41%)
Mutual labels:  language-model
Azureml Bert
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Stars: ✭ 342 (-7.32%)
Mutual labels:  language-model
wolpertinger ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatible.
Stars: ✭ 44 (-88.08%)
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-27.37%)
Seq2seq chatbot
A TensorFlow implementation of a simple dialogue system based on the seq2seq model, with embedding, attention, and beam_search features; the dataset is Cornell Movie Dialogs
Stars: ✭ 308 (-16.53%)
Mutual labels:  beam-search
Object-Goal-Navigation
Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"
Stars: ✭ 107 (-71%)
Atari
Persistent advantage learning dueling double DQN for the Arcade Learning Environment
Stars: ✭ 261 (-29.27%)
transformer
Neutron: A pytorch based implementation of Transformer and its variants.
Stars: ✭ 60 (-83.74%)
Mutual labels:  beam-search
Curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Stars: ✭ 346 (-6.23%)
rl pytorch
Deep Reinforcement Learning Algorithms Implementation in PyTorch
Stars: ✭ 23 (-93.77%)
Planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Stars: ✭ 257 (-30.35%)
DRL DeliveryDuel
Deep Reinforcement Learning applied to a modern 3D video-game environment called Delivery Duel.
Stars: ✭ 30 (-91.87%)
Neural Symbolic Machines
Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
Stars: ✭ 305 (-17.34%)
dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
Stars: ✭ 20 (-94.58%)
pysc2-rl-agents
StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)
Stars: ✭ 124 (-66.4%)
Deep-RL-agents
No description or website provided.
Stars: ✭ 27 (-92.68%)
Crypto Rl
Deep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent
Stars: ✭ 328 (-11.11%)
MinTL
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Stars: ✭ 61 (-83.47%)
Mutual labels:  language-model
Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Stars: ✭ 19 (-94.85%)
MaRLEnE
Machine- and Reinforcement Learning ExtensioN for (game) Engines
Stars: ✭ 47 (-87.26%)
Gpt Neox
An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
Stars: ✭ 303 (-17.89%)
Mutual labels:  language-model
playing-mario-with-deep-reinforcement-learning
An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.
Stars: ✭ 55 (-85.09%)
captioning chainer
A fast implementation of Neural Image Caption by Chainer
Stars: ✭ 17 (-95.39%)
Mutual labels:  beam-search
rl-medical
Communicative Multiagent Deep Reinforcement Learning for Anatomical Landmark Detection using PyTorch.
Stars: ✭ 36 (-90.24%)
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-1.36%)
robustnav
Evaluating pre-trained navigation agents under corruptions
Stars: ✭ 18 (-95.12%)
CommNet
an implementation of CommNet
Stars: ✭ 23 (-93.77%)
Deep-Quality-Value-Family
Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning Algorithms": https://arxiv.org/abs/1909.01779 To appear at the next NeurIPS2019 DRL-Workshop
Stars: ✭ 12 (-96.75%)
Pytorch Trpo
PyTorch implementation of Trust Region Policy Optimization
Stars: ✭ 303 (-17.89%)
deep reinforcement learning gallery
Deep reinforcement learning with tensorflow2
Stars: ✭ 35 (-90.51%)
Deep-Reinforcement-Learning-Notebooks
This repository contains a series of Google Colab notebooks which I created to help people dive into deep reinforcement learning. These notebooks contain both theory and implementation of different algorithms.
Stars: ✭ 15 (-95.93%)
pytorch-hdqn
Hierarchical-DQN in pytorch (not actively maintained)
Stars: ✭ 36 (-90.24%)
Deeprl Tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (-13.55%)
FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (-44.72%)
Mutual labels:  language-model
reinforce-js
[INACTIVE] A collection of various machine learning solvers. The library takes an object-oriented approach (baked with TypeScript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
Stars: ✭ 20 (-94.58%)
language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (-77.24%)
Mutual labels:  language-model
Deep rl
PyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, SAC, SAC-AEA)
Stars: ✭ 291 (-21.14%)
AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.
Stars: ✭ 71 (-80.76%)
pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
Stars: ✭ 47 (-87.26%)
chainer-notebooks
Jupyter notebooks for Chainer hands-on
Stars: ✭ 23 (-93.77%)
Mutual labels:  language-model
Im2latex
Image to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow
Stars: ✭ 342 (-7.32%)
Mutual labels:  beam-search
semantic-guidance
Code for our CVPR-2021 paper on Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings.
Stars: ✭ 19 (-94.85%)
rl implementations
No description or website provided.
Stars: ✭ 40 (-89.16%)
bert-movie-reviews-sentiment-classifier
Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model
Stars: ✭ 12 (-96.75%)
Mutual labels:  language-model
Transfer Nlp
NLP library designed for reproducible experimentation management
Stars: ✭ 287 (-22.22%)
Mutual labels:  language-model
AutoPentest-DRL
AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (-46.88%)
FinRL
FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
Stars: ✭ 3,497 (+847.7%)
Black-Box-Tuning
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service
Stars: ✭ 99 (-73.17%)
Mutual labels:  language-model
Openai lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-15.18%)
neural-chat
An AI chatbot using seq2seq
Stars: ✭ 30 (-91.87%)
Mutual labels:  beam-search
rlflow
A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms
Stars: ✭ 20 (-94.58%)
Kogpt2
Korean GPT-2 pretrained cased (KoGPT2)
Stars: ✭ 368 (-0.27%)
Mutual labels:  language-model
Tf2rl
TensorFlow2 Reinforcement Learning
Stars: ✭ 353 (-4.34%)
Rlzoo
A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀
Stars: ✭ 342 (-7.32%)
Cadrl ros
ROS package for dynamic obstacle avoidance for ground robots trained with deep RL
Stars: ✭ 309 (-16.26%)
Deep reinforcement learning course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Stars: ✭ 3,232 (+775.88%)