Unity-Technologies / Banditdungeon

License: Apache-2.0
Demo project using multi-armed bandit algorithm

Projects that are alternatives of or similar to Banditdungeon

Unity Animator Helpers
A micro-framework for changing Unity 3D's Animator parameters with ScriptableObject(s). Designed to make going from custom scripts to Animator parameters easy. Works with 2D or 3D projects.
Stars: ✭ 89 (-5.32%)
Mutual labels:  unity, unity3d
Gpu Planetary Rendering
GPU atmospheric scattering and planet generation in Unity 3D
Stars: ✭ 92 (-2.13%)
Mutual labels:  unity, unity3d
Projectfieldwarning
Project: Field Warning is a community-made RTS game centered around lethal regiment and division-scale warfare.
Stars: ✭ 86 (-8.51%)
Mutual labels:  unity, unity3d
Unityandroidhotupdate
(Unity3D hot update) Provides a way to hot-update a Unity app on Android, supporting both code and resources, with no need for Lua, JS, an IL runtime, etc. It will not disturb your project development; the update is achieved simply by loading the new-version APK file.
Stars: ✭ 85 (-9.57%)
Mutual labels:  unity, unity3d
Trailboids
Just tried making boids with particle trails.
Stars: ✭ 93 (-1.06%)
Mutual labels:  unity, unity3d
Unity Colourlovers Importer
Unity editor tool to load colours and palettes directly from COLOURlovers.com
Stars: ✭ 85 (-9.57%)
Mutual labels:  unity, unity3d
Neolowman
Yet another low-poly man
Stars: ✭ 88 (-6.38%)
Mutual labels:  unity, unity3d
Shapes2d
Shapes2D for Unity3D - Make simple art assets quickly in Unity
Stars: ✭ 83 (-11.7%)
Mutual labels:  unity, unity3d
Audiopreviewtrack
Instant audio playback (scrubbing) in preview mode of Unity Timeline editor.
Stars: ✭ 88 (-6.38%)
Mutual labels:  unity, unity3d
Unityvision Ios
This native plugin enables Unity to take advantage of specific features of Core-ML and Vision Framework on the iOS platform.
Stars: ✭ 85 (-9.57%)
Mutual labels:  unity, unity3d
Iridescence
Iridescence shader
Stars: ✭ 89 (-5.32%)
Mutual labels:  unity, unity3d
Unity3d
Syphon Implementation for Unity3D Pro
Stars: ✭ 90 (-4.26%)
Mutual labels:  unity, unity3d
Unity Azure Pipelines Tasks
Azure DevOps extension adding tools to build and deploy Unity 3D projects using Azure Pipelines
Stars: ✭ 83 (-11.7%)
Mutual labels:  unity, unity3d
Unityrecyclinglistview
A fast scrolling list component for Unity UI which recycles its child elements
Stars: ✭ 86 (-8.51%)
Mutual labels:  unity, unity3d
Unity Abstract Wire
Unity Abstract Wires Effect
Stars: ✭ 83 (-11.7%)
Mutual labels:  unity, unity3d
Unityheapcrawler
Reflection-based heap snapshot tool for the Unity game engine
Stars: ✭ 91 (-3.19%)
Mutual labels:  unity, unity3d
Vertexanimationjob
Vertex animation with C# Job System and new Mesh API
Stars: ✭ 82 (-12.77%)
Mutual labels:  unity, unity3d
Temporalreprojectionexample
Temporal reprojection example for Unity
Stars: ✭ 82 (-12.77%)
Mutual labels:  unity, unity3d
Adamplanereflection
Planar reflection effect from the Adam Interior Environment package.
Stars: ✭ 86 (-8.51%)
Mutual labels:  unity, unity3d
Vrarmik
Unity Inverse Kinematics solution for arms in VR
Stars: ✭ 94 (+0%)
Mutual labels:  unity, unity3d

Bandit Dungeon Demo

Simple Unity project demonstrating the multi-armed bandit algorithm.

Overview

In the simplest scenario, there is a single room containing two chests. Opening a chest yields either a diamond (a good thing) or a ghost (a bad thing). Opening the same chest multiple times produces a different sequence of diamonds and ghosts, governed by that chest's underlying probability of yielding a diamond. For example, a chest with a probability of 0.5 yields a roughly even mix of diamonds and ghosts, while a chest with a probability of 0.9 yields a diamond approximately nine times out of ten. Each chest has its own true probability, which the agent (the entity deciding which chest to open) does not know. Each time the agent selects a chest, it receives either a positive reward for finding a diamond or a negative reward for finding a ghost. The goal of the agent is to maximize its total reward over a number of trials; in each trial the agent may select any chest.
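
Mechanically, each chest behaves like a Bernoulli bandit arm: a pull pays out with a fixed hidden probability. A minimal C# sketch of that idea (the Chest class and the +1/-1 reward values are illustrative assumptions, not taken from the project's code):

    using System;

    // One chest: yields a diamond (+1 reward) with hidden probability p,
    // otherwise a ghost (-1 reward). The agent never observes p directly.
    public class Chest
    {
        private readonly double p;                // true diamond probability
        private static readonly Random rng = new Random();

        public Chest(double diamondProbability)
        {
            p = diamondProbability;
        }

        // Open the chest once and return the reward for this trial.
        public int Open()
        {
            return rng.NextDouble() < p ? +1 : -1;
        }
    }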

If the agent knew the true underlying probability of each chest, its task would be simple: repeatedly select the chest with the highest probability of yielding a diamond. In the absence of this information, the best it can do is trade off intelligently between estimating the probabilities (called exploration) and selecting the chest with the highest estimated probability (called exploitation). An agent that only explores wastes all of its trials estimating each chest's probability without maximizing its reward, while an agent that explores too little wastes most of its trials exploiting inaccurate probability estimates. The key question is how to balance exploration and exploitation effectively.
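
A standard way to form those probability estimates is an incremental running average of the rewards observed for each chest. The sketch below shows that update rule in C#; ValueEstimator is a hypothetical helper illustrating the usual bandit value update, not necessarily the exact update in Agent.cs. Note how a high initialValue corresponds to the Begin Optimistic setting described below: optimistic initial estimates push the agent to try every chest at least a few times.

    // Running value estimate per chest, updated after every trial.
    // value[i] converges to the expected reward of chest i.
    public class ValueEstimator
    {
        private readonly double[] value;  // estimated reward per chest
        private readonly int[] count;     // number of pulls per chest

        public ValueEstimator(int numChests, double initialValue = 0.0)
        {
            value = new double[numChests];
            count = new int[numChests];
            for (int i = 0; i < numChests; i++)
                value[i] = initialValue;
        }

        // Incremental mean: Q <- Q + (reward - Q) / n
        public void Update(int chest, double reward)
        {
            count[chest]++;
            value[chest] += (reward - value[chest]) / count[chest];
        }

        public double[] Values => value;
    }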

The simplest scenario of a single room with two chests can be expanded to multiple rooms with several chests. In this demo you can choose between a stateless bandit (one room) and a contextual bandit (three rooms). For either scenario you can also choose the number of chests in each room (two through five), along with a few other settings discussed below.
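
In the contextual case, the only structural change is that value estimates are indexed by room (the context) as well as by chest. A sketch under the same assumptions as above (ContextualEstimator is likewise hypothetical; the stateless bandit is just the special case of one room):

    // Contextual bandit: estimates indexed by (room, chest) rather than
    // by chest alone. A stateless bandit is the case numRooms == 1.
    public class ContextualEstimator
    {
        private readonly double[,] value;
        private readonly int[,] count;

        public ContextualEstimator(int numRooms, int numChests)
        {
            value = new double[numRooms, numChests];
            count = new int[numRooms, numChests];
        }

        public void Update(int room, int chest, double reward)
        {
            count[room, chest]++;
            value[room, chest] += (reward - value[room, chest]) / count[room, chest];
        }

        public double Estimate(int room, int chest) => value[room, chest];
    }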

For more information on how the agent learns a strategy for trading off exploration and exploitation, check out our corresponding post on the Unity AI blog.

Beyond this demo, check out our Q-learning demo and our Unity ML-Agents repo, which contains an SDK for applying more advanced methods to training behaviors within Unity.

In-game Settings

The goal of this Unity project is to provide an informative visualization of the multi-armed bandit algorithm, enabling you to explore a variety of different settings.

  • Bandit Type - A stateless bandit contains a single set of chests; a contextual bandit contains three sets, each denoted by a different room color (red, blue, and green).
  • Difficulty - How large the gap is between the diamond probability of the best chest and that of the other chests.
  • Bandit Arms - How many chests are in each room.
  • Begin Optimistic - Whether the agent's value estimates are initialized with high values (active) or low values (inactive).
  • Agent Speed - How quickly the agent takes actions. Increase the speed to learn faster; decrease it to make the agent's behavior easier to follow.
  • Agent Confidence - How narrow the agent's probability distribution over actions is. Increasing it makes the agent pick the chests with the highest estimated values more often; decreasing it makes the choice closer to uniform. This controls the exploration vs. exploitation trade-off (see the sketch after this list).
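
The Agent Confidence setting can be read as the inverse temperature of a softmax (Boltzmann) action-selection rule, where action probabilities are proportional to exp(confidence × estimated value). A hedged sketch of that rule follows; the class and parameter names are assumptions, and the project's exact formulation may differ.

    using System;
    using System.Linq;

    public static class SoftmaxPolicy
    {
        private static readonly Random rng = new Random();

        // Sample a chest index from a softmax over value estimates.
        // Higher confidence concentrates probability on the best chest;
        // confidence near zero approaches a uniform choice.
        public static int Choose(double[] values, double confidence)
        {
            // Subtract the max before exponentiating for numerical stability.
            double max = values.Max();
            double[] weights = values
                .Select(v => Math.Exp(confidence * (v - max)))
                .ToArray();

            double sample = rng.NextDouble() * weights.Sum();
            for (int i = 0; i < weights.Length; i++)
            {
                sample -= weights[i];
                if (sample <= 0.0) return i;
            }
            return weights.Length - 1; // guard against rounding error
        }
    }

Together with the value update sketched earlier, the learning loop would be: choose a chest with SoftmaxPolicy.Choose, open it, then feed the observed reward back through Update.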

Set-up

To get started with this project:

  • Download and install Unity if you don't already have it.
  • Download or clone this GitHub repository.
  • Open the scene.unity file under the Project/Asset subdirectory.

Within the project:

  • Agent.cs contains all of the multi-armed bandit logic.
  • BanditEnvironment.cs contains all of the environment-specific logic.