πŸ‹οΈβ€β™€οΈ bevy_rl

πŸ—οΈ Build πŸ€” Reinforcement Learning πŸ‹πŸΏβ€β™‚οΈ Gym environments with πŸ•Š Bevy engine to train πŸ‘Ύ AI agents that πŸ’‘ learn from πŸ“Ί screen pixels.

Compatibility

| bevy version | bevy_rl version | | ------------ | :-------------: | | 0.7 | 0.0.5 | | 0.8 | 0.8.4 | | 0.9 | 0.9.4 |

πŸ“Features

πŸ“‹ Changelog

πŸ‘©β€πŸ’» Usage

1. Define App States

```rust

[derive(Debug, Clone, Eq, PartialEq, Hash)]

enum AppState { InGame, // where all the game logic is executed Control, // A paused state in which bevy_rl waits for agent actions Reset, // A request to reset environment state } ```

2. Define Action Space and Observation Space

A action space is a set of actions that an agent can take. An observation space is a set of observations that an agent can see. Action space can be discrete or continuous. Observations should be serializable to JSON with serde_json crate.

```rust // Action space

[derive(Default)]

pub struct Actions { ... }

// Observation space

[derive(Default, Serialize, Clone)]

pub struct State { ... }

```

3. Enable AI Gym Plugin

Width and hight should exceed 256, otherwise wgpu will panic.

```rust let gymsettings = AIGymSettings { width: 256, height: 256, numagents: 16, no_graphics: false, };

app .insertresource(gymsettings.clone()) .insertresource(Arc::new(Mutex::new(AIGymState::::new(gymsettings.clone())))) .add_plugin(AIGymPlugin::::default()) ```

4. Implement Environment Logic

DelayedControlTimer should pause environment execution to allow agents to take actions.

rust struct DelayedControlTimer(Timer);

Define systems that implement environment logic.

```rust app.addsystemset( SystemSet::onupdate(AppState::InGame) .withsystem(turnbasedcontrolsystem_switch), );

app.insertresource(DelayedControlTimer(Timer::fromseconds(0.1, true))); // 10 Hz app.addsystemset( SystemSet::onupdate(AppState::Control) // Game Systems .withsystem(turnbasedtextcontrolsystem) // System that parses user command .withsystem(executeresetrequest), // System that performs environment state reset ); ```

πŸ’» AIGymState API

| Method | Description | | -------------------------------------------------- | ------------------------------------------ | | send_step_result(results: Vec<bool>) | Send upon agents interactions are complete | | send_reset_result(result: bool) | Send when reset request is complete | | receive_action_strings(Vec<Option<String>>) | Recieve environment for agent actions | | receive_reset_request() | Recieve environment for reset request | | is_next_action() -> bool | Whether agent actions are supplied | | is_reset_request() -> bool | Whether reset request was sent | | set_reward(agent_index: usize, score: f32) | Set reward for an agent | | set_terminated(agent_index: usize, result: bool) | Set termination status for an agent | | reset() | Reset bevy_rl state | | set_env_state(state: B) | Set current environment state |

🌐 REST API

| Method | Verb | bevy_rl version | | ----------------- | -------- | --------------------------------------------- | | Camera Pixels | GET | http://localhost:7878/visual_observations | | State | GET | http://localhost:7878/state | | Reset Environment | POST | http://localhost:7878/reset | | Step | GET | http://localhost:7878/step payload=ACTION |

✍️ Examples

bevyrlshooter β€” example FPS project