ποΈ Build π€ Reinforcement Learning ππΏββοΈ Gym environments with π Bevy engine to train πΎ AI agents that π‘ can learn from πΊ screen pixels or defined obeservation state.
| bevy version | bevy_rl version | | ------------ | :-------------: | | 0.7 | 0.0.5 | | 0.8 | 0.8.4 | | 0.9 | 0.9.8-beta |
Observation space needs to be Serializable
for REST API to work.
```rust // Action space
pub struct Actions { }
// Observation space
pub struct State { } ```
Width and height should exceed 256, otherwise wgpu will panic.
rust
// Setup bevy_rl
let ai_gym_state = AIGymState::<Actions, State>::new(AIGymSettings {
width: u32, // Width and height of the screen
height: u32, // ...
num_agents: 1, // Number of agents β each will get a camera handle
render_to_buffer: false, // You can disable rendering to buffer
pause_interval: 0.01, // 100 Hz
..default()
});
app.insert_resource(ai_gym_state)
.add_plugin(AIGymPlugin::<Actions, State>::default());
If your environment exports raw pixels, you will need to attach a render target to each camera of your agents.
```rust
pub(crate) fn spawncameras(
aigymstate: Res
for i in 0..ai_gym_settings.num_agents {
let render_image_handle = ai_gym_state.render_image_handles[i as usize].clone();
let render_target = RenderTarget::Image(render_image_handle);
let camera_bundle = Camera3dBundle {
camera: Camera {
target: render_target, // Render target is baked in bevy_rl and used to export pixels
priority: -1, // set to -1 to render at the firstmost pass
..default()
},
..default()
};
commands.spawn(camera_bundle);
}
} ```
| Event | Description |
| ------------------ | ------------------------------------- |
| EventReset
| Reset environment to initial state |
| EventControl
| Switch to control state |
| EventPauseResume
| Pause or resume environment execution |
```rust
// EventPauseResume
fn bevyrlpauserequest(
mut pauseeventreader: EventReader
// EventControl
fn bevyrlcontrolrequest(
mut pauseeventreader: EventReader
Register systems to handle bevy_rl events.
rust
// bevy_rl events
app.add_system(bevy_rl_pause_request);
app.add_system(bevy_rl_control_request);
| Method | Description |
| -------------------------------------------------- | ----------------------------------- |
| set_reward(agent_index: usize, score: f32)
| Set reward for an agent |
| set_terminated(agent_index: usize, result: bool)
| Set termination status for an agent |
| reset()
| Reset bevy_rl state |
| set_env_state(state: State)
| Set current environment state |
| Method | Verb | bevy_rl version |
| ----------------- | -------- | --------------------------------------------- |
| Camera Pixels | GET | http://localhost:7878/visual_observations
|
| State | GET | http://localhost:7878/state
|
| Reset Environment | POST | http://localhost:7878/reset
|
| Step | GET | http://localhost:7878/step
payload=ACTION
|