ποΈ Build π€ Reinforcement Learning ππΏββοΈ Gym environments with π Bevy engine to train πΎ AI agents that π‘ learn from πΊ screen pixels.
| bevy version | bevy_rl version | | ------------ | :-------------: | | 0.7 | 0.0.5 | | 0.8 | 0.8.4 | | 0.9 | 0.9.4 |
Deref
ergonomicsAIGymState
could not be initialized outside of the crate```rust
enum AppState { InGame, // where all the game logic is executed Control, // A paused state in which bevy_rl waits for agent actions Reset, // A request to reset environment state } ```
A action space is a set of actions that an agent can take. An observation space is a set of observations that an agent can see. Action space can be discrete or continuous. Observations should be serializable to JSON with serde_json
crate.
```rust // Action space
pub struct Actions { ... }
// Observation space
pub struct State { ... }
```
Width and hight should exceed 256, otherwise wgpu will panic.
```rust let gymsettings = AIGymSettings { width: 256, height: 256, numagents: 16, no_graphics: false, };
app
.insertresource(gymsettings.clone())
.insertresource(Arc::new(Mutex::new(AIGymState::
DelayedControlTimer
should pause environment execution to allow agents to take actions.
rust
struct DelayedControlTimer(Timer);
Define systems that implement environment logic.
```rust app.addsystemset( SystemSet::onupdate(AppState::InGame) .withsystem(turnbasedcontrolsystem_switch), );
app.insertresource(DelayedControlTimer(Timer::fromseconds(0.1, true))); // 10 Hz app.addsystemset( SystemSet::onupdate(AppState::Control) // Game Systems .withsystem(turnbasedtextcontrolsystem) // System that parses user command .withsystem(executeresetrequest), // System that performs environment state reset ); ```
turnbased_control_system_switch
should pause game world and poll bevy_rl
for agent actions.execute_reset_request
handles environment reset request.turnbased_text_control_system
parses agent actions and issues commands to agents in environment.| Method | Description |
| -------------------------------------------------- | ------------------------------------------ |
| send_step_result(results: Vec<bool>)
| Send upon agents interactions are complete |
| send_reset_result(result: bool)
| Send when reset request is complete |
| receive_action_strings(Vec<Option<String>>)
| Recieve environment for agent actions |
| receive_reset_request()
| Recieve environment for reset request |
| is_next_action() -> bool
| Whether agent actions are supplied |
| is_reset_request() -> bool
| Whether reset request was sent |
| set_reward(agent_index: usize, score: f32)
| Set reward for an agent |
| set_terminated(agent_index: usize, result: bool)
| Set termination status for an agent |
| reset()
| Reset bevy_rl state |
| set_env_state(state: B)
| Set current environment state |
| Method | Verb | bevy_rl version |
| ----------------- | -------- | --------------------------------------------- |
| Camera Pixels | GET | http://localhost:7878/visual_observations
|
| State | GET | http://localhost:7878/state
|
| Reset Environment | POST | http://localhost:7878/reset
|
| Step | GET | http://localhost:7878/step
payload=ACTION
|
bevyrlshooter β example FPS project