step_env
Advance a 2D point-mass environment by applying an acceleration action, returning the new observation and reward based on distance to goal.
Instructions
Apply a 2D acceleration action to the environment and advance one timestep.
Returns the new observation, the reward (negative distance to the goal), a done flag (true once the agent has reached the goal), and the distance to the goal.
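The step described above could be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the class name `PointMassEnv`, the state layout, the timestep `dt`, the goal position, the `goal_radius` threshold, the semi-implicit Euler integration, and the returned key names are all assumptions made for the example.

```python
import math
from dataclasses import dataclass


@dataclass
class PointMassEnv:
    # Every field and default below is a hypothetical choice for this sketch.
    x: float = 0.0
    y: float = 0.0
    vx: float = 0.0
    vy: float = 0.0
    goal_x: float = 1.0
    goal_y: float = 1.0
    dt: float = 0.1           # assumed timestep length
    goal_radius: float = 0.05  # assumed "reached the goal" threshold

    def step(self, action):
        """Apply a 2D acceleration (ax, ay) and advance one timestep."""
        ax, ay = action

        # Semi-implicit Euler: update velocity first, then position.
        self.vx += ax * self.dt
        self.vy += ay * self.dt
        self.x += self.vx * self.dt
        self.y += self.vy * self.dt

        # Reward is the negative Euclidean distance to the goal.
        distance = math.hypot(self.goal_x - self.x, self.goal_y - self.y)
        return {
            "observation": [self.x, self.y, self.vx, self.vy],
            "reward": -distance,
            "done": distance <= self.goal_radius,
            "distance": distance,
        }


env = PointMassEnv()
result = env.step([1.0, 0.0])  # accelerate along +x for one timestep
```

Under these assumptions the reward is always non-positive and approaches zero as the agent nears the goal, so a caller can compare rewards across steps to tell whether an action moved the agent closer.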
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | 2D acceleration to apply for one timestep. | |