You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: gymnasium_robotics/envs/maze/ant_maze_v5.py
+12-12Lines changed: 12 additions & 12 deletions
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,4 @@
1
-
"""A maze environment with the Gymnasium Ant agent (https://github.com/Farama-Foundation/Gymnasium/blob/main/gymnasium/envs/mujoco/ant_v4.py).
1
+
"""A maze environment with the Gymnasium Ant agent (https://github.com/Farama-Foundation/Gymnasium/blob/main/gymnasium/envs/mujoco/ant_v5.py).
2
2
3
3
The code is inspired by the D4RL repository hosted on GitHub (https://github.com/Farama-Foundation/D4RL), published in the paper
4
4
'D4RL: Datasets for Deep Data-Driven Reinforcement Learning' by Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine.
@@ -40,22 +40,22 @@ class AntMazeEnv(MazeEnv, EzPickle):
40
40
#### Maze size
41
41
The map variations for the mazes are the same as for `PointMaze`. The ant environments with fixed goal and reset locations are the following:
42
42
43
-
* `AntMaze_UMaze-v4`
44
-
* `AntMaze_BigMaze-v4`
45
-
* `AntMaze_HardestMaze-v4`
43
+
* `AntMaze_UMaze-v5`
44
+
* `AntMaze_BigMaze-v5`
45
+
* `AntMaze_HardestMaze-v5`
46
46
47
47
#### Diverse goal mazes
48
48
The environments with fixed reset position for the ant and randomly selected goals, also known as diverse goal, are:
49
49
50
-
* `AntMaze_BigMaze_DG-v4`
51
-
* `AntMaze_HardestMaze_DG-v4`
50
+
* `AntMaze_BigMaze_DG-v5`
51
+
* `AntMaze_HardestMaze_DG-v5`
52
52
53
53
#### Diverse goal and reset mazes
54
54
55
55
Finally, the environments that select the reset and goal locations randomly are:
56
56
57
-
* `AntMaze_BigMaze_DGR-v4`
58
-
* `AntMaze_HardestMaze_DGR-v4`
57
+
* `AntMaze_BigMaze_DGR-v5`
58
+
* `AntMaze_HardestMaze_DGR-v5`
59
59
60
60
#### Custom maze
61
61
Also, any of the `AntMaze` environments can be initialized with a custom maze map by setting the `maze_map` argument like follows:
@@ -70,7 +70,7 @@ class AntMazeEnv(MazeEnv, EzPickle):
@@ -153,8 +153,8 @@ class AntMazeEnv(MazeEnv, EzPickle):
153
153
- *sparse*: the returned reward can have two values: `0` if the ant hasn't reached its final target position, and `1` if the ant is in the final target position (the ant is considered to have reached the goal if the Euclidean distance between both is lower than 0.5 m).
154
154
- *dense*: the returned reward is the negative Euclidean distance between the achieved goal position and the desired goal.
155
155
156
-
To initialize this environment with one of the mentioned reward functions the type of reward must be specified in the id string when the environment is initialized. For `sparse` reward the id is the default of the environment, `AntMaze_UMaze-v4`. However, for `dense`
157
-
reward the id must be modified to `AntMaze_UMazeDense-v4` and initialized as follows:
156
+
To initialize this environment with one of the mentioned reward functions the type of reward must be specified in the id string when the environment is initialized. For `sparse` reward the id is the default of the environment, `AntMaze_UMaze-v5`. However, for `dense`
157
+
reward the id must be modified to `AntMaze_UMazeDense-v5` and initialized as follows:
158
158
159
159
```python
160
160
import gymnasium as gym
@@ -197,7 +197,7 @@ class AntMazeEnv(MazeEnv, EzPickle):
0 commit comments