All notable changes to this project will be documented in this file.
- Added vision-based PPO training configs for
CartpoleBalanceandPandaPickCubeCartesianwith tuned hyperparameters. - Update vision notebooks to use MuJoCo Warp.
- Default MuJoCo implementation for all envs is now MuJoCo Warp.
- Renamed deprecated
nconmax/nccdmaxconfig fields tonaconmax/naccdmaxacross all environments, matching the updated MJX API.
- Pass through the MuJoCo Warp
(MjWarp) implementation to MJX, so that MuJoCo Playground environments can
train with MuJoCo Warp! You can pass through the implementation via the config
override
registry.load('CartpoleBalance', config_overrides={'impl': 'warp'}). - Update environments to utilize contact sensors and remove
collision.py. - Remove
mjx_env.initin favor ofmjx_env.make_datasincemake_datanow requires anMjModelargument rather than anmjx.Modelargument. - Add device to
mjx_env.make_data, fixes #174. - Update AutoResetWrapper to allow full resets on done. Fixes #179. Also
provides a means for doing curriculum learning via
state.info['AutoResetWrapper_done_count'], see #140. - Update dependencies to use
mujoco>=3.4andwarp-lang>=1.11.
- Change
light_directionaltolight_typefollowing MuJoCo API change from version 3.3.2 to 3.3.3. Fixes #142. - Fix bug in
get_qpos_ids. - Implement
renderin Wrapper. - Fix #123.
- Fix #126.
- Fix #41.
- Added ALOHA handover task (thanks to @Andrew-Luo1).
- Added Booster T1 joystick task.
- Fixed foot friction randomization for G1 tasks.
- Fix various bugs in
train_jax_ppo.py(thanks to @vincentzhang). - Fixed a small bug in the privileged state of the go1 joystick task.
- Updated supported Python versions to 3.10-3.12.
Initial release.