From 4262142a7676a3b545d6eff1e2370145badc4751 Mon Sep 17 00:00:00 2001 From: ffelten Date: Thu, 24 Oct 2024 15:21:50 +0000 Subject: [PATCH] =?UTF-8?q?Deploying=20to=20gh-pages=20from=20@=20Farama-F?= =?UTF-8?q?oundation/MO-Gymnasium@5099244da152d4c33d700d92beded032eb63670b?= =?UTF-8?q?=20=F0=9F=9A=80?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- main/.buildinfo | 2 +- main/release_notes/index.html | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/main/.buildinfo b/main/.buildinfo index 3edf165..7a1e379 100644 --- a/main/.buildinfo +++ b/main/.buildinfo @@ -1,4 +1,4 @@ # Sphinx build info version 1 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. -config: 9046834584027c3e807ab111a0e2891e +config: 0e29f050df6b59b7c3826a79d9cc791d tags: d77d1c0d9ca2f4c8421862c7c5a0d620 diff --git a/main/release_notes/index.html b/main/release_notes/index.html index 451e1cb..59480d9 100644 --- a/main/release_notes/index.html +++ b/main/release_notes/index.html @@ -440,7 +440,7 @@

MO-Gymnasium 1.0.0 Release Notes

reward = forward_reward - ctrl_cost

With MORL, users are free to choose the trade-offs they want according to their preferences for each objective. Consequently, the environments in MO-Gymnasium do not have predefined weights. MO-Gymnasium thus extends Gymnasium to the multi-objective setting, where the agent receives a vector-valued reward.
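As a minimal sketch of what "no predefined weights" means, the snippet below keeps the two terms of the half-cheetah reward above as a vector and applies user-chosen weights afterwards. It is plain Python and does not call MO-Gymnasium; the `scalarize` helper, the numeric values, and the weight choices are hypothetical and only illustrate linear scalarization.

```python
from typing import Sequence


def scalarize(vector_reward: Sequence[float], weights: Sequence[float]) -> float:
    """Linear scalarization: weighted sum of the per-objective rewards."""
    if len(vector_reward) != len(weights):
        raise ValueError("one weight per objective is required")
    return sum(w * r for w, r in zip(weights, vector_reward))


# Hypothetical per-step values, for illustration only.
forward_reward = 1.5
ctrl_cost = 0.25

# Multi-objective view: one entry per objective instead of a single scalar.
vector_reward = (forward_reward, -ctrl_cost)

# Equal weights recover the single-objective reward: forward_reward - ctrl_cost.
single_objective = scalarize(vector_reward, (1.0, 1.0))

# Other preferences give other trade-offs over the same vector reward.
speed_focused = scalarize(vector_reward, (1.0, 0.1))
energy_focused = scalarize(vector_reward, (0.2, 1.0))
```

With equal weights the scalarized value matches the fixed single-objective reward, while different weights rank the same behaviour differently. This is why the choice of weights is left to the user rather than built into the environment.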

For example, here is an illustration of the multiple policies learned by an MORL agent for the mo-halfcheetah domain, each trading off battery savings against speed:

[Image: policies learned by an MORL agent for the mo-halfcheetah domain]

This release marks the first mature version of MO-Gymnasium within Farama: the API is stable, and the library has reached a high level of quality.

API