Change callback for AdversarialTrainer
#626
base: master
Conversation
@@ -421,7 +422,7 @@ def train_gen(
     def train(
         self,
         total_timesteps: int,
-        callback: Optional[Callable[[int], None]] = None,
+        callback: Optional[List[BaseCallback]] = None
     ) -> None:
         """Alternates between training the generator and discriminator.
The last part of the docstring, "and finally a call to callback(round)", is probably misleading now.
    if self.gen_callback is None:
        self.gen_callback = callback
    else:
        self.gen_callback = callback + [self.gen_callback]
Can someone abuse the API by calling train() multiple times? If so, self.gen_callback would end up containing a nested list, which is not correct. More generally, gen_callback is currently typed Optional[BaseCallback], and we shouldn't change its type to a list at runtime. Perhaps it would be better to add an optional callback argument to train_gen(), merge the callbacks there, and avoid the stateful change here?
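To make the concern concrete: merging locally inside train_gen() and never reassigning self.gen_callback avoids the nested-list bug on repeated train() calls. A minimal sketch of that idea, where merge_callbacks and the stub BaseCallback are hypothetical names (the real BaseCallback lives in stable_baselines3), not code from this PR:

```python
from typing import List, Optional


class BaseCallback:
    """Stand-in for stable_baselines3.common.callbacks.BaseCallback."""


def merge_callbacks(
    extra: Optional[List[BaseCallback]],
    gen_callback: Optional[BaseCallback],
) -> List[BaseCallback]:
    """Return a fresh, flat list of callbacks.

    Builds a new list on every call instead of mutating trainer state,
    so calling train() repeatedly can never produce nested lists.
    """
    merged: List[BaseCallback] = list(extra) if extra is not None else []
    if gen_callback is not None:
        merged.append(gen_callback)
    return merged
```

Because the function is pure, a second train() call simply recomputes the same flat list rather than wrapping the previous one.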
Also, can the learn_kwargs argument be removed from train_gen(), as discussed in the original issue #607?
Do we want to change the semantics of the argument here, or should we rather deprecate the feature (and introduce a different parameter for an additional gen_callback)? I think the suggestion in the original issue was to add a new gen_callback argument. (By the way, stable-baselines supports both a CallbackList and a plain list of callbacks, if we wanted to be fancy.)
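That flexibility could be mirrored with a small normalizer that accepts either a single callback or a list of callbacks. This is only a sketch: normalize_gen_callback and the BaseCallback stub below are hypothetical, standing in for stable-baselines' BaseCallback/CallbackList, not code from this PR:

```python
from typing import List, Optional, Union


class BaseCallback:
    """Stand-in for stable_baselines3.common.callbacks.BaseCallback."""


def normalize_gen_callback(
    cb: Optional[Union[BaseCallback, List[BaseCallback]]],
) -> List[BaseCallback]:
    """Accept None, a single callback, or a list; always return a flat list.

    Copying the incoming list means the caller's list is never aliased or
    mutated by the trainer.
    """
    if cb is None:
        return []
    if isinstance(cb, list):
        return list(cb)
    return [cb]
```

A hypothetical gen_callback parameter could then be documented as "a callback or list of callbacks" while the trainer internally works with one normalized list.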
One more thing: if you change the arguments, training_adversarial.py will also need to be updated.
Changing the callback mechanism of AdversarialTrainer such that we can insert sb3.EvalCallback. See #607.