HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS #2353
base: main
Conversation
Thanks a bunch for working on this!!
Broadly this is the right approach, but I had a different thought on how we should handle things while maintaining backward compatibility for users, described in my PR comments. That should also ensure that we still respect novel subclass-overridden `self.AUTO_MODEL_CLASS` values.
```python
elif (
    getattr(self.config, "model_type") in MODEL_FOR_CAUSAL_LM_MAPPING_NAMES
):
    self.AUTO_MODEL_CLASS = transformers.AutoModelForCausalLM
    self.backend = "causal"
```
Let's make the below warning message explicitly state that we set `backend=causal` in cases like this?
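For concreteness, a minimal sketch of how such an explicit warning might read (the helper name and exact wording here are assumptions for illustration, not the PR's actual code):

```python
import logging

eval_logger = logging.getLogger("lm-eval")


def default_to_causal_with_warning(model_type: str) -> str:
    # Hypothetical helper: when we fall through to the causal branch,
    # state explicitly that backend="causal" is being set, and how to override.
    eval_logger.warning(
        f"Detected model_type='{model_type}' in the causal-LM registry: "
        "setting `backend='causal'` (AutoModelForCausalLM). "
        "Pass `backend='seq2seq'` explicitly to override."
    )
    return "causal"
```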
lm_eval/models/huggingface.py
Outdated
```python
sets `self.AUTO_MODEL_CLASS` appropriately if not already set.
Should only be called if isinstance(pretrained, str)! Otherwise pass `backend` appropriately.
```
This is a change in interface, right?
I think we should still call `_get_backend` so that if someone passes an HF model whose backend we can detect, it is set by us. Let's try not to force people to pass `backend` unless they absolutely have to.
lm_eval/models/huggingface.py
Outdated
```python
@@ -90,7 +90,7 @@ def __init__(
        **kwargs,
    ) -> None:
        super().__init__()

        self.backend = backend
```
I think I'd prefer it if we don't set `self.backend` here. I would want our code to expressly error out if a user subclassing HFLM never calls `super().__init__()`, in which case `HFLM._get_backend()` is never called.
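One hedged way to get that express error (the class and attribute names below are illustrative stand-ins, not the actual HFLM code): assign the backend only inside `__init__`, and raise a clear error from a property when a subclass skipped `super().__init__()`:

```python
class HFLMSketch:
    """Illustrative stand-in for HFLM (not the real class)."""

    def __init__(self) -> None:
        # `_backend` is assigned only here, so a subclass that skips
        # super().__init__() has no `_backend` attribute at all, and any
        # later use of `self.backend` fails loudly instead of silently.
        self._backend = self._get_backend()

    def _get_backend(self) -> str:
        # Placeholder for the real detection logic.
        return "causal"

    @property
    def backend(self) -> str:
        try:
            return self._backend
        except AttributeError:
            raise RuntimeError(
                "self.backend was never set; did your HFLM subclass "
                "forget to call super().__init__()?"
            ) from None


class BadSubclass(HFLMSketch):
    def __init__(self) -> None:
        pass  # deliberately skips super().__init__()
```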
lm_eval/models/huggingface.py
Outdated
```python
@@ -101,6 +101,8 @@ def __init__(
    self._device = self._model.device
    self._config = self._model.config
    gpus = 0
    # default backend to causal if not specified
    self.backend = self.backend if self.backend != "default" else "causal"
```
Can we handle this case in `_get_backend()` and remove this line?
It should be sufficient to simply settle for the existing behavior we had in that function: it defaults to causal if we can't detect both that our model is an HF model and that it is among the registered causal or seq2seq HF model types.
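A rough sketch of that consolidated `_get_backend` logic as a standalone function, under the assumption stated above (registry contents are mocked here; the real code consults the transformers model-type mappings):

```python
# Mocked stand-ins for the transformers registries the real code consults.
MODEL_FOR_CAUSAL_LM_MAPPING_NAMES = {"llama", "gpt2", "mistral"}
MODEL_FOR_SEQ_TO_SEQ_LM_MAPPING_NAMES = {"t5", "bart"}


def get_backend(model_type: str, backend: str = "default") -> str:
    # 1. A user-specified backend always wins.
    if backend in ("causal", "seq2seq"):
        return backend
    # 2. Otherwise, try to detect the backend among registered HF model types.
    if model_type in MODEL_FOR_CAUSAL_LM_MAPPING_NAMES:
        return "causal"
    if model_type in MODEL_FOR_SEQ_TO_SEQ_LM_MAPPING_NAMES:
        return "seq2seq"
    # 3. Fall back to causal when nothing can be detected, so no separate
    #    `backend != "default"` fix-up is needed at the call site.
    return "causal"
```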
lm_eval/models/huggingface.py
Outdated
```python
    config=self.config, backend=backend, trust_remote_code=trust_remote_code
)
# determine which of 'causal' and 'seq2seq' backends to use for HF models
self._get_backend(
```
Unindent if following my above comment!
lm_eval/models/huggingface.py
Outdated
```python
# the default _get_backend logic,
# then skip over the method.
# TODO: this seems very much undesirable in some cases--our code in HFLM
# references AutoModelForCausalLM at times to check for equality
if self.AUTO_MODEL_CLASS is not None:
    return
```
As per my above comments on the change in interface: this early exit from the `_get_backend()` function means we will now force users who override `AUTO_MODEL_CLASS` to set `self.backend` themselves in the init.
I think instead we should drop `if self.AUTO_MODEL_CLASS is not None: return` entirely and let our code handle things, but we should only set `AUTO_MODEL_CLASS` in the `_get_backend()` function if `AUTO_MODEL_CLASS` is currently None!
That way, for subclasses that set `AUTO_MODEL_CLASS` themselves, we can use our existing logic here ("use the user-specified backend if provided, else choose causal or seq2seq if we can detect it among HF models, else fall back to assuming causal"), which should be what we want so long as we signpost this behavior with enough logger messages, and yet preserve the desired usage of having a special `AUTO_MODEL_CLASS` that gets used for model init. Sound good?
(as is, this PR I think breaks HF multimodal LMs at the moment. with the changes in my comments I believe it will no longer?)
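The "only set `AUTO_MODEL_CLASS` if it is currently None" guard described above might look roughly like this (a sketch, not the PR's code; sentinel objects stand in for the transformers auto-classes so the example is self-contained):

```python
# Sentinel stand-ins for transformers.AutoModelForCausalLM and
# transformers.AutoModelForSeq2SeqLM, so this sketch runs without transformers.
AutoModelForCausalLM = object()
AutoModelForSeq2SeqLM = object()


def resolve_auto_model_class(current, backend: str):
    # Only assign AUTO_MODEL_CLASS when it is still None; a subclass that set
    # its own value (e.g. a multimodal auto-class) keeps it, while `backend`
    # is still detected/defaulted by our code either way.
    if current is not None:
        return current
    return AutoModelForCausalLM if backend == "causal" else AutoModelForSeq2SeqLM
```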
@haileyschoelkopf ready to review again! Still need to test, but wanted to make sure I understood you correctly.
The conditions in HFLM now check for either `causal` or `seq2seq` rather than checking the `AUTO_MODEL_CLASS`.