Eval modalities #1

rrutmann · 2024-02-05T15:07:12Z

We add support for models trained with https://github.com/Modalities/modalities

mrudat-iais · 2024-02-19T13:32:55Z

Since I didn't create the PR, I cannot add reviewers. @fromm-m could you please have a look? It is only minor changes to adapt the eval-harness to our modalities repo. I suggest look at the tests first.

lllAlexanderlll · 2024-03-04T08:17:47Z

lm_eval/models/modalities.py

+    def _model_call(
+        self, inputs: TokenSequence, labels: Optional[TokenSequence] = None
+    ) -> TokenSequence:
+        return self.model(inputs)


Is labels never needed?

tests/test_models.py

Co-authored-by: Alexander Weber <[email protected]>

…ities

- Added support for evaluation of Mamba model

… modalities models.

Adapted for modalities

* batch commit * :Revert "batch commit" This reverts commit d859d1c. * batch commit * checkout from main * checkout from main * checkout from main * checkout from main * checkout from main * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * Chat template fix (OpenGPTX#7) * cleanup * cleanup * cleanup * linting * fix tests * add ifeval install to new_task CI * Revert "add ifeval install to new_task CI" This reverts commit 1d19449. * adds leaderboard tasks (#1) * adds leaderboard tasks * Delete lm_eval/tasks/leaderboard/leaderboard_chat_template.yaml * add readme * Delete lm_eval/tasks/leaderboard/mmlu_pro/mmlu_pro_chat_template.yaml * modify readme * fix bbh task * fix bbh salient task * modify the readme * Delete lm_eval/tasks/leaderboard/ifeval/README.md * Delete lm_eval/tasks/leaderboard/math/README.md * add leaderboard to the tasks repertory * add anouncment about new leaderbaord tasks * linting * Update README.md Co-authored-by: Hailey Schoelkopf <[email protected]> * installs ifeval dependency in new_task github workflow --------- Co-authored-by: Nathan Habib <[email protected]> Co-authored-by: Hailey Schoelkopf <[email protected]> * fix math parser * fix math parser * fix version * add warning about chat template --------- Co-authored-by: Nathan Habib <[email protected]> Co-authored-by: Nathan Habib <[email protected]> Co-authored-by: Nathan Habib <[email protected]> Co-authored-by: Hailey Schoelkopf <[email protected]> Co-authored-by: Nathan Habib <[email protected]>

rrutmann added 3 commits February 5, 2024 09:46

feat: Add support for models from modalities

1f711cf

test(eval): Test models from modalities

51a0cf5

test(modalities): Pass max_length to greedy_until

b304f2d

rrutmann self-assigned this Feb 5, 2024

refactor(HF_test): Test greedy_until() for 20 tokens

1908cd8

mrudat-iais mentioned this pull request Feb 19, 2024

Integrate new HF checkpoints into lm-evaluation-harness Modalities/modalities#26

Closed

mrudat-iais mentioned this pull request Feb 19, 2024

Add generation abilities to HF model for downstream evaluation Modalities/modalities#46

Closed

le1nux requested a review from fromm-m March 4, 2024 08:07

le1nux added the enhancement New feature or request label Mar 4, 2024

lllAlexanderlll reviewed Mar 4, 2024

View reviewed changes

ajude2s and others added 6 commits March 4, 2024 12:28

Update tests/test_models.py

5ef5928

Co-authored-by: Alexander Weber <[email protected]>

fix(eval): Using the array accessor instead

03b7c20

Merge remote-tracking branch 'origin/eval_modalities' into eval_modal…

ba25e26

…ities

feat(checkpointing):

7c5a9c0

- Added support for evaluation of Mamba model

feat(Modalities): Adapted eval harness to be able to evaluate generic…

09dd787

… modalities models.

feat(DownstreamEval):

bf873c6

Adapted for modalities

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eval modalities #1

Eval modalities #1

rrutmann commented Feb 5, 2024

mrudat-iais commented Feb 19, 2024

lllAlexanderlll Mar 4, 2024

Eval modalities #1

Are you sure you want to change the base?

Eval modalities #1

Conversation

rrutmann commented Feb 5, 2024

mrudat-iais commented Feb 19, 2024

lllAlexanderlll Mar 4, 2024

Choose a reason for hiding this comment