Bug in the float limit handling #2324
Labels: feature request (a feature that isn't implemented yet)

Comments
Hi! Nice catch! Yes, a PR would be appreciated!
cjluo-omniml added a commit to cjluo-omniml/lm-evaluation-harness that referenced this issue on Sep 19, 2024:
See: EleutherAI#2324. The float limit is overridden by the previous int limit when multiple tasks are triggered together. This PR fixes the issue.
@baberabb could you help review the fix above?
@baberabb friendly ping? This should be an easy fix.
Hi @cjluo-omniml! I left a comment in the PR earlier. The script uses …
@baberabb fixed. Could you review again?
@baberabb friendly ping?
Hi LM eval team,
In this line: https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/evaluator.py#L439
The limit is reassigned to an int if the original limit flag is a float. However, this does not cover the case where I'm running two tasks together.
E.g. TASK A has 100 data samples in total, TASK B has 1000, and I use a limit of 0.1.
I expect the eval to run 10 samples from TASK A and 100 samples from TASK B.
However, the current logic never recomputes the limit for TASK B, because the shared limit variable is already set to 10 after processing TASK A. So we end up running only 10 samples for TASK B as well.
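For illustration, here is a minimal, self-contained sketch of the pattern and one possible fix. The function and variable names are hypothetical, not the harness's actual code: the buggy version reassigns the shared `limit` variable inside the per-task loop, while the fixed version computes a per-task value and leaves the original float untouched.

```python
# Minimal sketch of the bug and one possible fix; names are illustrative,
# not the actual lm-evaluation-harness code.

def resolve_limits_buggy(task_sizes, limit):
    resolved = []
    for n_docs in task_sizes:
        if limit is not None:
            # BUG: reassigning the shared `limit` turns the float fraction
            # into an int after the first task, so later tasks reuse the
            # first task's absolute count instead of the fraction.
            limit = int(n_docs * limit) if limit < 1.0 else int(limit)
        resolved.append(limit)
    return resolved

def resolve_limits_fixed(task_sizes, limit):
    resolved = []
    for n_docs in task_sizes:
        task_limit = None
        if limit is not None:
            # FIX: compute a per-task limit and leave `limit` untouched,
            # so the float fraction applies to every task.
            task_limit = int(n_docs * limit) if limit < 1.0 else int(limit)
        resolved.append(task_limit)
    return resolved

print(resolve_limits_buggy([100, 1000], 0.1))  # [10, 10]  <- TASK B is wrong
print(resolve_limits_fixed([100, 1000], 0.1))  # [10, 100] <- expected
```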
This should be an easy fix; could you help update the code? Thanks!