Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
6,222 workflow runs
6,222 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add the BlueBench benchmark
Tasks Modified #3479: Pull request #2369 synchronize by shachardon
October 2, 2024 08:11 Action required shachardon:bluebench_pr
October 2, 2024 08:11 Action required
Add the BlueBench benchmark
Unit Tests #3451: Pull request #2369 synchronize by shachardon
October 2, 2024 08:11 Action required shachardon:bluebench_pr
October 2, 2024 08:11 Action required
[API] tokenizer: add trust-remote-code
Unit Tests #3450: Pull request #2372 opened by baberabb
October 1, 2024 21:00 10m 9s api_trust
October 1, 2024 21:00 10m 9s
[API] tokenizer: add trust-remote-code
Tasks Modified #3478: Pull request #2372 opened by baberabb
October 1, 2024 21:00 12s api_trust
October 1, 2024 21:00 12s
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Tasks Modified #3477: Pull request #2353 synchronize by baberabb
October 1, 2024 19:43 15s automodel
October 1, 2024 19:43 15s
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Unit Tests #3449: Pull request #2353 synchronize by baberabb
October 1, 2024 19:43 6m 34s automodel
October 1, 2024 19:43 6m 34s
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Unit Tests #3448: Pull request #2353 synchronize by baberabb
October 1, 2024 19:37 4m 58s automodel
October 1, 2024 19:37 4m 58s
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Tasks Modified #3476: Pull request #2353 synchronize by baberabb
October 1, 2024 19:37 13s automodel
October 1, 2024 19:37 13s
Add the BlueBench benchmark
Unit Tests #3447: Pull request #2369 synchronize by shachardon
October 1, 2024 13:49 Action required shachardon:bluebench_pr
October 1, 2024 13:49 Action required
Add the BlueBench benchmark
Tasks Modified #3475: Pull request #2369 synchronize by shachardon
October 1, 2024 13:49 Action required shachardon:bluebench_pr
October 1, 2024 13:49 Action required
Add the BlueBench benchmark
Tasks Modified #3474: Pull request #2369 opened by shachardon
October 1, 2024 13:18 Action required shachardon:bluebench_pr
October 1, 2024 13:18 Action required
Add the BlueBench benchmark
Unit Tests #3446: Pull request #2369 opened by shachardon
October 1, 2024 13:18 -1s shachardon:bluebench_pr
October 1, 2024 13:18 -1s
Remove unnecessary space prefix
Tasks Modified #3471: Pull request #2368 opened by eldarkurtic
October 1, 2024 12:16 1m 29s eldarkurtic:fix-leaderboard_v2
October 1, 2024 12:16 1m 29s
Remove unnecessary space prefix
Unit Tests #3443: Pull request #2368 opened by eldarkurtic
October 1, 2024 12:16 5m 6s eldarkurtic:fix-leaderboard_v2
October 1, 2024 12:16 5m 6s
Add new benchmark: Catalan bench
Unit Tests #3442: Pull request #2154 synchronize by zxcvuser
September 30, 2024 15:32 5m 37s zxcvuser:catalan_bench
September 30, 2024 15:32 5m 37s
Add new benchmark: Catalan bench
Tasks Modified #3470: Pull request #2154 synchronize by zxcvuser
September 30, 2024 15:32 1m 50s zxcvuser:catalan_bench
September 30, 2024 15:32 1m 50s
Fix missing key in custom task loading. (#2304)
Tasks Modified #3469: Commit 15ffb0d pushed by haileyschoelkopf
September 30, 2024 15:01 1m 39s main
September 30, 2024 15:01 1m 39s
Fix missing key in custom task loading. (#2304)
Unit Tests #3441: Commit 15ffb0d pushed by haileyschoelkopf
September 30, 2024 15:01 5m 17s main
September 30, 2024 15:01 5m 17s
Add new benchmark: Catalan bench
Tasks Modified #3468: Pull request #2154 synchronize by haileyschoelkopf
September 30, 2024 14:55 1m 28s zxcvuser:catalan_bench
September 30, 2024 14:55 1m 28s
Add new benchmark: Catalan bench
Unit Tests #3440: Pull request #2154 synchronize by haileyschoelkopf
September 30, 2024 14:55 7m 11s zxcvuser:catalan_bench
September 30, 2024 14:55 7m 11s
Add new benchmark: Portuguese bench (#2156)
Tasks Modified #3467: Commit caa7c40 pushed by haileyschoelkopf
September 30, 2024 14:53 3m 38s main
September 30, 2024 14:53 3m 38s