Add new benchmark: Galician bench #2155

Open
wants to merge 8 commits into
base: main
from

zxcvuser
Contributor

GalicianBench is a benchmark of Galician-language tasks covering several evaluation areas. It combines professional translations of existing English datasets with datasets newly created in Galician. The README.md contains detailed information on all the tasks included in the benchmark.

@CLAassistant

CLAassistant commented Jul 30, 2024

CLA assistant check
All committers have signed the CLA.

@@ -0,0 +1,28 @@
group: flores
Contributor

Suggested change
- group: flores
+ tag: flores

@@ -0,0 +1,27 @@
group:
Contributor

We should remove this, as it refers to the original MGSM paper. Also, is this dataset human-translated? The dataset page doesn't say.

Contributor Author

We will remove the group. As for the translation, it is human-translated. We will ask the authors to add this information to the dataset page. Thank you!

@@ -0,0 +1,20 @@
group:
- pawsx
Contributor

We should remove this as well.

@@ -0,0 +1,70 @@
group:
Contributor

Suggested change
- group:
+ tag:

@@ -0,0 +1,36 @@
group:
Contributor

Suggested change
- group:
+ tag:

@@ -0,0 +1,15 @@
group:
Contributor

Suggested change
- group:
+ tag:

@@ -0,0 +1,22 @@
group:
Contributor

We should remove this.

@zxcvuser
Contributor Author

These are the changes made:

  • Added the benchmark info to lm_eval/tasks/README.md
  • Replaced "-" with "_" in the create_files script in flores_gl and added weight_by_size: false
  • Ran the linters
  • Removed the grouping in the mgsm, paws, xnli, and belebele tasks (the groups pointed to pre-existing benchmarks)
  • Changed "group" to "tag" in truthful_gl

With these, it should all be fine now. Thank you!
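
For readers following along, the two mechanical renames listed above can be sketched in Python. This is only an illustration, not the code from this PR: the helper names `group_to_tag` and `normalize_task_name` are hypothetical, and so are the sample YAML text and the sample task name `flores_gl-en`. The sketch assumes the `group:` key sits unindented at the start of a line in the task YAML.

```python
import re


def group_to_tag(yaml_text: str) -> str:
    """Rename an unindented `group:` key to `tag:`.

    Only keys at the very start of a line are renamed; indented
    (nested) keys and all values are left untouched.
    """
    return re.sub(r"^group:", "tag:", yaml_text, flags=re.MULTILINE)


def normalize_task_name(name: str) -> str:
    """Replace hyphens with underscores in a task name."""
    return name.replace("-", "_")


if __name__ == "__main__":
    # Hypothetical task config text, not copied from the PR.
    sample = "group:\n  - truthfulqa\ntask: example_task_gl\n"
    print(group_to_tag(sample))

    # Hypothetical task name, used only to show the hyphen replacement.
    print(normalize_task_name("flores_gl-en"))
```

A plain-text substitution keeps comments and key order in the YAML files intact, which a parse-and-dump round trip might not. The trade-off is that it would miss a `group:` key that is indented or quoted.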

3 participants