Actions: pwr-ai/JuDDGES

python-lint-and-test

439 workflow runs

feat: final version of scraping
python-lint-and-test #339: Commit 5ea838d pushed by asawczyn
September 2, 2024 19:42 8m 23s feat/nsa-multiproxy
feat: final version of scraping
python-lint-and-test #338: Commit d63cd49 pushed by asawczyn
September 2, 2024 18:30 14m 10s feat/nsa-multiproxy
Reproduce Bielik v2
python-lint-and-test #337: Commit 774048b pushed by binkjakub
August 31, 2024 10:04 6m 49s fine-tune-on-english-data
feat: add instruct readme
python-lint-and-test #336: Commit b1af463 pushed by asawczyn
August 30, 2024 15:08 40s feat/en-readme
Fix default num_proc in structured evaluator
python-lint-and-test #335: Commit 3d2d38d pushed by binkjakub
August 30, 2024 08:21 6m 52s fine-tune-on-english-data
Reproduce llm-as-judge with fixed prompt
python-lint-and-test #334: Commit e27516c pushed by binkjakub
August 29, 2024 15:29 8m 54s fine-tune-on-english-data
Reproduce evaluation on fixed parsing
python-lint-and-test #333: Commit a9cc9b3 pushed by binkjakub
August 29, 2024 14:13 38s fine-tune-on-english-data
Fix missing gpt-4o outputs and reproduce English data on gpt
python-lint-and-test #332: Commit d8b83c8 pushed by binkjakub
August 28, 2024 15:13 40s fine-tune-on-english-data
Add bielik v2
python-lint-and-test #331: Commit 89f5199 pushed by binkjakub
August 28, 2024 10:09 1m 11s fine-tune-on-english-data
Add Bielik v0.1 LLM
python-lint-and-test #326: Commit 36cf817 pushed by binkjakub
August 26, 2024 13:08 5m 52s add-multiple-runs-eval
Add Bielik v0.1 LLM
python-lint-and-test #325: Commit 415f3ea pushed by binkjakub
August 26, 2024 09:17 6m 40s add-multiple-runs-eval
Add Bielik v0.1 LLM
python-lint-and-test #323: Commit 8dcbdbd pushed by binkjakub
August 25, 2024 15:31 6m 49s add-multiple-runs-eval
Update README.md with reproduction instruction
python-lint-and-test #322: Commit 57ea86d pushed by binkjakub
August 25, 2024 09:42 6m 48s add-multiple-runs-eval
Update README.md with reproduction instruction
python-lint-and-test #321: Commit bff812a pushed by binkjakub
August 25, 2024 09:22 7m 1s add-multiple-runs-eval
Update README.md with reproduction instruction
python-lint-and-test #320: Commit d770b1e pushed by binkjakub
August 25, 2024 09:21 7m 9s add-multiple-runs-eval
Reproduce Mistral-Nemo and summarize all results
python-lint-and-test #319: Commit 8ab3078 pushed by binkjakub
August 12, 2024 19:03 5m 48s add-multiple-runs-eval
Disable CI on windows for now (utf-8 bugs)
python-lint-and-test #318: Commit d104981 pushed by binkjakub
August 12, 2024 13:09 6m 36s add-multiple-runs-eval
Fix llm-as-judge implementation
python-lint-and-test #317: Commit 7824769 pushed by binkjakub
August 12, 2024 08:24 6m 18s add-multiple-runs-eval
Add llm-as-judge preliminary results (too much non-evaluable)
python-lint-and-test #316: Commit 1d8df86 pushed by binkjakub
August 11, 2024 07:54 9m 3s add-multiple-runs-eval
Evaluate updated Mistral and summarize metrics with mean and std
python-lint-and-test #315: Commit 4ca650e pushed by binkjakub
August 8, 2024 13:42 6m 11s add-multiple-runs-eval