Add metrics calculations to the inference pipeline #23
Conversation
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #23      +/-   ##
==========================================
+ Coverage   95.83%   96.32%   +0.49%
==========================================
  Files           3        3
  Lines         120      136      +16
==========================================
+ Hits          115      131      +16
  Misses          5        5

☔ View full report in Codecov by Sentry.
Looking good! Some comments/questions inline. Thx!
tests/test_main.py (Outdated)

labels = [item["output"] for item in items]

bleu, meteor = evaluate_documentation(labels, labels)
assert bleu >= 0 and bleu <= 1, "BLEU score should be between 0 and 1"
Are bleu and meteor == 1 when label == prediction?
It would actually be a bit clearer if you hard-code some examples and assert specific values, e.g. all tokens match, an extra token in the prediction, a missing token in the prediction, etc.
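(For illustration, a minimal sketch of the kind of hard-coded cases suggested above, assuming `evaluate_documentation(references, predictions)` returns a `(bleu, meteor)` pair of floats in [0, 1] as in the test snippet. Exact score values depend on tokenization and smoothing in the underlying metric implementation, so the sketch asserts bounds and relative ordering rather than fixed numbers.)

```python
# Hypothetical hard-coded test cases; the (references, predictions) argument
# order and the import path are assumptions, not the project's confirmed API.
from main import evaluate_documentation


def test_metrics_on_hardcoded_examples():
    refs = ["returns the sum of a and b"]
    exact = ["returns the sum of a and b"]          # all tokens match
    extra = ["returns the sum of a and b quickly"]  # extra token in the prediction
    missing = ["returns the sum of a"]              # missing tokens in the prediction

    bleu_exact, meteor_exact = evaluate_documentation(refs, exact)
    bleu_extra, meteor_extra = evaluate_documentation(refs, extra)
    bleu_missing, meteor_missing = evaluate_documentation(refs, missing)

    # Every score stays within [0, 1].
    for score in (bleu_exact, meteor_exact, bleu_extra,
                  meteor_extra, bleu_missing, meteor_missing):
        assert 0.0 <= score <= 1.0

    # An exact match should score at least as high as a perturbed prediction.
    assert bleu_exact >= bleu_extra
    assert bleu_exact >= bleu_missing
    assert meteor_exact >= meteor_extra
    assert meteor_exact >= meteor_missing
```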
Added more test cases, and the code works as expected.
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #23      +/-   ##
==========================================
+ Coverage   96.15%   96.57%   +0.42%
==========================================
  Files           3        3
  Lines         130      146      +16
==========================================
+ Hits          125      141      +16
  Misses          5        5

☔ View full report in Codecov by Sentry.
Almost done! See just a couple of suggestions inline.
Closing the Pull Request for evaluation metrics.
Change Description
Adds metric calculations to the inference pipeline in main.py and a corresponding unit test in test_main.py.
closes #2
Solution Description
Added BLEU and METEOR evaluation metrics to the inference pipeline; a sketch of one possible implementation follows the references below.
BLEU score calculation
https://www.baeldung.com/cs/nlp-bleu-score
METEOR score calculation
https://huggingface.co/spaces/evaluate-metric/meteor
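For context, a minimal sketch of what the `evaluate_documentation` helper could look like using the Hugging Face `evaluate` library referenced above; the `(references, predictions)` argument order, the metric loading, and the return shape are assumptions, and the actual code in main.py may differ.

```python
# Sketch only: assumes the Hugging Face `evaluate` package and a
# (references, predictions) argument order; adjust to match main.py.
import evaluate

bleu_metric = evaluate.load("bleu")
meteor_metric = evaluate.load("meteor")


def evaluate_documentation(references, predictions):
    """Return (bleu, meteor), each a corpus-level score in [0, 1]."""
    bleu = bleu_metric.compute(
        predictions=predictions,
        references=[[ref] for ref in references],  # one reference per prediction
    )["bleu"]
    meteor = meteor_metric.compute(
        predictions=predictions,
        references=references,
    )["meteor"]
    return bleu, meteor
```

Called as in the test above, `evaluate_documentation(labels, labels)` should then return scores at or near 1.0 for both metrics.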
Code Quality
Project-Specific Pull Request Checklists