Add hallucination multicalibrator, with example benchmark #152

gianlucadetommaso · 2023-11-16T14:03:24Z

Add the HallucinationMulticalibrator. This is able to take in input a generative model, a tokenizer and some data, then fit the calibrator and predict calibrated probabilities of hallucinations.

wistuba

I left some comments. Maybe you want to add some more unit tests.

wistuba · 2023-11-16T14:59:26Z

benchmarks/hallucination/mmlu/run.py

+        for task in task_list
+    ]
+
+    answer_map = {a: i for i, a in enumerate(auc)}


choices seem to be in list("ABCD"). why has the answer map more options?

Tries to be more generic, but you're right, can be restricted to "ABCD".

benchmarks/hallucination/mmlu/run.py

fortuna/hallucination/base.py

wistuba · 2023-11-16T15:21:08Z

fortuna/hallucination/base.py

+                    }
+
+                    with torch.no_grad():
+                        __logits = self.generative_model(


double _ intentional?

fortuna/hallucination/base.py

wistuba · 2023-11-16T15:29:52Z

tests/fortuna/hallucination/scoring.py

+
+
+class TestScoringModel(unittest.TestCase):
+    def test_score(self):


can't you compare against exact values?

wistuba · 2023-11-16T15:30:31Z

fortuna/hallucination/utils/string.py

+import re
+
+
+def string_cleaner(text: str) -> str:


great candidate for a function to have unit tests

gianlucadetommaso added 30 commits May 15, 2023 19:07

edit installation instructions in readme

52e96ea

Merge branch 'main' of https://github.com/awslabs/fortuna

5e0076d

Merge branch 'main' of https://github.com/awslabs/fortuna

4c7fd28

bump up version

6cb6581

Merge branch 'main' of https://github.com/awslabs/fortuna

1b39780

Merge branch 'main' of https://github.com/awslabs/fortuna

cb2b49a

Merge branch 'main' of https://github.com/awslabs/fortuna

14e3ca4

Merge branch 'main' of https://github.com/awslabs/fortuna

580067d

Merge branch 'main' of https://github.com/awslabs/fortuna

048ef09

Merge branch 'main' of https://github.com/awslabs/fortuna

ad542a4

Merge branch 'main' of https://github.com/awslabs/fortuna

41417c1

Merge branch 'main' of https://github.com/awslabs/fortuna

64be374

Merge branch 'main' of https://github.com/awslabs/fortuna

a2d0f34

Merge branch 'main' of https://github.com/awslabs/fortuna

66bba06

Merge branch 'main' of https://github.com/awslabs/fortuna

911aa82

Merge branch 'main' of https://github.com/awslabs/fortuna

01f959b

Merge branch 'main' of https://github.com/awslabs/fortuna

79f8dca

Merge branch 'main' of https://github.com/awslabs/fortuna

4dea50f

Merge branch 'main' of https://github.com/awslabs/fortuna

1ced008

Merge branch 'main' of https://github.com/awslabs/fortuna

6992692

make small change in readme because of publish to pypi error

b2540c1

Merge branch 'main' of https://github.com/awslabs/fortuna

2362998

Merge branch 'main' of https://github.com/awslabs/fortuna

6e030f2

Merge branch 'main' of https://github.com/awslabs/fortuna

9bd6f67

Merge branch 'main' of https://github.com/awslabs/fortuna

c5bc94f

Merge branch 'main' of https://github.com/awslabs/fortuna

d3ab46b

Merge branch 'main' of https://github.com/awslabs/fortuna

0e2aca5

Merge branch 'main' of https://github.com/awslabs/fortuna

9520273

Merge branch 'main' of https://github.com/awslabs/fortuna

e9c4108

bump up version

bc64a01

gianlucadetommaso added 23 commits September 21, 2023 22:43

Merge branch 'main' of https://github.com/awslabs/fortuna

b4c161e

Merge branch 'main' of https://github.com/awslabs/fortuna

744dff1

Merge branch 'main' of https://github.com/awslabs/fortuna

a22f97f

Merge branch 'main' of https://github.com/awslabs/fortuna

fffdd76

Merge branch 'main' of https://github.com/awslabs/fortuna

c23d16d

Merge branch 'main' of https://github.com/awslabs/fortuna

1cb2917

Merge branch 'main' of https://github.com/awslabs/fortuna

9c1d07a

Merge branch 'main' of https://github.com/awslabs/fortuna

4b83638

Merge branch 'main' of https://github.com/awslabs/fortuna

610fc37

Merge branch 'main' of https://github.com/awslabs/fortuna

e5b67ba

Merge branch 'main' of https://github.com/awslabs/fortuna

1f03d4e

Merge branch 'main' of https://github.com/awslabs/fortuna

d49ed29

Merge branch 'main' of https://github.com/awslabs/fortuna

8200e42

Merge branch 'main' of https://github.com/awslabs/fortuna

882733b

Merge branch 'main' of https://github.com/awslabs/fortuna

c8ca7e6

Merge branch 'main' of https://github.com/awslabs/fortuna

b1e67fc

Merge branch 'main' of https://github.com/awslabs/fortuna

e6b8c85

Merge branch 'main' of https://github.com/awslabs/fortuna

2197430

copy embeddings during normalization

8a5dfdd

add hallucination multicalibrator

742954d

Merge branch 'main' of https://github.com/awslabs/fortuna

078e275

Merge branch 'main' into grouping2

abe2eec

improve type hinting

86d6ec5

wistuba reviewed Nov 16, 2023

View reviewed changes

gianlucadetommaso added 4 commits November 16, 2023 17:08

small refactoring of hallucination multicalibrator

75d4f7c

batchify processing of multiple answers for speedup

ea14d25

fix embedding dimension

dbe8ecd

change max number of clusters

493b020

gianlucadetommaso merged commit 76ad7a2 into main Nov 20, 2023
6 checks passed

gianlucadetommaso deleted the grouping2 branch November 20, 2023 12:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add hallucination multicalibrator, with example benchmark #152

Add hallucination multicalibrator, with example benchmark #152

gianlucadetommaso commented Nov 16, 2023

wistuba left a comment

wistuba Nov 16, 2023

gianlucadetommaso Nov 16, 2023

wistuba Nov 16, 2023

gianlucadetommaso Nov 16, 2023

wistuba Nov 16, 2023

wistuba Nov 16, 2023



		class TestScoringModel(unittest.TestCase):
		def test_score(self):

Add hallucination multicalibrator, with example benchmark #152

Add hallucination multicalibrator, with example benchmark #152

Conversation

gianlucadetommaso commented Nov 16, 2023

wistuba left a comment

Choose a reason for hiding this comment

wistuba Nov 16, 2023

Choose a reason for hiding this comment

gianlucadetommaso Nov 16, 2023

Choose a reason for hiding this comment

wistuba Nov 16, 2023

Choose a reason for hiding this comment

gianlucadetommaso Nov 16, 2023

Choose a reason for hiding this comment

wistuba Nov 16, 2023

Choose a reason for hiding this comment

wistuba Nov 16, 2023

Choose a reason for hiding this comment