We are updating the code a lot; it would be great to have even just really simple tests to run to ensure nothing breaks. Let's brainstorm some ideas and what to apply them to: podcast, 247, glove, just one conversation? How do we evaluate success?
All of this should be automated in a Makefile target.

What are we testing:
- generate glove and gpt2 embeddings
  - one subject, 625
  - two big conversations
  - result: base and embedding pickles
- encoding:
  - run encoding for 5 good electrodes with glove/gpt2 and plot
  - result: average encoding

Create a standard for what we will compare against in the future, but first we need to ensure the current code replicates previous results for all electrodes and conversations.
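One way to pin down that "standard" is a tolerance-based comparison of a fresh run against a stored baseline pickle. A minimal sketch of the idea; the function name, file path, and tolerance values here are assumptions, not existing repo code:

```python
import pickle
import numpy as np

def matches_baseline(new, baseline, rtol=1e-4, atol=1e-8):
    """Return True if a new result array replicates the stored baseline
    within floating-point tolerance (exact equality is too strict across
    library versions and hardware)."""
    new, baseline = np.asarray(new), np.asarray(baseline)
    if new.shape != baseline.shape:
        return False
    return bool(np.allclose(new, baseline, rtol=rtol, atol=atol))

# In practice the baseline would come from a frozen results folder, e.g.:
#   baseline = pickle.load(open("results-baseline/encoding.pkl", "rb"))
# (hypothetical path). Here we use synthetic data to show the behavior:
baseline = np.linspace(0.0, 0.5, 161)          # fake average-encoding curve
assert matches_baseline(baseline + 1e-9, baseline)      # tiny drift: OK
assert not matches_baseline(baseline + 0.1, baseline)   # real change: flagged
```

The tolerance matters: pinning to exact equality will fail on harmless numerical noise, while too loose a tolerance will miss regressions in the encoding.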
A hacky way to manage results folders:

```shell
mv results results-old
mkdir results
# DO TEST
mv results results-test
mv results-old results
```
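The swap above can be wrapped in a small script with a guard so an existing backup is never clobbered. A sketch; the DO TEST step stays a placeholder for whatever the real test target ends up being, and the demo setup lines only exist so the snippet runs standalone:

```shell
#!/bin/sh
set -e

# Demo setup so this sketch runs standalone; in the repo, results/ already exists.
mkdir -p results
touch results/encoding.pkl

# Refuse to overwrite a previous backup from an interrupted run.
if [ -e results-old ]; then
    echo "results-old already exists; aborting" >&2
    exit 1
fi

mv results results-old
mkdir results

# DO TEST: placeholder for the real test run, which writes into results/
touch results/encoding.pkl

mv results results-test
mv results-old results
```

If the test step fails midway, `set -e` stops the script with `results-old` still intact, so nothing is lost; the original `results/` just needs one `mv` to restore.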