This project leverages deepeval, an LLM output evaluation framework to function.
- Install Deepeval
pip install -u deepeval
deepeval login --confident-api-key <YOUR_DEEPEVAL_KEY>
Make sure you have OPENAI API KEY set
export OPENAI_API_KEY=<YOUR_OPEN_AI_KEY>
how to run deepeval evaluation
deepeval test run <PROGRAM.py>
TBD