add example for Llama3 vllm server #404

Open
wants to merge 1 commit into main
Conversation

cameron-chen

Hello, I am creating this PR to share an example of evaluating with a local model via API calls (vLLM server).

I find this approach can be quite useful when:

  • serving the local annotator on a cluster so that other nodes only need to make API calls to run evaluations.
  • the user wants to use a "weighted"-style annotator similar to weighted_alpaca_eval_gpt4_turbo (see the sketch right after this list).
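For context, the "weighted" idea turns the per-token log-probabilities returned by the server into a continuous preference instead of a hard choice. The snippet below is only a rough illustration of that idea; the label tokens and the normalization are placeholders, not the exact logic of weighted_alpaca_eval_gpt4_turbo:

```python
import math

def weighted_preference(top_logprobs: dict[str, float],
                        label_a: str = "1", label_b: str = "2") -> float:
    """Convert the logprobs of two candidate labels into P(prefer B).

    `top_logprobs` maps candidate first tokens to their log-probabilities,
    e.g. {"1": -1.2, "2": -0.4}. The labels "1"/"2" are placeholders; the
    real annotator's prompt defines its own label tokens.
    """
    p_a = math.exp(top_logprobs.get(label_a, float("-inf")))
    p_b = math.exp(top_logprobs.get(label_b, float("-inf")))
    return p_b / (p_a + p_b)

print(weighted_preference({"1": -1.2, "2": -0.4}))  # ~0.69, a soft preference for label "2"
```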

Please let me know if this looks good. I am happy to add more detailed instructions.


To use the API, add the client config local_configs.yaml and activate it:

default:
    - api_key: "token-abc123"
      base_url: "http://localhost:8000/v1"
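As a sanity check (hypothetical, not part of this PR), the endpoint configured above can be queried directly with the standard openai Python client; the model id below is an assumption and should match whatever model the vLLM server was launched with:

```python
from openai import OpenAI

# Same credentials as in local_configs.yaml above.
client = OpenAI(api_key="token-abc123", base_url="http://localhost:8000/v1")

response = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3-70B-Instruct",  # assumed: the model vLLM was started with
    messages=[{"role": "user", "content": "Say hello."}],
    max_tokens=16,
    logprobs=True,      # token logprobs are what a "weighted"-style annotator consumes
    top_logprobs=5,
)
print(response.choices[0].message.content)
print(response.choices[0].logprobs.content[0].top_logprobs)
```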

@YannDubs
Collaborator

LGTM, can you add a README.md in src/alpaca_eval/evaluators_configs/weighted_alpaca_eval_vllm_llama3_70b/ that explains the setup and goal? thanks!
