Skip to content

Conversation

@sidhartha-roy
Copy link

This PR adds support for evaluating the Llama3.2-1B model with chat/completions requests. This is helpful in evaluating Llama models that only support chat/completions endpoint. For example, with Protopia where a stained glass transform transforms in the input text to obfuscated embeddings and we evaluate the Llama models on the obfuscated prompt. These are the baseline scores.

The scores comparing Llama 3.2 1B with the completions and chat/completions endpoint that is proposed in the PR.
image
image

@sidhartha-roy
Copy link
Author

@HuanzhiMao, I would really appreciate it if you could take a look at this PR.

@sidhartha-roy
Copy link
Author

@ShishirPatil can you please take a look at this PR?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant