feat: ✨ support Llama 3.2 Models with chat/completions request #1213

sidhartha-roy · 2025-10-14T18:53:57Z

This PR adds support for evaluating the Llama3.2-1B model with chat/completions requests. This is helpful in evaluating Llama models that only support chat/completions endpoint. For example, with Protopia where a stained glass transform transforms in the input text to obfuscated embeddings and we evaluate the Llama models on the obfuscated prompt. These are the baseline scores.

The scores comparing Llama 3.2 1B with the completions and chat/completions endpoint that is proposed in the PR.

This PR adds support for Llama models that are deployed using the chat/completions endpoint instead of the completions endpoint.

…ed models

sidhartha-roy · 2025-10-14T20:12:45Z

@HuanzhiMao, I would really appreciate it if you could take a look at this PR.

sidhartha-roy · 2025-11-12T18:51:37Z

@ShishirPatil can you please take a look at this PR?

sidhartha-roy and others added 8 commits October 9, 2025 16:50

feat: ✨ add support for llama 3.2 1B with chat/completions endpoint

547b36e

This PR adds support for Llama models that are deployed using the chat/completions endpoint instead of the completions endpoint.

Merge branch 'main' into support-llama-3.2-1B-chat-completions

16ce208

fix: 🐛 fix some merge issues

9b6e739

fix: 🐛 fix max_tokens and model config changes

f3aa465

fix: 🐛 fix model name for Llama 3.2 1B

84cbaa5

docs: 📝 Update Llama 3.2 1B chat completions model in list of support…

f3caf03

…ed models

docs: 📝 update the docstrings for LlamaChatCompletionsHandler

7b1819f

Merge branch 'main' into support-llama-3.2-1B-chat-completions

5b6e1fc

sidhartha-roy added 4 commits October 15, 2025 08:57

style: 💡 remove debug comments

9fa0806

feat: ✨ support 70B and 3B and fix notations

ea8b5c7

Merge branch 'main' into support-llama-3.2-1B-chat-completions

59196f3

refactor: ♻️ move query prompting to LlamaChatCompletions handler

192f77a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: ✨ support Llama 3.2 Models with chat/completions request #1213

feat: ✨ support Llama 3.2 Models with chat/completions request #1213

sidhartha-roy commented Oct 14, 2025

Uh oh!

sidhartha-roy commented Oct 14, 2025

Uh oh!

sidhartha-roy commented Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: ✨ support Llama 3.2 Models with chat/completions request #1213

Are you sure you want to change the base?

feat: ✨ support Llama 3.2 Models with chat/completions request #1213

Conversation

sidhartha-roy commented Oct 14, 2025

Uh oh!

sidhartha-roy commented Oct 14, 2025

Uh oh!

sidhartha-roy commented Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant