
Conversation


@robtinn commented on Nov 24, 2025

Summary

  • Added functions to generate synthetic data for this example, since examples from HealthBench may not always be suitable for fine-tuning (see the sketch after this list).
  • Also tidied up some of the wording and the visualization cells in the notebook.
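
For orientation, here is a minimal sketch of what such a synthetic-data generation cell might look like, assuming a hypothetical SyntheticData Pydantic model and the client.responses.parse(..., text_format=...) call quoted in the review below; the model name, prompt text, and schema fields are illustrative and not taken from the notebook.

from pydantic import BaseModel
from openai import OpenAI

client = OpenAI()

# Hypothetical schema for one synthetic HealthBench-style example.
class SyntheticExample(BaseModel):
    prompt: str
    response: str

# Hypothetical container the model is asked to fill.
class SyntheticData(BaseModel):
    examples: list[SyntheticExample]

# Request structured synthetic examples; parameter names mirror the notebook
# snippet quoted in the review below and may differ across openai-python SDK versions.
response = client.responses.parse(
    model="gpt-4.1",  # illustrative model name
    input=[
        {"role": "system", "content": "Generate synthetic healthcare Q&A examples."},
        {"role": "user", "content": "Produce twenty examples."},
    ],
    text_format=SyntheticData,
)

synthetic = response.output_parsed  # SyntheticData instance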


@chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.


Comment on lines +273 to 277
" \"content\": f\"Produce twenty examples.\"\n",
" }\n",
" ],\n",
"text_format=SyntheticData\n",
")\n",


P1: responses.parse called with unsupported argument

The new synthetic-data cell calls client.responses.parse(..., text_format=SyntheticData) but the responses API expects a response_format argument for Pydantic parsing; text_format is not a valid parameter and will raise a TypeError before any data is generated. With this change, the notebook cannot produce the synthetic train/val/test JSONL files and the fine-tuning walk-through stops at this cell.
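
For context on the train/val/test JSONL files mentioned above, here is a minimal sketch of how parsed synthetic examples might be split and written out; file names, split ratios, and field names are assumptions rather than the notebook's actual code.

import json
import random

def write_jsonl(path, rows):
    # One JSON object per line, the usual layout for fine-tuning JSONL files.
    with open(path, "w") as f:
        for row in rows:
            f.write(json.dumps(row) + "\n")

# 'synthetic' is the parsed SyntheticData object from the sketch above.
rows = [ex.model_dump() for ex in synthetic.examples]
random.shuffle(rows)

# Illustrative 80/10/10 split.
n = len(rows)
train = rows[: int(0.8 * n)]
val = rows[int(0.8 * n) : int(0.9 * n)]
test = rows[int(0.9 * n) :]

write_jsonl("train.jsonl", train)
write_jsonl("val.jsonl", val)
write_jsonl("test.jsonl", test)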


@robtinn merged commit f249969 into main on Nov 24, 2025
1 check passed
@robtinn deleted the rtinn/rft_updates branch on November 24, 2025 at 09:40