Fix nested vocab_size for DistillationTrainer and GOLDTrainer by Beichen-Ma · Pull Request #5592 · huggingface/trl

Beichen-Ma · 2026-04-19T02:11:46Z

Fixes #5585. The root cause is that DistillationTrainer.__init__ calls teacher_model.resize_token_embeddings(self.model.config.vocab_size), which raises AttributeError on configs where vocab_size is nested (e.g. Qwen3_5Config exposes it under config.text_config).
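
The failure mode can be sketched without loading any model. The two classes below are hypothetical stand-ins, not the real transformers configs: one keeps vocab_size at the top level, the other exposes it only under text_config, as the description says Qwen3_5Config does. The value 32000 is purely illustrative.

```python
class FlatConfig:
    """Hypothetical stand-in for a config with a top-level vocab_size."""

    def __init__(self, vocab_size):
        self.vocab_size = vocab_size


class NestedConfig:
    """Hypothetical stand-in for a composite config.

    vocab_size is only reachable through text_config.
    """

    def __init__(self, vocab_size):
        self.text_config = FlatConfig(vocab_size)


def read_vocab_size_old(config):
    # Mirrors the pre-fix access pattern: a direct attribute lookup.
    return config.vocab_size


# Works on a flat config.
flat_result = read_vocab_size_old(FlatConfig(32000))

# Fails on a nested config, because the top-level attribute does not exist.
try:
    read_vocab_size_old(NestedConfig(32000))
    nested_error = None
except AttributeError as exc:
    nested_error = exc
```

In the trainer, this lookup happens during __init__, so the error surfaces before any training step runs.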

Fix

Replaced it with self.model.config.get_text_config().vocab_size. transformers defines get_text_config on PretrainedConfig; it returns the text sub-config on nested configs and the config itself on flat ones.
Applied the same fix to GOLDTrainer, which duplicated the same pattern. Updated an existing GOLD unit test whose SimpleNamespace mock didn't implement get_text_config.
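
A minimal sketch of the fixed access pattern, using duck-typed stand-ins so it runs without transformers. FlatConfig, NestedConfig, FakeTeacher, and resize_teacher_to_student are hypothetical names introduced for illustration; only get_text_config() and resize_token_embeddings() correspond to calls named in this PR, and the stand-in get_text_config only imitates the behavior described above.

```python
class FlatConfig:
    """Hypothetical flat config: get_text_config() returns the config itself."""

    def __init__(self, vocab_size):
        self.vocab_size = vocab_size

    def get_text_config(self):
        return self


class NestedConfig:
    """Hypothetical nested config: get_text_config() returns the text sub-config."""

    def __init__(self, vocab_size):
        self.text_config = FlatConfig(vocab_size)

    def get_text_config(self):
        return self.text_config


class FakeTeacher:
    """Hypothetical teacher model that only records the requested size."""

    def __init__(self):
        self.resized_to = None

    def resize_token_embeddings(self, new_num_tokens):
        self.resized_to = new_num_tokens


def resize_teacher_to_student(teacher_model, student_config):
    # Post-fix pattern: go through get_text_config() so that flat and
    # nested configs are handled by the same line of code.
    vocab_size = student_config.get_text_config().vocab_size
    teacher_model.resize_token_embeddings(vocab_size)
    return vocab_size


teacher_a, teacher_b = FakeTeacher(), FakeTeacher()
resize_teacher_to_student(teacher_a, FlatConfig(32000))
resize_teacher_to_student(teacher_b, NestedConfig(32000))
```

Both teachers end up resized to the same value, which is the point of routing the lookup through one accessor instead of branching on the config type.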

Tests

Reproduced the original error on Qwen3.5-0.8B student / Qwen3.5-2B teacher before the fix.
After the fix, trainer init and one training step complete cleanly.
pytest tests/experimental/test_gold_trainer.py passes.
DistillationTrainer has no existing unit test file. Given it's experimental and the fix is one line mirroring an existing pattern, I scoped this PR to the minimal fix rather than introducing a new test file.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

AI writing disclosure

We welcome the use of AI tools to help with contributions. For transparency and to help us improve our review process, please indicate the level of AI involvement in this PR.

No AI usage: the PR was written entirely by a human.
AI-assisted: some parts were suggested or improved by AI, but the PR was written and reviewed by a human.
AI-generated: the PR was mostly or fully generated by an AI tool.

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

Note

Medium Risk
Touches trainer initialization for DistillationTrainer and GOLDTrainer; a wrong vocab_size could mis-size teacher embeddings or break startup for some model configs, but the change is small and localized.

Overview
Fixes teacher embedding resizing for models whose vocab_size lives in a nested text config by switching from self.model.config.vocab_size to self.model.config.get_text_config().vocab_size in both DistillationTrainer and GOLDTrainer.

Updates the GOLD unit test mock config to implement get_text_config() so the init path continues to validate the resize call.
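
One way a SimpleNamespace mock can satisfy the new call is sketched below. This is an assumption about the shape of such a mock, not the actual test code from the repository; make_mock_config is a hypothetical helper and 128 is an arbitrary value.

```python
from types import SimpleNamespace


def make_mock_config(vocab_size):
    """Build a flat mock config that also answers get_text_config()."""
    config = SimpleNamespace(vocab_size=vocab_size)
    # A flat config returns itself as its text config, so the mock does too.
    config.get_text_config = lambda: config
    return config


mock_config = make_mock_config(128)

# The trainer-side expression from this PR now resolves on the mock.
resolved = mock_config.get_text_config().vocab_size
```

A plain SimpleNamespace(vocab_size=...) without the extra attribute would raise AttributeError on get_text_config, which is why the existing mock needed updating.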

Reviewed by Cursor Bugbot for commit 99bf555. Bugbot is set up for automated code reviews on this repo.

cmpatino · 2026-04-20T09:08:52Z

The changes look good to me! Let's wait for the review from another maintainer before merging

Beichen-Ma · 2026-05-08T20:01:23Z

Hey @cmpatino, just checking in on this — looks like it has the approvals and wanted to make sure it didn't slip through the cracks. Thanks!

HuggingFaceDocBuilderDev · 2026-05-11T12:58:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

cmpatino · 2026-05-11T13:18:13Z

Thank you for the ping @Beichen-Ma! Looks good to me, so I'm merging it.

fix nested vocab_size for DistillationTrainer and GOLDTrainer

d3d3aa1

Beichen-Ma mentioned this pull request Apr 19, 2026

[Bug] DistillationTrainer fails with Qwen3.5 due to nested config.vocab_size attribute #5585

Closed

5 tasks

k1064190 mentioned this pull request Apr 19, 2026

fix(distillation): reverse-KL server path NaN on variable completion length k1064190/trl#1

Closed

3 tasks

cmpatino self-assigned this Apr 20, 2026

cmpatino self-requested a review April 20, 2026 08:51

kashif approved these changes Apr 20, 2026


cmpatino approved these changes Apr 20, 2026


Beichen-Ma added 3 commits April 21, 2026 23:07

Merge branch 'main' into fix-nested-vocab-size

1f33cd4

Merge branch 'main' into fix-nested-vocab-size

d1e8543

Merge branch 'main' into fix-nested-vocab-size

12dfe53

Merge branch 'main' into fix-nested-vocab-size

99bf555

cmpatino merged commit 9ff8c78 into huggingface:main May 11, 2026
4 checks passed

cmpatino merged 5 commits into huggingface:main from Beichen-Ma:fix-nested-vocab-size

4 participants
