Align KTO with DPO: Align processing_class initialization by albertvillanova · Pull Request #5578 · huggingface/trl

albertvillanova · 2026-04-17T07:22:42Z

Align KTO with DPO: Align processing_class initialization.

This PR updates the KTOTrainer class to streamline and clarify how processing_class is handled. The main changes are narrowing the accepted types for processing_class, updating its initialization logic, and ensuring it is used consistently throughout the code. Documentation and type annotations have also been improved for clarity.

Part of:

KTO refactoring #4786

Changes

Processing class handling improvements:

The accepted types for processing_class are now limited to PreTrainedTokenizerBase or ProcessorMixin, removing support for BaseImageProcessor and FeatureExtractionMixin.
If processing_class is not provided, it is now automatically loaded using AutoProcessor.from_pretrained, and appropriate error handling is added if the class is not of the expected type.
The code now consistently uses the tokenizer derived from the processing_class for padding and tokenization, and updates all relevant references accordingly.
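The handling described in the list above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from the PR: `FakeTokenizer`, `FakeProcessor`, `resolve_tokenizer`, and the `loader` parameter are hypothetical stand-ins so the snippet runs without `transformers`. In the trainer itself the type checks are against PreTrainedTokenizerBase and ProcessorMixin, and a missing processing_class is loaded with AutoProcessor.from_pretrained.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple, Union


@dataclass
class FakeTokenizer:
    """Hypothetical stand-in for a PreTrainedTokenizerBase."""
    eos_token: str = "</s>"
    pad_token: Optional[str] = None


@dataclass
class FakeProcessor:
    """Hypothetical stand-in for a ProcessorMixin that wraps a tokenizer."""
    tokenizer: FakeTokenizer


ProcessingClass = Union[FakeTokenizer, FakeProcessor]


def resolve_tokenizer(
    processing_class: Optional[ProcessingClass] = None,
    loader: Optional[Callable[[], ProcessingClass]] = None,
) -> Tuple[ProcessingClass, FakeTokenizer]:
    """Return (processing_class, tokenizer) following the rules listed above."""
    if processing_class is None:
        # Assumption: `loader` plays the role of AutoProcessor.from_pretrained.
        if loader is None:
            raise ValueError("No processing_class given and no loader available.")
        processing_class = loader()

    if isinstance(processing_class, FakeProcessor):
        # A processor carries its tokenizer as an attribute.
        tokenizer = processing_class.tokenizer
    elif isinstance(processing_class, FakeTokenizer):
        tokenizer = processing_class
    else:
        raise TypeError("processing_class must be a tokenizer or a processor.")

    # Fall back to the EOS token when no padding token is defined.
    if tokenizer.pad_token is None:
        tokenizer.pad_token = tokenizer.eos_token
    return processing_class, tokenizer
```

Calling `resolve_tokenizer(FakeProcessor(FakeTokenizer()))` returns the wrapped tokenizer with its pad_token set to the EOS token, while an unsupported object raises a `TypeError`, which mirrors the narrowed type contract described above.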

Documentation and error handling:

The docstrings for KTOTrainer and its __init__ method are updated to reflect the new requirements and initialization behavior for processing_class.
Removed redundant error checks for processing_class being None since it is now always set during initialization.
Minor cleanup, such as removing the unused assignment of self.processing_class.

Note

Medium Risk
Moderate risk because it changes KTOTrainer initialization defaults (auto-loading a processor and auto-setting pad_token), which can alter tokenization/padding behavior or break callers relying on previously accepted processor types.

Overview
Aligns KTOTrainer’s processing_class handling with DPOTrainer: it now only accepts a PreTrainedTokenizerBase or ProcessorMixin, auto-loads one via AutoProcessor when omitted, and normalizes usage through a derived tokenizer (including defaulting pad_token to eos_token).

Removes the prior requirement/error path for explicitly passing processing_class, updates the default collator and dataset tokenization steps to use the derived tokenizer, and updates docstrings/type hints accordingly.

Reviewed by Cursor Bugbot for commit 1585842. Bugbot is set up for automated code reviews on this repo.

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

Reviewed by Cursor Bugbot for commit e303285.

HuggingFaceDocBuilderDev · 2026-04-17T07:25:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2026-04-17T13:19:37Z

+        if tokenizer.pad_token is None:
+            tokenizer.pad_token = tokenizer.eos_token


Just a question that came up while reviewing the PR. Do we actually need this? Technically we tokenize per sample, and we don't delegate the padding to the tokenizer (padding is done in the collator).

We instantiate the collator with pad_token_id=tokenizer.pad_token_id

Yes, but we could instantiate it with tokenizer.pad_token or tokenizer.eos_token.
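To make the trade-off in this thread concrete, here is a minimal sketch of collator-side padding that falls back to the EOS id when no pad id is set, instead of mutating the tokenizer. `resolve_pad_token_id` and `pad_batch` are hypothetical helpers written for illustration, not TRL functions, and right-padding is an assumption of the sketch.

```python
from typing import List, Optional


def resolve_pad_token_id(pad_token_id: Optional[int], eos_token_id: int) -> int:
    """Pick the id used for padding, falling back to EOS when pad is unset."""
    # Explicit None check: a pad id of 0 is valid and must not trigger the fallback.
    return eos_token_id if pad_token_id is None else pad_token_id


def pad_batch(sequences: List[List[int]], pad_token_id: int) -> List[List[int]]:
    """Right-pad every sequence to the length of the longest one in the batch."""
    max_len = max(len(seq) for seq in sequences)
    return [seq + [pad_token_id] * (max_len - len(seq)) for seq in sequences]


# Example: a tokenizer without a pad token, whose EOS id is 2.
pad_id = resolve_pad_token_id(None, eos_token_id=2)
batch = pad_batch([[5, 6, 7], [8]], pad_id)
```

Either approach gives the collator a usable pad id. Setting pad_token on the tokenizer, as the PR does, also keeps any other code that reads tokenizer.pad_token consistent with what the collator uses.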

qgallouedec · 2026-04-17T13:23:59Z

The code now consistently uses the tokenizer derived from the processing_class for padding and tokenization, and updates all relevant references accordingly.

Why not the processing class?

albertvillanova · 2026-04-17T13:32:31Z

Why not the processing class?

Because _tokenize and _process_tokens use a tokenizer instance.

Align processing_class init and docstring

e303285

cursor Bot reviewed Apr 17, 2026

Comment thread trl/experimental/kto/kto_trainer.py Outdated

Comment thread trl/experimental/kto/kto_trainer.py

albertvillanova added 2 commits April 17, 2026 10:46

Make pad_token fallback to eos_token

cef38ef

Pass tokenizer to _tokenize and _process_tokens

1585842

qgallouedec approved these changes Apr 17, 2026

albertvillanova merged commit 8ff0069 into huggingface:main Apr 17, 2026
14 checks passed

albertvillanova merged 3 commits into huggingface:main from albertvillanova:align-kto-dpo-processing_class


3 participants

        if tokenizer.pad_token is None:
            tokenizer.pad_token = tokenizer.eos_token
