Improve PEFT integration #4723

qgallouedec · 2025-12-19T04:14:37Z

This PR makes four changes:

Fixes the type hints and documentation for the model argument in SFTTrainer, GRPOTrainer, RLOOTrainer, and RewardTrainer to allow PeftModel.
Uses is_peft_model instead of isinstance(model, PeftModel). The two checks are equivalent, but is_peft_model is more explicit, so this PR adopts it consistently across the codebase.
Introduces use_adapter, a context manager that temporarily selects an adapter (this is useful for point 4).
For methods that use a ref_model (GRPO, RLOO), when the provided model is a PeftModel, the adapter is cloned under the name "ref". Previously, the reference model was obtained by just disabling the adapter, which is only correct when the adapter is newly initialized—not when it has already been pretrained. An example of this issue can be found in Allow swapping PEFT adapters for target/ref model. #1193.

Note that we don't apply these changes for DPO as there is currently a refactoring in #3906

HuggingFaceDocBuilderDev · 2025-12-19T04:39:09Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

albertvillanova

Thanks for the improvement.

CI is red:

AttributeError: 'list' object has no attribute 'keys'

and docstyle

edbeeching · 2026-01-05T08:32:20Z

trl/trainer/grpo_trainer.py

    def __init__(
        self,
-        model: str | PreTrainedModel,
+        model: "str | PreTrainedModel | PeftModel",


Suggested change

model: "str | PreTrainedModel | PeftModel",

model: str | PreTrainedModel | "PeftModel",

edbeeching · 2026-01-05T08:35:57Z

trl/trainer/rloo_trainer.py

    def __init__(
        self,
-        model: str | PreTrainedModel,
+        model: "str | PreTrainedModel | PeftModel",


Suggested change

model: "str | PreTrainedModel | PeftModel",

model: str | PreTrainedModel | "PeftModel",

edbeeching · 2026-01-05T08:36:25Z

trl/trainer/sft_trainer.py

    def __init__(
        self,
-        model: str | PreTrainedModel,
+        model: "str | PreTrainedModel | PeftModel",


Suggested change

model: "str | PreTrainedModel | PeftModel",

model: str | PreTrainedModel | "PeftModel",

edbeeching

I believe the type annotations could clearer with a small change, but otherwise LGTM

qgallouedec · 2026-01-06T16:46:55Z

I believe the type annotations could clearer with a small change, but otherwise LGTM

Agree, but it seems not correct:

from peft import PeftModel

def func(model: str | "PeftModel"): ...

Traceback (most recent call last):
  File "/fsx/qgallouedec/trl/dem.py", line 3, in <module>
    def func(model: str | "PeftModel"): ...
                    ~~~~^~~~~~~~~~~~~
TypeError: unsupported operand type(s) for |: 'type' and 'str'

qgallouedec and others added 6 commits December 17, 2025 22:43

Disallow PeftModel + peft_config in trainers

e421a04

remove tests

1d603ff

remove old comments

f4019fa

Merge branch 'main' into disallow-peft-model-with-config

6a71878

Better peft integration

7406c79

type hint

e7de51b

tiny models

f16ba77

qgallouedec requested review from albertvillanova, edbeeching, kashif and lewtun December 19, 2025 05:12

Remove force option from push_to_hub calls in generate_tiny_models.py

6d80078

qgallouedec changed the title ~~[WIP] Improve PEFT integration~~ Improve PEFT integration Dec 19, 2025

albertvillanova requested changes Dec 19, 2025

View reviewed changes

Base automatically changed from disallow-peft-model-with-config to main December 19, 2025 15:09

qgallouedec and others added 3 commits December 21, 2025 14:56

Merge branch 'main' into better-peft-integration

f38410d

Merge branch 'main' into better-peft-integration

d1c3542

Don't push tokenizer with adapter

4714d0e

and docstyle

qgallouedec requested a review from albertvillanova December 22, 2025 22:14

edbeeching reviewed Jan 5, 2026

View reviewed changes

edbeeching approved these changes Jan 5, 2026

View reviewed changes

Merge branch 'main' into better-peft-integration

6cbab27

qgallouedec enabled auto-merge (squash) January 6, 2026 18:02

Merge branch 'main' into better-peft-integration

f648da9

qgallouedec disabled auto-merge January 6, 2026 18:03

qgallouedec merged commit 42cce51 into main Jan 6, 2026
11 checks passed

qgallouedec deleted the better-peft-integration branch January 6, 2026 18:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve PEFT integration #4723

Improve PEFT integration #4723

Uh oh!

qgallouedec commented Dec 19, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Dec 19, 2025

Uh oh!

albertvillanova left a comment •

edited

Loading

Uh oh!

edbeeching Jan 5, 2026 •

edited

Loading

Uh oh!

edbeeching Jan 5, 2026

Uh oh!

edbeeching Jan 5, 2026

Uh oh!

edbeeching left a comment

Uh oh!

qgallouedec commented Jan 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	model: "str \| PreTrainedModel \| PeftModel",
	model: str \| PreTrainedModel \| "PeftModel",

Improve PEFT integration #4723

Improve PEFT integration #4723

Uh oh!

Conversation

qgallouedec commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Dec 19, 2025

Uh oh!

albertvillanova left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

edbeeching Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

edbeeching Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

edbeeching Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

edbeeching left a comment

Choose a reason for hiding this comment

Uh oh!

qgallouedec commented Jan 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

qgallouedec commented Dec 19, 2025 •

edited

Loading

albertvillanova left a comment •

edited

Loading

edbeeching Jan 5, 2026 •

edited

Loading