feat: implement apply_lora_scale to remove boilerplate. #12994

sayakpaul · 2026-01-19T04:55:05Z

What does this PR do?

Currently, we have this pattern throughout the modeling implementations:

if joint_attention_kwargs is not None:
    joint_attention_kwargs = joint_attention_kwargs.copy()
    lora_scale = joint_attention_kwargs.pop("scale", 1.0)
else:
    lora_scale = 1.0

if USE_PEFT_BACKEND:
    # weight the lora layers by setting `lora_scale` for each PEFT layer
    scale_lora_layers(self, lora_scale)
else:
    if joint_attention_kwargs is not None and joint_attention_kwargs.get("scale", None) is not None:
        logger.warning(
            "Passing `scale` via `joint_attention_kwargs` when not using the PEFT backend is ineffective."
        )


...


if USE_PEFT_BACKEND:
    # remove `lora_scale` from each PEFT layer
    unscale_lora_layers(self, lora_scale)

IMO, this is not pretty and should be minimized to keep the implementation of forward() clean and self-contained.

Hence, this PR introduces a decorator apply_lora_scale that can be used to decorate the forward method of a model supporting LoRA. I think this will help us reduce a bunch of boilerplate code.
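To make the idea concrete, here is a minimal sketch of what such a decorator could look like. It is not the implementation from this PR. The `USE_PEFT_BACKEND` flag and the `scale_lora_layers` / `unscale_lora_layers` functions below are stand-in stubs so the snippet runs without diffusers installed, `ToyModel` is hypothetical, and the sketch assumes the kwargs dict is passed by keyword. The `try`/`finally` is a choice made for this sketch, not something the PR description states.

```python
import functools
import logging

logger = logging.getLogger(__name__)

# Stand-in stubs (assumptions for this sketch), replacing the diffusers utilities
# of the same names so the example is self-contained.
USE_PEFT_BACKEND = True
calls = []


def scale_lora_layers(model, weight):
    calls.append(("scale", weight))


def unscale_lora_layers(model, weight):
    calls.append(("unscale", weight))


def apply_lora_scale(kwargs_name="joint_attention_kwargs"):
    """Decorator factory: scale LoRA layers before `forward` and unscale after."""

    def decorator(forward_fn):
        @functools.wraps(forward_fn)
        def wrapper(self, *args, **kwargs):
            attention_kwargs = kwargs.get(kwargs_name)
            scale_was_passed = attention_kwargs is not None and "scale" in attention_kwargs
            lora_scale = 1.0
            if attention_kwargs is not None:
                # Copy so the caller's dict is not mutated, then pop `scale`.
                attention_kwargs = dict(attention_kwargs)
                lora_scale = attention_kwargs.pop("scale", 1.0)
                kwargs[kwargs_name] = attention_kwargs

            if USE_PEFT_BACKEND:
                scale_lora_layers(self, lora_scale)
            elif scale_was_passed:
                logger.warning(
                    f"Passing `scale` via `{kwargs_name}` when not using the PEFT backend is ineffective."
                )
            try:
                return forward_fn(self, *args, **kwargs)
            finally:
                # Undo the scaling even if `forward` raises.
                if USE_PEFT_BACKEND:
                    unscale_lora_layers(self, lora_scale)

        return wrapper

    return decorator


class ToyModel:
    @apply_lora_scale("joint_attention_kwargs")
    def forward(self, hidden_states, joint_attention_kwargs=None):
        # By this point the decorator has already removed `scale`.
        return hidden_states, joint_attention_kwargs


user_kwargs = {"scale": 0.5, "other": 1}
out, seen_kwargs = ToyModel().forward("x", joint_attention_kwargs=user_kwargs)
```

With this shape, the body of forward() only deals with the model's own logic, and the scale/unscale pair lives in one place.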

To keep the PR simple, I have only applied the decorator to src/diffusers/models/transformers/transformer_flux.py. The LoRA tests for it are passing, indicating this direction might be a nice one.

sayakpaul · 2026-01-19T04:55:31Z

@DN6 would love to know your thoughts!

sayakpaul · 2026-01-19T05:03:09Z

src/diffusers/models/transformers/transformer_flux.py


 from ...configuration_utils import ConfigMixin, register_to_config
 from ...loaders import FluxTransformer2DLoadersMixin, FromOriginalModelMixin, PeftAdapterMixin
-from ...utils import USE_PEFT_BACKEND, logging, scale_lora_layers, unscale_lora_layers


This yields 21 LoC deletions. We have this pattern in about 32 files, so this amounts to roughly 672 deletions (21 × 32). Not bad, IMO.

HuggingFaceDocBuilderDev · 2026-01-19T05:03:37Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2026-01-27T10:34:23Z

src/diffusers/models/transformers/transformer_flux.py


        self.gradient_checkpointing = False

+    @apply_lora_scale("joint_attention_kwargs")


Nice! 👍🏽 Design looks good to me.

feat: implement apply_lora_scale to remove boilerplate.

afa4a23

sayakpaul requested a review from DN6 January 19, 2026 04:55

sayakpaul commented Jan 19, 2026

Merge branch 'main' into apply-lora-scale-decorator

835a087

DN6 reviewed Jan 27, 2026

sayakpaul added 3 commits January 27, 2026 20:21

Merge branch 'main' into apply-lora-scale-decorator

3cdce4d

Merge branch 'main' into apply-lora-scale-decorator

9afafe5

apply to the rest.

d6fcd78

sayakpaul changed the title ~~[wip] feat: implement apply_lora_scale to remove boilerplate.~~ feat: implement apply_lora_scale to remove boilerplate. Jan 28, 2026

sayakpaul added 4 commits January 28, 2026 12:10

up

290f749

remove more.

458ac94

remove.

8c402d3

fix

e5ebacb

sayakpaul marked this pull request as ready for review January 28, 2026 07:11

sayakpaul requested a review from DN6 January 28, 2026 08:28

4 participants

