BUG🐛: Fixed scale-related bugs in LoKr | Added rank_dropout_scale parameter #2180
Conversation
@BenjaminBossan Please review the minor corrections to LoKr. The default initialization of the W1 matrix is zeros, and I am not sure whether to change it or not. Also, shall I rewrite this LoKr implementation to remove the lycoris dependency? I can reuse most of the LycorisLokr implementation code.
Thanks a lot for working on these bug fixes for LoKr, this is really appreciated.
I have some small comments, but overall this looks good.
The default initialization of the W1 matrix is zeros, and I am not sure whether to change it or not.
I'd say leave it as is. What I think could be useful is to add a new option to the config that allows users to use the lycoris way of layer initialization; then they have a choice of which one they want. That could be done in a separate PR.
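To illustrate why the zero default is safe: the LoKr delta weight is a Kronecker product of the two factors, so zeroing either factor makes the adapter a no-op at initialization. A minimal sketch (not PEFT's actual initialization code):

```python
import torch

# LoKr's delta weight is the Kronecker product of two factors,
# so a single zero-initialized factor zeroes out the whole delta.
w1 = torch.zeros(4, 4)          # zero-initialized factor (the default for W1)
w2 = torch.randn(8, 8)          # randomly initialized factor
delta_w = torch.kron(w1, w2)    # delta_W = W1 ⊗ W2
assert torch.all(delta_w == 0)  # the adapter starts as a no-op
```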
Also, shall I rewrite this LoKr implementation to remove the lycoris dependency? I can reuse most of the LycorisLokr implementation code.
I'm not sure what you mean by this, could you please elaborate?
I meant to say, we still have the lycoris_utils dependency. Also, the codebase is kind of old for LoKr. Since I have updated the LoKr codebase w.r.t. the latest PEFT standards in LycorisLoKr #2133, it would be a simple copy-paste and we wouldn't need a separate implementation.
Ah, I see what you mean, thanks for explaining. I wasn't sure what you were referring to. To give a bit of context: in general, I like that this PR is small and thus easy to review. If you rewrote LoKr to remove the usage of lycoris_utils, that would be a much bigger change. We could still take this step; I think it would make sense if we also plan to move LoHa away from lycoris_utils.
Hmm, makes sense given that LoKr/LoHa are used less frequently. Then I will make the suggested changes. The big question is what to do with LycorisLoKr #2133 😅. Shall I close it, since we would be using very little of it?
@BenjaminBossan Done with suggested changes ✅
A separate flag is not needed, they can pass …
What I meant is that we found that lycoris initializes layers a bit differently than PEFT, as I mentioned on the other PR:
We could have an init option like init_weights="lycoris".
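A minimal sketch of how such an option might be used, assuming the config field ends up being called `init_weights` as in the tests discussed below; the model name and target modules are placeholders:

```python
from transformers import AutoModelForCausalLM
from peft import LoKrConfig, get_peft_model

# Placeholder base model and target modules; init_weights="lycoris" is the
# option proposed in this thread, not a long-standing API.
base_model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
config = LoKrConfig(
    r=8,
    alpha=8,
    target_modules=["q_proj", "v_proj"],
    init_weights="lycoris",  # opt into lycoris-style layer initialization
)
model = get_peft_model(base_model, config)
```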
@BenjaminBossan Please review; added lycoris-style initialization and removed the alpha parameter setting.
Thanks so much for the updates. This PR is almost good to go. I found some nits that I commented on. Apart from that, since a new initialization scheme was added, let's add a test for this.
I think the best place would be in test_initialization.py. Let's create a test class similar to what we have for LoRA. The tests can be very simple: one test for init_weights=True that checks that we get the same output as from the base model; same for init_weights="lycoris". Finally, a test for init_weights=False that checks that the output is different from the base model's output. No need for any stats on the outputs. LMK if you have questions.
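A rough sketch of what such tests could look like, using a toy model and illustrative names rather than the actual code in test_initialization.py:

```python
import copy

import torch
from torch import nn
from peft import LoKrConfig, get_peft_model


class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.lin0 = nn.Linear(10, 20)
        self.lin1 = nn.Linear(20, 2)

    def forward(self, x):
        return self.lin1(torch.relu(self.lin0(x)))


def test_lokr_identity_init_preserves_base_output():
    torch.manual_seed(0)
    x = torch.randn(4, 10)
    base = MLP().eval()
    out_base = base(x)
    for init in (True, "lycoris"):
        config = LoKrConfig(target_modules=["lin0"], init_weights=init)
        peft_model = get_peft_model(copy.deepcopy(base), config).eval()
        # with a no-op initialization, the adapter must not change the output
        assert torch.allclose(out_base, peft_model(x), atol=1e-6)


def test_lokr_random_init_changes_base_output():
    torch.manual_seed(0)
    x = torch.randn(4, 10)
    base = MLP().eval()
    out_base = base(x)
    config = LoKrConfig(target_modules=["lin0"], init_weights=False)
    peft_model = get_peft_model(copy.deepcopy(base), config).eval()
    # with random init, the adapter should change the output
    assert not torch.allclose(out_base, peft_model(x))
```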
@BenjaminBossan Added the test cases, please review.
Fantastic, thanks for the updates. Tests look good too. There are only a handful of small issues left, please take a look.
@BenjaminBossan The docstring changes are addressed.
Thanks for addressing my last points. This PR LGTM, thanks a lot for your amazing work on LoKr.
I'd like to leave this PR open for a little bit to see if there is more feedback from other folks. If not, I'll probably merge it at the start of next week.
Again, thanks so much @yaswanth19 for implementing these fixes and enhancements for LoKr. Since there hasn't been any further feedback, I decided to merge now. LMK if you plan on working on further fixes for LoKr or LoHa. Ideally, we could have them all in the same PEFT release, as users may have to retrain their models, depending on the type of fix.
@BenjaminBossan I don't have any plans to work on this immediately; it would probably be best if somebody else picks this up, so the changes land in the same release.
All right, thanks for letting me know.
This PR adds a new parameter rank_dropout_scale and fixes scale-related bugs in LoKr. Please refer to the following function in Lycoris: https://github.com/KohakuBlueleaf/LyCORIS/blob/258387f586beabfca71646a9671027f75ed34597/lycoris/modules/lokr.py#L347
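For reference, a simplified sketch of the rank-dropout scaling idea from the linked lycoris function; this is illustrative only (the helper name is made up, and edge cases such as every rank being dropped are ignored):

```python
import torch

def apply_rank_dropout(weight: torch.Tensor, rank_dropout: float, rank_dropout_scale: bool) -> torch.Tensor:
    # drop whole ranks (first-dimension slices of the delta weight)
    drop = (torch.rand(weight.size(0), device=weight.device) > rank_dropout).float()
    drop = drop.view(-1, *[1] * (weight.dim() - 1))
    if rank_dropout_scale:
        # rescale the surviving ranks so the expected magnitude of the result
        # is preserved, analogous to standard (inverted) dropout scaling
        drop = drop / drop.mean()
    return weight * drop
```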