Add LoRA for Zipformer #1540

Merged: 17 commits merged into k2-fsa:master on Mar 15, 2024
Conversation

marcoyang1998 (Collaborator) commented on Mar 11, 2024

This PR adds LoRA (Low-Rank Adaptation) fine-tuning support for Zipformer. For more details, please refer to the original LoRA paper: https://arxiv.org/abs/2106.09685.

To do:

  • Add LoRA support for QKV in self-attention
  • Add LoRA for feedforward module
  • Support exporting as a normal Zipformer
  • Benchmark against adapter
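
For context, the core idea can be sketched as a LoRA-augmented linear layer in PyTorch. This is a minimal illustration under assumed names (`LoRALinear`, `r`, `lora_alpha`), not the implementation added in this PR; the `merge` method indicates how the low-rank update could be folded back into the base weight so the fine-tuned model exports as a normal Zipformer.

```python
import math
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Minimal LoRA-augmented linear layer (illustration only, not this PR's code)."""

    def __init__(self, in_features: int, out_features: int,
                 r: int = 8, lora_alpha: float = 16.0):
        super().__init__()
        # Frozen pre-trained projection (e.g. a QKV or feedforward in_proj/out_proj).
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():
            p.requires_grad_(False)
        # Trainable low-rank factors; B starts at zero so the adapted layer
        # initially matches the pre-trained one exactly.
        self.lora_A = nn.Parameter(torch.empty(r, in_features))
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))
        nn.init.kaiming_uniform_(self.lora_A, a=math.sqrt(5))
        self.scaling = lora_alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = W x + (alpha / r) * B A x
        return self.base(x) + self.scaling * (x @ self.lora_A.t() @ self.lora_B.t())

    @torch.no_grad()
    def merge(self) -> nn.Linear:
        # Fold the low-rank update into the base weight so the fine-tuned
        # model can be exported as a plain (LoRA-free) linear layer.
        self.base.weight.add_(self.scaling * (self.lora_B @ self.lora_A))
        return self.base
```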

marcoyang1998 (Collaborator, Author) commented:
Add LoRA to the following layers:

  • QKV projection in self-attention
  • in_proj in Feedforward module
  • out_proj in Feedforward module

Experiment setup:

A Zipformer pre-trained on LibriSpeech is used as initialization and fine-tuned with LoRA on the GigaSpeech "small" subset for 20 epochs.
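
A rough sketch of how such a setup might freeze the pre-trained weights and count the trainable LoRA parameters (the `freeze_non_lora` helper and the `lora_` naming convention are assumptions for illustration, not the PR's actual fine-tuning script):

```python
# Sketch (assumption): freeze everything except the LoRA factors before
# fine-tuning, and report how many parameters remain trainable.
def freeze_non_lora(model):
    num_trainable = 0
    for name, param in model.named_parameters():
        if "lora_" in name:  # e.g. lora_A / lora_B from the sketch above
            param.requires_grad_(True)
            num_trainable += param.numel()
        else:
            param.requires_grad_(False)
    return num_trainable


# Hypothetical usage:
#   model = zipformer_with_lora(r=8)   # placeholder constructor
#   print(f"Trainable parameters: {freeze_non_lora(model):,}")
```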

Experiment results:

| Exp | LoRA layers | r | Trainable params | WER on GigaSpeech (dev/test) |
|---|---|---|---|---|
| baseline (no fine-tune) | - | - | - | 20.06/19.27 |
| v1 | QK | 8 | 89,600 | 18.06/17.99 |
| v2 | QKV | 4 | 98,560 | 18.06/17.99 |
| v2 | QKV | 8 | 197,120 | 18.06/17.99 |
| v3 | QKV + FFW in_proj | 4 | 364,288 | 15.99/16.17 |
| v3 | QKV + FFW in_proj | 8 | 728,576 | 15.63/15.74 |
| v4 | QKV + full FFW | 4 | 630,016 | 15.57/15.61 |
| v4 | QKV + full FFW | 8 | 1,260,032 | 15.27/15.33 |

Observations:

  • Increasing r improves performance.
  • Adding LoRA to more layers is more beneficial than increasing r.

Comparison with adapter

| Exp | Trainable params | WER on GigaSpeech (dev/test) |
|---|---|---|
| adapter | 1.49M | 15.05/15.18 |
| LoRA | 1.26M | 15.27/15.33 |

marcoyang1998 merged commit 2dfd5db into k2-fsa:master on Mar 15, 2024
142 of 143 checks passed