
Question about LoRA alpha #5

Open

vishaal27 opened this issue Sep 10, 2023 · 1 comment

Comments

@vishaal27

Hi, thanks for your great work. I noticed that in your scripts, you hard-code the LoRA alpha to 128 and the rank r to 4, leading to a scaling factor of alpha / r = 128 / 4 = 32:

```python
'''
LoRA setting
'''
self.lora_moe_lambda = 1.0
self.lora_moe_act = 'linear'
self.lora_r_dropout = None
self.lora_attn_dim = 4
self.lora_moe = 0
self.lora_attn_alpha = 128
```

Was there a principled justification for these choices? I am wondering whether you tuned these values, and whether you can suggest good values to use.

@jkooy
Collaborator

jkooy commented Sep 17, 2023

Hi, thanks for your interest! The setting is inherited from LoRA's official development code:
https://github.com/microsoft/LoRA/tree/snapshot-9-15-2021
https://github.com/microsoft/LoRA/blob/snapshot-9-15-2021/src/model.py
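
For context, here is a minimal sketch of how these two hyperparameters typically interact in a LoRA layer, following the standard formulation in which the low-rank update is scaled by alpha / r. The class and variable names below are illustrative, not taken from this repository:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Illustrative LoRA-augmented linear layer (not this repo's implementation)."""

    def __init__(self, in_features: int, out_features: int,
                 r: int = 4, lora_alpha: int = 128):
        super().__init__()
        # Frozen pretrained weight; only the adapters below are trained.
        self.weight = nn.Parameter(torch.empty(out_features, in_features))
        nn.init.kaiming_uniform_(self.weight)
        self.weight.requires_grad = False

        # Low-rank factors: B @ A has shape (out_features, in_features).
        # A starts small and random, B at zero, so the update starts at zero.
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))

        # The scaling factor discussed above: alpha / r = 128 / 4 = 32.
        self.scaling = lora_alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        base = x @ self.weight.T
        update = (x @ self.lora_A.T) @ self.lora_B.T
        return base + self.scaling * update
```

Because the update is multiplied by alpha / r, increasing alpha has roughly the same effect as increasing the learning rate on the adapter weights (as noted in the LoRA paper), which is why alpha is often fixed and only r is tuned.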
