[Bug Report] SAE training tutorial metrics do not match linked run #276

Open
naterush opened this issue Sep 3, 2024 · 3 comments

Contributor

naterush commented Sep 3, 2024

Describe the bug

Hey. I'm working through the training tutorial and, without making any changes, I'm unable to train a basic SAE with loss numbers as good as those in the linked run. I'm not sure whether this is numerical instability, whether something has changed in the library, or whether my differences are actually inconsequential, so I'm opening this issue to get to the bottom of it!

My steps:

  1. Opened the notebook in Google Colab (see notebook here)
  2. Selected an A100 GPU (for fast execution)
  3. Executed the notebook all the way through

Differences between my training run and yours

  1. My overall loss is 360, yours is 133
  2. My L0 is 160ish, yours is 80ish

There are quite a few more differences, but I'm wondering if you have any thoughts on why this is happening. I'm new to SAE work generally, so any tips would be appreciated. (A quick sketch of how I'm interpreting the two metrics above follows.)
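To make sure we're comparing the same quantities, here is a minimal sketch of how I'm reading "overall loss" (reconstruction MSE) and L0 for a standard ReLU SAE. This is my own interpretation, not the actual SAELens implementation, and the weight/bias names (`W_enc`, `b_enc`, `W_dec`, `b_dec`) are just illustrative:

```python
import torch

def sae_metrics(acts, W_enc, b_enc, W_dec, b_dec):
    """acts: [batch, d_in] activations captured at the hooked site."""
    # Encode: subtract the decoder bias, project up to d_sae features, apply ReLU.
    feature_acts = torch.relu((acts - b_dec) @ W_enc + b_enc)  # [batch, d_sae]
    # Decode: project back down to the model's activation space.
    recon = feature_acts @ W_dec + b_dec                       # [batch, d_in]

    # L0: mean number of non-zero features per token.
    l0 = (feature_acts > 0).float().sum(dim=-1).mean()
    # Reconstruction MSE: summed over d_in, averaged over the batch.
    mse = ((recon - acts) ** 2).sum(dim=-1).mean()
    return l0.item(), mse.item()
```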

Code example

See notebook here

System Info

  1. Google Colab Pro+
  2. A100 GPU, took about 1 hour to train

Checklist

  • I have checked that there is no similar issue in the repo (required)
naterush changed the title from "[Bug Report] Basic training tutorial metrics do not match linked run" to "[Bug Report] SAE training tutorial metrics do not match linked run" on Sep 3, 2024
Contributor

niniack commented Oct 2, 2024

+1. I've been experimenting with the library, trying to reproduce the results from the wandb tutorial run as well as these runs:
https://wandb.ai/jbloom/mats_sae_training_gpt2_small_resid_pre_5?nw=nwuserjbloom
but I haven't had success with either.

I have replicated the hyperparameters that were set in the GPT-2 runs (linked above), to no avail. I suspect that later versions of the library introduced changes that require different hyperparameters, but I don't have a good theory.
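For concreteness, this is roughly the shape of the setup I've been trying, written against the tutorial-era sae_lens interface (LanguageModelSAERunnerConfig / SAETrainingRunner). The parameter values below are illustrative placeholders rather than the exact hyperparameters of the linked runs, and the config surface may differ between library versions, so please read this as a sketch, not a reproduction recipe:

```python
# Sketch only: values are placeholders, not the linked runs' hyperparameters.
from sae_lens import LanguageModelSAERunnerConfig, SAETrainingRunner

cfg = LanguageModelSAERunnerConfig(
    model_name="gpt2",                    # GPT-2 small, as in the linked wandb runs
    hook_name="blocks.5.hook_resid_pre",  # resid_pre at layer 5
    hook_layer=5,
    d_in=768,                             # residual stream width of GPT-2 small
    expansion_factor=32,                  # d_sae = 32 * d_in (placeholder)
    lr=4e-4,                              # placeholder learning rate
    l1_coefficient=5.0,                   # placeholder sparsity penalty
    training_tokens=100_000_000,          # placeholder token budget
    context_size=128,
    device="cuda",
)

sae = SAETrainingRunner(cfg).run()
```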

Side note: @naterush, your wandb run is private, so other users cannot see the results!


Numeri commented Oct 3, 2024

I've also run it several times and haven't managed to get good loss curves; it plateaus very quickly, at around an MSE loss of 200 and an L1 loss of 165.

Owner

jbloomAus commented Oct 3, 2024 via email
