We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi, very helpful repo, learned a lot from it.
I got a question about an implementation detail in TransformerXL.
In the transformer_xl_from_scratch notebook, the memory length during validation is calculated as val_memory_length + train_bptt - val_bptt.
transformer_xl_from_scratch
val_memory_length + train_bptt - val_bptt
Why aren't it just set to val_memory_length?
val_memory_length
Looking forward to reply.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Hi, very helpful repo, learned a lot from it.
I got a question about an implementation detail in TransformerXL.
In the
transformer_xl_from_scratch
notebook, the memory length during validation is calculated asval_memory_length + train_bptt - val_bptt
.Why aren't it just set to
val_memory_length
?Looking forward to reply.
The text was updated successfully, but these errors were encountered: