RF cum_concat_step simplify and other RF things #1665
Conversation
What about the other backends? It seems strange to me that only one backend can pad with non-scalar values. If we only implement this for torch, I'd rather adjust the RF cum_concat_step impl to not rely on rf.pad but on a more general operation that works across all backends, no?
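A minimal sketch of such a backend-generic fallback, written in plain PyTorch rather than the RF API (the function name and shapes here are illustrative assumptions, not the actual impl): padding with per-batch values can be expressed as an explicit concat, so it does not depend on the backend's pad op accepting non-scalar values.

```python
# Illustrative sketch only, not the RF implementation: pad the time axis
# with a per-batch/per-feature value by building the pad block explicitly
# and concatenating, instead of relying on a pad op with non-scalar values.
import torch


def pad_right_with_values(x: torch.Tensor, pad_len: int, pad_values: torch.Tensor) -> torch.Tensor:
    """x: [batch, time, feat]; pad_values: [batch, feat], broadcast over the new frames."""
    pad_block = pad_values.unsqueeze(1).expand(-1, pad_len, -1)  # [batch, pad_len, feat]
    return torch.cat([x, pad_block], dim=1)  # [batch, time + pad_len, feat]


x = torch.randn(2, 4, 3)
out = pad_right_with_values(x, pad_len=2, pad_values=torch.zeros(2, 3))
assert out.shape == (2, 6, 3)
```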
There is not really any other backend currently which fully works and is actively used, so I don't think this has such a high priority now.
But is it much work to change the impl to be generic by default? If we ever add more tier 1 backends, that is one less thing to worry about?
I don't exactly understand? Via this PR,
I mean wrt.
But I still don't understand?
Specifically for cross attention, it could happen that max(q_seq_len+k_seq_len-1) != shape.
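A minimal numeric sketch of where that count comes from (variable names are just illustrative): in cross attention the query and key sequences have independent lengths, and the relative offsets k - q cover exactly q_seq_len + k_seq_len - 1 distinct values, which need not match a shape derived from a single dim.

```python
# Illustrative only: distinct relative positions in cross attention.
q_seq_len, k_seq_len = 3, 5
rel_positions = sorted({k - q for q in range(q_seq_len) for k in range(k_seq_len)})
# Offsets range from -(q_seq_len - 1) to (k_seq_len - 1): -2 .. 4 here.
assert len(rel_positions) == q_seq_len + k_seq_len - 1  # 7 distinct values
```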
Compare 77135fa to 7b2882a
(This is just here to at least run the test once before I merge it into main.)
(The other commits accidentally ended up in here, but that cannot hurt. Let's just not squash them together but rebase instead.)