how about replacing einsum with normal multiplication #14

tntjd7545 · 2020-01-15T05:24:15Z

in attention.py, class AttentionConv

replacing out = torch.einsum('bnchwk,bnchwk -> bnchw', out, v_out)

with out = (out*v_out).sum(dim=5)

made running time more than 2x faster while training on IMAGENET (2 min vs 53s per 100 step, batchsize 25) which is still 3.5x slower than training normal ResNet on IMAGENET

(Not sure whether this model works for IMAGENET or not)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how about replacing einsum with normal multiplication #14

how about replacing einsum with normal multiplication #14

tntjd7545 commented Jan 15, 2020 •

edited

Loading

how about replacing einsum with normal multiplication #14

how about replacing einsum with normal multiplication #14

Comments

tntjd7545 commented Jan 15, 2020 • edited Loading

tntjd7545 commented Jan 15, 2020 •

edited

Loading