Hello, I feel really confused about the gradients of alphas in hard attention. The source code is at line 1199:
known_grads={alphas: opt_outs['masked_cost'][:, :, None] / 10. * (alphas_sample / alphas) + alpha_entropy_c * (tensor.log(alphas) + 1)})
Can anyone explain this to me, please?
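My current reading (not verified, so please correct me) is that this is the gradient of a REINFORCE-style surrogate loss: alphas_sample is a one-hot draw from alphas, so differentiating masked_cost * log p(sample | alphas) with respect to alphas yields the masked_cost/10. * (alphas_sample/alphas) term, while alpha_entropy_c * (tensor.log(alphas) + 1) is the derivative of a negative-entropy regularizer alpha_entropy_c * sum(alphas * log(alphas)). Here is a minimal NumPy sketch checking that analytic expression against finite differences; the scalar cost and coefficient c below are stand-ins for masked_cost/10. and alpha_entropy_c, not the repo's actual variables:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5
alpha = rng.dirichlet(np.ones(n))        # attention weights (a probability vector)
s = np.eye(n)[rng.choice(n, p=alpha)]    # one-hot sample, playing the role of alphas_sample
cost = 2.3                               # stand-in for masked_cost / 10. (scalar here)
c = 0.002                                # stand-in for alpha_entropy_c

# Surrogate loss whose gradient the known_grads expression supplies:
#   L(alpha) = cost * sum_i s_i * log(alpha_i)  +  c * sum_i alpha_i * log(alpha_i)
def surrogate(a):
    return cost * np.sum(s * np.log(a)) + c * np.sum(a * np.log(a))

# Analytic gradient, matching the known_grads expression:
#   cost * s / alpha  +  c * (log(alpha) + 1)
analytic = cost * (s / alpha) + c * (np.log(alpha) + 1.0)

# Central finite differences, coordinate by coordinate
eps = 1e-6
numeric = np.array([
    (surrogate(alpha + eps * np.eye(n)[i]) - surrogate(alpha - eps * np.eye(n)[i])) / (2 * eps)
    for i in range(n)
])

print(np.max(np.abs(analytic - numeric)))  # tiny (~1e-8): the formulas agree
```

If that reading is right, the expression is the REINFORCE estimator for the sampled attention plus an entropy bonus, broadcast over the batch; what I am still unsure about is the 1/10 scaling.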
@denglixi I am also confused by this. Did you find the answer?
@denglixi @ysjakking Did you figure out the answer? I am also confused. Can anyone help me?
Me too...
@denglixi @ysjakking @shaoxuan92 @SijieSong Did any of you find a solution? I have also come across this problem.