Wrong logalpha parameterisation for sparsifying variational dropout #10

Open
gngdb opened this issue Oct 16, 2017 · 1 comment

gngdb (Owner) commented Oct 16, 2017

Here:

```python
logalpha = T.log(T.nnet.sigmoid(self.logitalpha)).eval()
# remove the old parameter
del self.params[self.logitalpha]
del self.logitalpha
self.logalpha = theano.shared(
    value=logalpha,
    name='logalpha')
```

This should depend on W, as it does here: https://github.com/BayesWatch/tf-variational-dropout
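
For reference, a minimal sketch of the W-tied parameterisation used there, where log alpha = log sigma^2 - log W^2 (the names `logsigma2` and `W`, and the `eps` constant, are illustrative, not taken from either repo):

```python
import theano.tensor as T

def tied_logalpha(logsigma2, W, eps=1e-8):
    # log alpha_ij = log sigma_ij^2 - log W_ij^2, i.e. alpha is tied to the
    # weight magnitude, as in the sparsifying (Molchanov et al.) variant.
    # `eps` guards against log(0) when a weight is exactly zero.
    return logsigma2 - T.log(T.sqr(W) + eps)
```

In the sparsifying variant, weights whose log alpha exceeds a threshold are the ones pruned, which is why the parameterisation there is tied to W.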

kzhai commented Nov 28, 2017

I think your implementation is fine. The reason for including W in the other implementation is the combination of the variational dropout layer with a fully connected layer: despite the clip/mask operations, the W factor cancels out in the later computation. One minor difference, though, is that your parameterization is a neuron-level dropout approach, whereas the other implementations are equivalent to a weight-level drop-connect approach.
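
To illustrate the cancellation, here is a rough sketch of a local-reparameterisation forward pass for a dense layer (assumed for illustration, not copied from either repo; the function and argument names are made up). The activation variance is (x^2) . (alpha * W^2), so alpha only ever enters multiplied by W^2, and it makes no difference to the forward computation whether W is folded into the parameterisation of alpha or kept separate:

```python
import theano.tensor as T
from theano.tensor.shared_randomstreams import RandomStreams

def lrt_dense(x, W, logalpha, srng=None, eps=1e-8):
    # Local reparameterisation for a dense layer with multiplicative
    # Gaussian noise. `logalpha` can be weight-level (same shape as W)
    # or a broadcastable column for neuron-level dropout.
    if srng is None:
        srng = RandomStreams(seed=42)
    mean = T.dot(x, W)
    # alpha appears only in the product alpha * W^2 below, which is where
    # the W factor "cancels" between the two parameterisations.
    var = T.dot(T.sqr(x), T.exp(logalpha) * T.sqr(W))
    noise = srng.normal(size=mean.shape)
    return mean + T.sqrt(var + eps) * noise
```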
