Skip to content

larspars/delta_orthogonal_init

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Delta Orthogonal Initialization for Lua Torch

An implementation of the weight initialization proposed in the paper "Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks" by Xiao et al.

To initialize the weights in a single layer, call makeDeltaOrthogonal():

local conv = nn.SpatialConvolution(in, out, 3, 3)
makeDeltaOrthogonal(conv.weight)

To initialize all layers in a network, call initAll()

local model = nn.Sequential(...)
initAll(model)

License: MIT

About

Delta orthogonal initialization for Lua Torch

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages