The implementation is similar to Keras implementation of Neural Network. Layers include Batch Norm, Convolutional 2D, Maxpool 2D, etc.
Notes:
- Does not take advantage of GPU processing
- Not optimized for running on large datasets
- LSTM and GRU implementations are future work
- train.py and validate.py were run on MNIST dataset