- In
tensorflow/examples/speech_commands/input_data.py
, line 487, use self.mfcc_ or self.spectrogram - In
tensorflow/examples/speech_commands/models.py
, line 50, comment/uncomment things with mfcc / spectrogram title
- In
tensorflow/examples/speech_commands/models.py
, line 242, change first_filter_count
- In
tensorflow/examples/speech_commands/train.py
, line 381, change default
python tensorflow\python\tools\inspect_checkpoint.py --file_name=D:\tmp\speech_commands_train\conv.ckpt-18000
(windows)python tensorflow/python/tools/inspect_checkpoint.py --file_name=/tmp/speech_commands_train/conv.ckpt-18000
(linux)
Results in:
Variable (DT_FLOAT) [20,8,1,64]
Variable_1 (DT_FLOAT) [64]
Variable_2 (DT_FLOAT) [10,4,64,64]
Variable_3 (DT_FLOAT) [64]
Variable_4 (DT_FLOAT) [62720,12]
Variable_5 (DT_FLOAT) [12]
global_step (DT_INT64) []
python vis_conv_layer1.py
- (optional) choose which file to visualize by setting the input_wav
python wav_to_spectrogram
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 89%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 88.3%
- training time: 70 min
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc =
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 86.6%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 87.2%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 85.8%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 85.9%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 81.3%
- training time: 67 min
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 77.2%
- 15000 steps (0.001 learning_rate), 3000 steps (0.0001 learning_rate) = total 18000s
- acc = 65.8%
- training time: 120 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 76.4%
- training time: 240 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 76.0%
- training time: 180 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 76.1%
- training time: 150 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 75.8%
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 71.7%
- training time: 130 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 72.3%
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 67.3%
- training time: 130 min
- 30000 steps (0.001 learning_rate), 6000 steps (0.0001 learning_rate) = total 36000s
- acc = 65.3%
- training time: 125 min