
1603AnsselRNNCNN

Petr Baudis edited this page Mar 27, 2016 · 5 revisions

Answer Sentence Selection RNN-CNN Parameters

RNN-CNN combination, also referred to as "attentionless attn1511".

yodaqa-curatedv2

1rnncnn config:

{"Ddim": "2", "balance_class": "False", "batch_size": "160", "cdim": "{1: 1, 2: 0.5, 3: 0.5, 4: 0.5, 5: 0.5}", "cnnact": "tanh", "cnninit": "glorot_uniform", "dropout": "0.75", "dropoutfix_inp": "0", "dropoutfix_rec": "0", "e_add_flags": "True", "embdim": "300", "epoch_fract": "0.25", "inp_e_dropout": "0.75", "inp_w_dropout": "0", "l2reg": "0.0001", "loss": "<function ranknet at 0xc171de8>", "mlpsum": "sum", "nb_epoch": "16", "pact": "linear", "pdim": "2.5", "project": "True", "ptscorer": "<function mlp_ptscorer at 0xc177848>", "rnn": "<class 'keras.layers.recurrent.GRU'>", "rnnact": "tanh", "rnnbidi": "True", "rnnbidi_mode": "sum", "rnninit": "glorot_uniform", "rnnlevels": "1", "sdim": "2"}
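The dump above stores every value as a string, including numbers, booleans, and the cdim dictionary. A minimal sketch of one way to decode such a dump for analysis follows; the helper name parse_config is hypothetical and not part of the original tooling, and the sample is an abbreviated copy of the config above.

```python
import ast
import json


def parse_config(dump):
    """Decode a config dump whose values are all stored as strings.

    Hypothetical helper for illustration; not part of the original code.
    """
    parsed = {}
    for key, value in json.loads(dump).items():
        try:
            # Handles numbers, booleans, and dict literals such as cdim.
            parsed[key] = ast.literal_eval(value)
        except (ValueError, SyntaxError):
            # Plain words ("tanh") and reprs ("<function ...>") stay strings.
            parsed[key] = value
    return parsed


# Abbreviated copy of the 1rnncnn config (a few keys only).
dump = (
    '{"sdim": "2", "dropout": "0.75", "project": "True", '
    '"cdim": "{1: 1, 2: 0.5, 3: 0.5, 4: 0.5, 5: 0.5}", '
    '"cnnact": "tanh", "loss": "<function ranknet at 0xc171de8>"}'
)
config = parse_config(dump)
print(config["cdim"][2], config["project"], config["cnnact"])
```

Function and class reprs cannot be turned back into objects this way, so they are kept as raw strings.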

1x ay_1rnncnn (rnncnn--8c20398b87d1bc8) - 0.269639

1x ay_1rnncnn sdim=1 'cdim={2:3}' 'cnnact="relu"' (rnncnn--2d51ed51556409de) - 0.271150

1x ay_1rnncnn sdim=1 'cdim={2:3}' 'cnnact="relu"' 'pact="tanh"' inp_e_dropout=0 dropout=0 (rnncnn--5cbede8c97a72de0) - 0.376694

1x ay_1rnncnn sdim=1 'cnnact="relu"' 'pact="tanh"' inp_e_dropout=0 dropout=0 (rnncnn--1e5db9922d83a6d2) - 0.402643

1x ay_1rnncnn sdim=1 'cnnact="relu"' 'pact="tanh"' inp_e_dropout=1/2 dropout=1/2 (rnncnn-dd6a06c15e3f93d) - 0.226192

1x ay_1rnncnn sdim=1 'cnnact="relu"' 'pact="tanh"' inp_e_dropout=0 dropout=1/2 (rnncnn-7de2baecc50877f3) - 0.312884

1x ay_1rnncnn sdim=1 'cnnact="relu"' 'pact="relu"' inp_e_dropout=0 dropout=0 (rnncnn--10fccb9aa05991e1) - 0.398592

1x ay_1rnncnn 'cnnact="relu"' 'pact="tanh"' inp_e_dropout=0 dropout=0 - TODO

2x ay_1rnncnn 'cnnact="relu"' 'pact="relu"' inp_e_dropout=0 dropout=0 (rnncnn-1ff6512c1ad2703b, rnncnn-39d98497a9e62273) - [0.391590, 0.343675]

1x ay_1rnncnnd0 'cnnact="tanh"' 'pact="tanh"' (rnncnn--74f6477140c434f) - 0.401744

1x ay_1rnncnnd0 'cnnact="tanh"' 'pact="tanh"' sdim=1 (rnncnn--71e255377233d787) - 0.364815

ubuntu

2rnncnn, no dropout, sdim=1 (default)

RunID: rnncnn--193e9d57614db616  ({"Ddim": "2", "batch_size": "192", "cdim": "{1: 1, 2: 0.5, 3: 0.5, 4: 0.5, 5: 0.5}", "cnnact": "relu", "cnninit": "glorot_uniform", "dropout": "0", "dropoutfix_inp": "0", "dropoutfix_rec": "0", "e_add_flags": "True", "embdim": "300", "epoch_fract": "0.25", "inp_e_dropout": "0", "inp_w_dropout": "0", "l2reg": "0.0001", "loss": "binary_crossentropy", "mlpsum": "sum", "nb_epoch": "16", "pact": "tanh", "pdim": "2", "project": "True", "ptscorer": "<function mlp_ptscorer at 0x908b7d0>", "rnn": "<class 'keras.layers.recurrent.GRU'>", "rnnact": "tanh", "rnnbidi": "True", "rnnbidi_mode": "sum", "rnninit": "glorot_uniform", "rnnlevels": "1", "sdim": "1"})

Epoch 14/32
200064/200000 [==============================] - 5767s - loss: 0.3229                                                         val mrr 0.751173

2rnncnn, no dropout, sdim=1/2

RunID: rnncnn--501b84f4f8e5947a  ({"Ddim": "2", "batch_size": "192", "cdim": "{1: 1, 2: 0.5, 3: 0.5, 4: 0.5, 5: 0.5}", "cnnact": "relu", "cnninit": "glorot_uniform", "dropout": "0", "dropoutfix_inp": "0", "dropoutfix_rec": "0", "e_add_flags": "True", "embdim": "300", "epoch_fract": "0.25", "inp_e_dropout": "0", "inp_w_dropout": "0", "l2reg": "0.0001", "loss": "binary_crossentropy", "mlpsum": "sum", "nb_epoch": "16", "pact": "tanh", "pdim": "2", "project": "True", "ptscorer": "<function mlp_ptscorer at 0x94db1b8>", "rnn": "<class 'keras.layers.recurrent.GRU'>", "rnnact": "tanh", "rnnbidi": "True", "rnnbidi_mode": "sum", "rnninit": "glorot_uniform", "rnnlevels": "1", "sdim": "0.5"})

Epoch 10/32
200064/200000 [==============================] - 3641s - loss: 0.3930                                                         val mrr 0.763290
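Because the config dumps are long, it can be hard to see which settings separate two runs. The following is a rough sketch of a comparison helper, assuming each config has been loaded as a dictionary of strings; diff_configs is a hypothetical name, and the dictionaries are abbreviated copies of the two runs above.

```python
import re


def diff_configs(a, b):
    """Return {key: (a_value, b_value)} for settings that differ.

    Hypothetical helper for illustration; not part of the original code.
    """
    def normalize(value):
        # Memory addresses change on every run, so mask them out.
        return re.sub(r" at 0x[0-9a-f]+", "", value)

    diff = {}
    for key in sorted(set(a) | set(b)):
        left, right = a.get(key), b.get(key)
        if normalize(left or "") != normalize(right or ""):
            diff[key] = (left, right)
    return diff


# Abbreviated configs of the two runs (values copied from the dumps).
run_sdim1 = {
    "pdim": "2",
    "sdim": "1",
    "ptscorer": "<function mlp_ptscorer at 0x908b7d0>",
}
run_sdim_half = {
    "pdim": "2",
    "sdim": "0.5",
    "ptscorer": "<function mlp_ptscorer at 0x94db1b8>",
}
print(diff_configs(run_sdim1, run_sdim_half))
```

With these abbreviated inputs only sdim is reported, since the two ptscorer entries differ only in their memory address.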

2rnncnn, no dropout, p1dot, sdim=1

RunID: rnncnn--50e6963a84e30522  ({"Ddim": "2", "batch_size": "192", "cdim": "{1: 1, 2: 0.5, 3: 0.5, 4: 0.5, 5: 0.5}", "cnnact": "relu", "cnninit": "glorot_uniform", "dropout": "0", "dropoutfix_inp": "0", "dropoutfix_rec": "0", "e_add_flags": "True", "embdim": "300", "epoch_fract": "0.25", "inp_e_dropout": "0", "inp_w_dropout": "0", "l2reg": "0.0001", "loss": "binary_crossentropy", "mlpsum": "sum", "nb_epoch": "16", "pact": "tanh", "pdim": "1", "project": "True", "ptscorer": "<function dot_ptscorer at 0x737a320>", "rnn": "<class 'keras.layers.recurrent.GRU'>", "rnnact": "tanh", "rnnbidi": "True", "rnnbidi_mode": "sum", "rnninit": "glorot_uniform", "rnnlevels": "1", "sdim": "1"})

Epoch 8/32
200064/200000 [==============================] - 5694s - loss: 0.3646                                                         val mrr 0.783245

2rnncnn, no dropout, p1dot, sdim=1/2, val MRR 0.786321
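For reference, the "val mrr" figures are mean reciprocal rank values. Below is a minimal sketch of the standard MRR definition, assuming each question comes with a list of candidate scores and binary correctness labels; it is not necessarily the exact evaluation code used for these runs, and the toy data is made up purely for illustration.

```python
def mean_reciprocal_rank(questions):
    """questions: list of (scores, labels) pairs, one pair per question."""
    total = 0.0
    for scores, labels in questions:
        # Order candidate indices from highest to lowest score.
        order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        for rank, idx in enumerate(order, start=1):
            if labels[idx]:
                # Only the best-ranked correct candidate counts.
                total += 1.0 / rank
                break
        # Questions with no correct candidate contribute 0.
    return total / len(questions)


# Toy data, invented for illustration only.
toy = [
    ([0.9, 0.2, 0.1], [1, 0, 0]),  # correct candidate ranked 1st
    ([0.3, 0.8, 0.5], [1, 0, 0]),  # correct candidate ranked 3rd
]
print(mean_reciprocal_rank(toy))
```

The toy example averages reciprocal ranks 1 and 1/3, giving roughly 0.667.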
