Commit cb651c1

Merge pull request #315 from ntumlgroup/dropout
Homogenizing Dropout
2 parents: c793509 + f38e591

46 files changed: 195 additions, 112 deletions

docs/examples/plot_KimCNN_quickstart.py

Lines changed: 1 addition & 1 deletion
@@ -45,7 +45,7 @@
 # We consider the following settings for the KimCNN model.
 
 model_name = "KimCNN"
-network_config = {"embed_dropout": 0.2, "encoder_dropout": 0.2, "filter_sizes": [2, 4, 8], "num_filter_per_size": 128}
+network_config = {"embed_dropout": 0.2, "post_encoder_dropout": 0.2, "filter_sizes": [2, 4, 8], "num_filter_per_size": 128}
 learning_rate = 0.0003
 model = init_model(
     model_name=model_name,

docs/examples/plot_bert_quickstart.py

Lines changed: 1 addition & 1 deletion
@@ -54,7 +54,7 @@
 
 model_name = "BERT"
 network_config = {
-    "dropout": 0.1,
+    "encoder_hidden_dropout": 0.1,
     "lm_weight": "bert-base-uncased",
 }
 learning_rate = 0.00003

example_config/EUR-Lex-57k/bert.yml

Lines changed: 1 addition & 1 deletion
@@ -27,7 +27,7 @@ val_metric: RP@5
 model_name: BERT
 init_weight: null
 network_config:
-  dropout: 0.1
+  encoder_hidden_dropout: 0.1
   lm_weight: bert-base-uncased
   lm_window: 512

example_config/EUR-Lex-57k/bert_lwan.yml

Lines changed: 1 addition & 1 deletion
@@ -26,7 +26,7 @@ val_metric: RP@5
 model_name: BERTAttention
 init_weight: null
 network_config:
-  dropout: 0.1
+  post_encoder_dropout: 0.1
   lm_weight: bert-base-uncased
   lm_window: 512
   attention_type: singlehead

example_config/EUR-Lex-57k/bert_lwan_tune.yml

Lines changed: 1 addition & 1 deletion
@@ -31,7 +31,7 @@ model_name: BERTAttention
 loss_function: binary_cross_entropy_with_logits
 init_weight: null
 network_config:
-  dropout: ['grid_search', [0, 0.1, 0.2, 0.4]]
+  post_encoder_dropout: ['grid_search', [0, 0.1, 0.2, 0.4]]
   lm_weight: bert-base-uncased
   lm_window: 512
   attention_type: singlehead

example_config/EUR-Lex-57k/bigru_lwan.yml

Lines changed: 1 addition & 1 deletion
@@ -29,7 +29,7 @@ model_name: BiGRULWAN
 init_weight: kaiming_uniform
 network_config:
   embed_dropout: 0.4
-  encoder_dropout: 0.2
+  post_encoder_dropout: 0.2
   rnn_dim: 256
   rnn_layers: 1

example_config/EUR-Lex-57k/bigru_lwan_tune.yml

Lines changed: 1 addition & 1 deletion
@@ -34,7 +34,7 @@ loss_function: binary_cross_entropy_with_logits
 init_weight: kaiming_uniform
 network_config:
   embed_dropout: ['grid_search', [0, 0.2, 0.4, 0.6, 0.8]]
-  encoder_dropout: ['grid_search', [0, 0.2, 0.4]]
+  post_encoder_dropout: ['grid_search', [0, 0.2, 0.4]]
   rnn_dim: ['grid_search', [256, 512, 1024]]
   rnn_layers: 1

example_config/EUR-Lex-57k/cnn_lwan.yml

Lines changed: 1 addition & 1 deletion
@@ -29,7 +29,7 @@ model_name: CNNLWAN
 init_weight: kaiming_uniform
 network_config:
   embed_dropout: 0.2
-  encoder_dropout: 0.4
+  post_encoder_dropout: 0.4
   filter_sizes: [8]
   num_filter_per_size: 256

example_config/EUR-Lex-57k/cnn_lwan_tune.yml

Lines changed: 1 addition & 1 deletion
@@ -35,7 +35,7 @@ init_weight: kaiming_uniform
 network_config:
   activation: tanh
   embed_dropout: ['grid_search', [0, 0.2, 0.4, 0.6, 0.8]]
-  encoder_dropout: ['grid_search', [0, 0.2, 0.4]]
+  post_encoder_dropout: ['grid_search', [0, 0.2, 0.4]]
   filter_sizes: [8]
   num_filter_per_size: ['grid_search', [32, 64, 128, 256, 512, 1024]]

example_config/EUR-Lex-57k/kim_cnn.yml

Lines changed: 1 addition & 1 deletion
@@ -29,7 +29,7 @@ model_name: KimCNN
 init_weight: kaiming_uniform
 network_config:
   embed_dropout: 0
-  encoder_dropout: 0.4
+  post_encoder_dropout: 0.4
   filter_sizes: [8]
   num_filter_per_size: 1024
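
Configs written before this commit still use the old dropout keys. The sketch below shows one way to migrate a `network_config` dictionary. It is not part of the commit: `OLD_TO_NEW` and `migrate_network_config` are hypothetical names, and the mapping is inferred only from the renames visible in the hunks above, so models touched by the hidden part of the diff are not covered.

```python
# Hypothetical migration helper, not part of the library.
# The per-model renames are read off the visible hunks of this commit only.
OLD_TO_NEW = {
    "BERT": {"dropout": "encoder_hidden_dropout"},
    "BERTAttention": {"dropout": "post_encoder_dropout"},
    "KimCNN": {"encoder_dropout": "post_encoder_dropout"},
    "BiGRULWAN": {"encoder_dropout": "post_encoder_dropout"},
    "CNNLWAN": {"encoder_dropout": "post_encoder_dropout"},
}


def migrate_network_config(model_name, network_config):
    """Return a copy of network_config with old dropout keys renamed.

    Values (including grid-search lists) are carried over unchanged.
    Models that are not in OLD_TO_NEW are returned as a plain copy.
    """
    renames = OLD_TO_NEW.get(model_name, {})
    migrated = {}
    for key, value in network_config.items():
        new_key = renames.get(key, key)
        # Refuse to guess when both the old and the new key are present.
        if new_key != key and new_key in network_config:
            raise ValueError(
                f"{model_name}: both '{key}' and '{new_key}' are set"
            )
        migrated[new_key] = value
    return migrated


if __name__ == "__main__":
    old = {"embed_dropout": 0.2, "encoder_dropout": 0.2, "filter_sizes": [2, 4, 8]}
    print(migrate_network_config("KimCNN", old))
```

The mapping is keyed by model name because the old `dropout` key becomes `encoder_hidden_dropout` for BERT but `post_encoder_dropout` for BERTAttention, so a single global rename would be wrong for one of them.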
