This repository has been archived by the owner on Dec 11, 2023. It is now read-only.

The result is bad #478

Open
duguiming111 opened this issue Jun 19, 2020 · 3 comments

Comments

@duguiming111

I am using the zh-en corpus from IWSLT 2015, and the BLEU score only reaches 5.3. Many other issues report the same problem: people cannot reproduce the results you give. Is there a problem with how we are running it?

@duguiming111
Author

When I run the following command:
python3 nmt.py --attention=scaled_luong --src=zh --tgt=en --vocab_prefix=tmp/nmt_data_zh_en/vocab --train_prefix=tmp/nmt_data_zh_en/train --dev_prefix=tmp/nmt_data_zh_en/tst2012 --test_prefix=tmp/nmt_data_zh_en/tst2013 --out_dir=tmp/nmt_attention_model --num_train_steps=12000 --steps_per_stats=100 --num_layers=2 --num_units=512 --encoder_type=bi --dropout=0.2 --decay_scheme=luong234 --metrics=bleu
The best BLEU score is 11.4. Is there a problem?

@maharajbrahma

The BLEU score depends on various factors, such as the architecture used, the hyperparameters, and the number of sentences in the dataset.
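
For anyone comparing numbers across setups, it may help to see what the metric actually measures. The following is a minimal sketch of corpus-level BLEU, assuming one reference per sentence, whitespace-tokenized input, n-grams up to 4, and no smoothing. The function names `ngrams` and `corpus_bleu` are made up for this illustration; this is not the evaluation script that nmt.py runs for --metrics=bleu, so its numbers may not match the ones reported by training.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU on a 0-100 scale (one reference each, no smoothing)."""
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = 0
    ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        hyp_len += len(hyp)
        ref_len += len(ref)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp, n)
            ref_counts = ngrams(ref, n)
            # Clipped matches: an n-gram is credited at most as often
            # as it occurs in the reference.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())
    # Without smoothing, a single n-gram order with zero matches gives BLEU 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    # Geometric mean of the n-gram precisions, computed in log space.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Brevity penalty: penalize output that is not longer than the reference.
    if hyp_len > ref_len:
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1.0 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    reference = "the cat sat on the mat".split()
    hypothesis = "the cat sat on a mat".split()
    print(corpus_bleu([hypothesis], [reference]))
```

Because the score is built from token n-gram overlap, the tokenization of the Chinese and English sides, the vocabulary, and the amount of training data can all move it noticeably, which is one reason scores from different setups are hard to compare directly.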

@watermelona

@duguiming111 I am a beginner with NMT and cannot find the zh-en corpus of IWSLT 2015. Can you tell me how to get this dataset and the vocab?
