Skip to content

maciejbiesek/smt-phrase-extraction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Phrase extraction

Project was created during Statistical Machine Translation at Computer Science, Adam Mickiewicz University. It parses GIZA++ output format then run grow-diag-final-and algorithm and finally extract possible phrases. It was tested on portuguese-polish languages.

To run:

python main.py [fe_file] [ef_file]

Both files has to be in GIZA++ format.

For example:

python main.py data/pt-pl data/pl-pt

About

Statistical Machine Translation Course final project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages