tensorflow-nmt

A Tensorflow implementation of Neural Machine Translation, based mostly on https://github.com/JayParks/tf-seq2seq. Support Tensorflow >= 1.2.1 and GPU.

Features:

Bidirectional LSTM
Learning rate decay

Example of Chinese-to-English translation

download news parallel corpus from WMT2018, e.g.

bash data/download.sh

data preprocess, including tokenization (for Chinese sentences, it would be better to conduct sengmentation first, e.g., using jieba), lowercasing, byte-pair-encoding. You may use

bash data/preprocess.sh

model training

python3 -m train \
--model_dir=model-zh-en/ \
--embedding_size=512 \
--hidden_units=512 \
--batch_size=128 \
--start_decay_step=100000 \
--decay_steps=30000 \
--display_freq=80 \
--save_freq=10000 \
--source_vocabulary=data/zh-en/news-commentary-v13.zh-en.final.zh.json \
--target_vocabulary=data/zh-en/news-commentary-v13.zh-en.final.en.json \
--source_train_data=data/zh-en/news-commentary-v13.zh-en.final.zh \
--target_train_data=data/zh-en/news-commentary-v13.zh-en.final.en

or just

bash train.sh

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.idea		.idea
__pycache__		__pycache__
data		data
.DS_Store		.DS_Store
README.md		README.md
data_iterator.py		data_iterator.py
data_utils.py		data_utils.py
decode.py		decode.py
initial		initial
seq2seq_model.py		seq2seq_model.py
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tensorflow-nmt

Example of Chinese-to-English translation

About

Releases

Packages

Contributors 2

Languages

lkluo/tensorflow-nmt

Folders and files

Latest commit

History

Repository files navigation

tensorflow-nmt

Example of Chinese-to-English translation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages