translation with language models #495

VictorChen2012 · 2019-09-20T04:12:02Z

I'm wondering if it is possible to combine an LM with a seq2seq model under OpenNMT-tf, e.g. shallow fusion, deep fusion or cold fusion.

Currently, the variables and ops of the LM decoder and the seq2seq decoder live in different name scopes. Directly loading the two pretrained models (the LM and the seq2seq model) and merging them under the same name scope is too complicated.

Any suggestions on how to achieve this?

guillaumekln · 2019-09-20T07:26:46Z

Shallow fusion should be the most accessible but it may not be easy to integrate at this time. However, there are some incoming changes that should facilitate such combinations.

I'm interested in supporting shallow fusion in the near future.
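For reference, here is a minimal sketch of what shallow fusion computes at each decoding step: the translation model's log-probabilities plus a weighted copy of the LM's log-probabilities over the same target vocabulary. It uses plain NumPy and is not OpenNMT-tf code; the function names, the `lm_weight` value, and the toy logits are all assumptions made for illustration. Because the two models are only combined at the score level, this variant would not require merging variables under one name scope.

```python
import numpy as np


def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    return shifted - np.log(np.sum(np.exp(shifted), axis=-1, keepdims=True))


def shallow_fusion_step(tm_logits, lm_logits, lm_weight=0.3):
    """Fuse one decoding step's scores from two separately trained models.

    tm_logits: seq2seq (translation model) logits, shape [..., vocab]
    lm_logits: language model logits, same shape and same vocabulary (assumed)
    lm_weight: hypothetical interpolation weight, normally tuned on a dev set
    """
    tm_log_probs = log_softmax(np.asarray(tm_logits, dtype=np.float64))
    lm_log_probs = log_softmax(np.asarray(lm_logits, dtype=np.float64))
    # Shallow fusion: log p_TM(y | x, y_<t) + lm_weight * log p_LM(y | y_<t)
    return tm_log_probs + lm_weight * lm_log_probs


# Toy example with a 3-token vocabulary (values are made up).
tm_logits = np.array([2.0, 1.9, 0.1])  # translation model slightly prefers token 0
lm_logits = np.array([0.0, 3.0, 0.0])  # language model strongly prefers token 1

best_without_lm = int(np.argmax(log_softmax(tm_logits)))
fused = shallow_fusion_step(tm_logits, lm_logits, lm_weight=0.5)
best_with_lm = int(np.argmax(fused))
print(best_without_lm, best_with_lm)
```

In beam search, the fused scores would replace the translation model's log-probabilities when ranking candidate tokens, with the LM state advanced alongside each hypothesis.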

VictorChen2012 · 2019-09-22T02:53:39Z

> Shallow fusion should be the most accessible but it may not be easy to integrate at this time. However, there are some incoming changes that should facilitate such combinations.
>
> I'm interested in supporting shallow fusion in the near future.

Thanks for your quick reply! I'll see if I can contribute then.

lkluo · 2020-11-05T08:49:59Z

Any update?

guillaumekln · 2020-11-12T07:43:32Z

No one is currently working on this as far as I know.

guillaumekln added the enhancement label Sep 20, 2019
