Improve CTranslate2 wrapping in translation_server #2001

francoishernandez · 2021-01-27T08:41:01Z

https://forum.opennmt.net/t/ctranslate2-on-opennmt-py-server/4175/8

guillaumekln · 2021-03-16T10:04:27Z

After reviewing the code, here's what could be improved:

Make the following translator parameters configurable:
- inter_threads
- intra_threads
- compute_type
Allow parallel translations as supported by CTranslate2: I tried to enable that but even though the waitress module is multi-threaded and accepts concurrent requests, it seems the requests are then processed sequentially
Revise the unloading mechanism to not assume the model is running on the GPU
Maybe cleanup the initial dummy translation: the first translation has a higher latency on GPU but this was improved in recent versions (I think it's around 200 ms now)

guillaumekln · 2021-11-18T12:11:47Z

I tried to enable that but even though the waitress module is multi-threaded and accepts concurrent requests, it seems the requests are then processed sequentially

I did not realize that the translation method is inside a critical section. Note this is not needed for CTranslate2: the translation and model loading/unloading are fully thread safe. So removing the critical section for CTranslate2 can improve the scalability of the server for CPU translations with inter_threads > 1 and multi-GPU translations.

vince62s · 2023-02-13T08:46:49Z

@francoishernandez @pltrdy do you recall why this #1108 was introduced ?
threads memoery leakages ?

pltrdy · 2023-02-21T12:05:10Z

I think that in the translation server loading/unloading and even running a model was not thread safe. I don't know anything about CTranslate 2 tho, so I can't tell how they differ

souleymanefall176 · 2023-11-13T00:09:46Z

"Good evening. I have an issue. When I run the command (ct2-opennmt-py-converter --model_path averaged-10-epoch.pt --output_dir ende_ctranslate2 --quantization int8), I get this error (ModuleNotFoundError: No module named 'onmt.inputters.text_dataset')."

francoishernandez added contributions welcome type:enhancement labels Jan 27, 2021

900groove mentioned this issue Aug 13, 2021

add ct2 server parameters #2088

Closed

guillaumekln mentioned this issue May 5, 2022

Translation on Multiple GPUs with device_index in v.2.15.0+ OpenNMT/CTranslate2#786

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve CTranslate2 wrapping in translation_server #2001

Improve CTranslate2 wrapping in translation_server #2001

francoishernandez commented Jan 27, 2021

guillaumekln commented Mar 16, 2021 •

edited

Loading

guillaumekln commented Nov 18, 2021

vince62s commented Feb 13, 2023

pltrdy commented Feb 21, 2023

souleymanefall176 commented Nov 13, 2023

Improve CTranslate2 wrapping in translation_server #2001

Improve CTranslate2 wrapping in translation_server #2001

Comments

francoishernandez commented Jan 27, 2021

guillaumekln commented Mar 16, 2021 • edited Loading

guillaumekln commented Nov 18, 2021

vince62s commented Feb 13, 2023

pltrdy commented Feb 21, 2023

souleymanefall176 commented Nov 13, 2023

guillaumekln commented Mar 16, 2021 •

edited

Loading