Llama device #548

ShawnXuan · 2024-09-03T10:23:42Z

This PR introduces the ability to specify the target device (e.g., CUDA, NPU, XPU) for Llama inference. The changes allow users to select the desired device through configuration, improving flexibility across different hardware platforms.

npu

python projects/Llama/pipeline.py --device=npu --mode=huggingface --config_file=projects/Llama/configs/llama_config_npu.py

xpu

python projects/Llama/pipeline.py --device=xpu --mode=huggingface --config_file=projects/Llama/configs/llama_config_xpu.py

cuda

Please update the projects/Llama/configs/llama_config.py file to configure the model path and tokenizer path.

python projects/Llama/pipeline.py
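For reviewers, here is a minimal sketch of how the `--device` flag could be resolved to a config file. It is not the code in `projects/Llama/pipeline.py`. The config paths are copied from the commands above. The `parse_args` helper, the `DEFAULT_CONFIGS` table, the `huggingface` default for `--mode`, and the per-device fallback when `--config_file` is omitted are all assumptions made for illustration.

```python
import argparse

# Config paths copied from the commands in this PR description.
DEFAULT_CONFIGS = {
    "cuda": "projects/Llama/configs/llama_config.py",
    "npu": "projects/Llama/configs/llama_config_npu.py",
    "xpu": "projects/Llama/configs/llama_config_xpu.py",
}


def parse_args(argv=None):
    """Hypothetical argument parser; not the actual pipeline.py code."""
    parser = argparse.ArgumentParser(description="Llama inference (sketch)")
    # Running with no flags targets cuda, matching the plain command above.
    parser.add_argument("--device", choices=sorted(DEFAULT_CONFIGS), default="cuda")
    # Assumption: the default mode is not stated in the PR description.
    parser.add_argument("--mode", default="huggingface")
    parser.add_argument("--config_file", default=None)
    args = parser.parse_args(argv)
    # Assumption: fall back to the per-device config when none is given.
    if args.config_file is None:
        args.config_file = DEFAULT_CONFIGS[args.device]
    return args
```

With this sketch, `parse_args(["--device=npu"])` would select the NPU config file, and an explicit `--config_file` always takes precedence over the fallback.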

xiezipeng-ML

no problem

* feat: support third-party device oneflow extensions; also refactor the build process of the model and tokenizer using the pretrained_model_path config
* refactor: remove unnecessary config and warnings
* docs: update readme for commands to run llama on npu and xpu

ShawnXuan added 6 commits September 3, 2024 08:30

update llama for multi devices

9b77d17

xpu and npu config files

87b2c41

update device for inference

f87b713

update

9ba5a65

update

336c481

update README

24e9c1a

ShawnXuan requested review from fpzh2011, 0x404, Flowingsun007 and xiezipeng-ML September 3, 2024 10:23

ShawnXuan added 4 commits September 3, 2024 18:24

Merge branch 'main' into llama_device

6e802a5

update

032664a

Merge branch 'llama_device' of github.com:Oneflow-Inc/libai into llama_device

07de0de

format

6f921cb

ShawnXuan requested a review from oneflow-ci-bot September 4, 2024 06:59

format

a238a4b

ShawnXuan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 07:09

fix

d4bd6db

ShawnXuan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 07:19

xiezipeng-ML approved these changes Sep 4, 2024

ShawnXuan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 08:41

fix import order

a030a1b

ShawnXuan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 08:46

update

942556f

ShawnXuan removed the request for review from oneflow-ci-bot September 4, 2024 08:52

ShawnXuan requested a review from oneflow-ci-bot September 4, 2024 08:52

update

8cfd032

ShawnXuan requested review from xiezipeng-ML and oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 08:55

fix: skip lint on oneflow third-party imports

9aa8b06

0x404 requested review from oneflow-ci-bot and removed request for oneflow-ci-bot September 4, 2024 10:05

0x404 approved these changes Sep 4, 2024

fpzh2011 approved these changes Sep 4, 2024

ShawnXuan merged commit 1efccd8 into main Sep 5, 2024
2 checks passed

ShawnXuan deleted the llama_device branch September 5, 2024 02:25
