Actions: tenstorrent/vllm

All workflows

Showing runs from all workflows
14 workflow runs

Update vLLM commit in tt-metal readme
PR Reminder Comment Bot #9: Pull request #34 opened by skhorasganiTT
November 6, 2024 14:45 12s
Update TTModelRunner due to decode rope changes for llama70b
PR Reminder Comment Bot #8: Pull request #33 opened by skhorasganiTT
November 6, 2024 14:42 11s
[Bugfix] #31 _make_sampler_output return expected SequenceOutput output_token: int
PR Reminder Comment Bot #7: Pull request #32 opened by tstescoTT
October 31, 2024 03:55 14s
Import tt-metal model via pythonpath instead of symlink
PR Reminder Comment Bot #6: Pull request #30 opened by skhorasganiTT
October 29, 2024 17:21 11s
Update vLLM commit in tt_metal README.md
PR Reminder Comment Bot #5: Pull request #28 opened by skhorasganiTT
October 28, 2024 20:28 13s
[Hardware][Tenstorrent] Modify offline_inference_tt.py to include max_tokens arg
PR Reminder Comment Bot #2: Pull request #25 opened by milank94
October 21, 2024 11:17 10s
Update vLLM commit in README.md
PR Reminder Comment Bot #1: Pull request #24 opened by skhorasganiTT
October 17, 2024 22:44 10s
[Bugfix] Print warnings related to mistral_common tokenizer only on…
Lint GitHub Actions workflows #1: Commit d615b5c pushed by skhorasganiTT
October 17, 2024 22:36 16s main
October 17, 2024 22:36 26s
October 17, 2024 22:36 39s
October 17, 2024 22:36 2m 27s
[Bugfix] Print warnings related to mistral_common tokenizer only on…
clang-format #1: Commit d615b5c pushed by skhorasganiTT
October 17, 2024 22:36 17s main