habana_main rebase v6 #182
Commits on Jul 30, 2024
- f058403
- 5cf9254
- cbbc904
- 5895b24
- 052b6f8
- d7a299e
- 6ca8031 [core][misc] improve free_finished_seq_groups (vllm-project#6865)
  Co-authored-by: Woosuk Kwon
- 40c27a7
- 79319ce
- fb4f530
Commits on Jul 31, 2024
- c32ab8b [Speculative decoding] Add serving benchmark for llama3 70b + speculative decoding (vllm-project#6964)
- da1f7cc
- f230cc2
- 9f0e69b
- 533d193
- c0644cf
- 6512937
- 2f4e108 [Bugfix] Clean up MiniCPM-V (vllm-project#6939)
  Co-authored-by: hezhihui
  Co-authored-by: Cyrus Leung
- daed30c
- 2ee8d3b
- bd70013 [MISC] Introduce pipeline parallelism partition strategies (vllm-project#6920)
  Co-authored-by: youkaichao
- 460c188
- 93548eb [Kernel] Enable FP8 Cutlass for Ada Lovelace (vllm-project#6950)
  Co-authored-by: Varun Sundar Rabindranath
- 35e9c12 [Kernel] Tuned int8 Cutlass Kernels for SM75 (T4) (vllm-project#6996)
  Co-authored-by: Varun Sundar Rabindranath
- a0dce93
- 7eb0cb4 Revert "[Frontend] Factor out code for running uvicorn" (vllm-project#7012)
  Co-authored-by: Robert Shaw
Commits on Aug 1, 2024
- 7ecee34
- 1d2e7fb
- 23993a7
- 630dd9e [Bugfix][Model] Skip loading lm_head weights if using tie_word_embeddings (vllm-project#6758)
  Signed-off-by: Travis Johnson
- 0437492 PP comm optimization: replace send with partial send + allgather (vllm-project#6695)
  Co-authored-by: Aurick Qiao
- 3c10591 [Bugfix] Set SamplingParams.max_tokens for OpenAI requests if not provided by user (vllm-project#6954)
- c8a7e93
- a72a424
- 7e0861b [CI/Build] Update PyTorch to 2.4.0 (vllm-project#6951)
  Co-authored-by: Michael Goin
- 2dd3437
- fb3db61
- f4fd390
- fc912e0 [Models] Support Qwen model with PP (vllm-project#6974)
  Signed-off-by: Muralidhar Andoorveedu
- 562e580
- 805a8a7
- 6a11fdf
Commits on Aug 2, 2024
- 6ce01f3
- 954f730
- 3bb4b1e
- 2523577
- cf2a1a4
- 660dea1
- db35186
- c16eaac
- 8069495
- b482b9a
- a8d604c
- 0530889 [Core] Pipeline parallel with Ray ADAG (vllm-project#6837)
  Support pipeline-parallelism with Ray accelerated DAG.
  Signed-off-by: Rui Qiao
- 22e718f [Misc] Revive to use loopback address for driver IP (vllm-project#7091)
  Signed-off-by: Rui Qiao
- 7089893
Commits on Aug 3, 2024
- ed812a7 [ Frontend ] Multiprocessing for OpenAI Server with `zeromq` (vllm-project#6883)
  Signed-off-by: Joe Runde
  Co-authored-by: Joe Runde
  Co-authored-by: Nick Hill
  Co-authored-by: Simon Mo
- 69ea15e
- 8c025fa
- 04e5583 [ci][distributed] merge distributed test commands (vllm-project#7097)
  Co-authored-by: Cyrus Leung
- a0d1645
- 0c25435
- fb2c1c8
- 99d7cab
- 67d745c
- 44dcb52
- 825b044 [Frontend] Warn if user `max_model_len` is greater than derived `max_model_len` (vllm-project#7080)
  Signed-off-by: Jefferson Fialho
  Co-authored-by: Nick Hill
Commits on Aug 4, 2024
- 654bc5c Support for guided decoding for offline LLM (vllm-project#6878)
  Co-authored-by: Cyrus Leung
- 9fadc7b
- 83c644f
- 179a6a3 [Model]Refactor MiniCPMV (vllm-project#7020)
  Co-authored-by: Cyrus Leung
- b1c9aa3 [Bugfix] [SpecDecode] Default speculative_draft_tensor_parallel_size to 1 when using MLPSpeculator (vllm-project#7105)
  Signed-off-by: Thomas Parnell
- 16a1cc9
- f80ab35
Commits on Aug 5, 2024
- 7b86e7c [Model] Add multi-image support for minicpmv (vllm-project#7122)
  Co-authored-by: hezhihui
  Co-authored-by: Cyrus Leung
- cc08fc7
- c0d8f16 [Model] SiglipVisionModel ported from transformers (vllm-project#6942)
  Co-authored-by: Roger Wang
- 82a1b1a [Speculative decoding] Add periodic log with time spent in proposal/scoring/verification (vllm-project#6963)
- e963045
- 003f8ee
- 57f560a
- 997cf78 [Misc] Fix typo in GroupCoordinator.recv() (vllm-project#7167)
  Signed-off-by: Rui Qiao
- 8571ac4
- 6e4852c
- 4cf1dc3
- 4db5176
- dfb1a15
- 789937a [Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100)
  Signed-off-by: Thomas Parnell
- 89b8db6 [Bugfix] Specify device when loading LoRA and embedding tensors (vllm-project#7129)
  Co-authored-by: Jacob Schein
- ef527be
- 360bd67 [Core] Support loading GGUF model (vllm-project#5191)
  Co-authored-by: Michael Goin
Commits on Aug 6, 2024
- e3c664b
- 9118217
- 1f26efb [Model] Support SigLIP encoder and alternative decoders for LLaVA models (vllm-project#7153)
  Co-authored-by: Roger Wang
- a3bbbfa
- 541c185
- 00afc78 [Bugfix] add gguf dependency (vllm-project#7198)
  Co-authored-by: katarzyna.papis
- 5c60c8c
- 8d59dbb [Kernel] Add per-tensor and per-token AZP epilogues (vllm-project#5941)
  Co-authored-by: Tyler Michael Smith
- 660470e
- fd95e02 [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (vllm-project#4942)
  Co-authored-by: Andrew Feldman
  Co-authored-by: Nick Hill
Commits on Aug 7, 2024
- f9a5600
- 9a3f49a
- 2385c8f
- 7b26109
- 66d617e [Frontend] Gracefully handle missing chat template and fix CI failure (vllm-project#7238)
  Co-authored-by: Roger Wang
- 639159b
- 0f7052b [Misc] Refactor linear layer weight loading; introduce `BasevLLMParameter` and `weight_loader_v2` (vllm-project#5874)
- 5649857
- ab0f5e2 Fixes typo in function name (vllm-project#7275)
  Signed-off-by: Rafael Vasquez
- b764547 [Bugfix] Fix input processor for InternVL2 model (vllm-project#7164)
  Co-authored-by: Cyrus Leung
- 80cbe10
- 0e12cd6
- fde47d3 [BugFix] Fix frontend multiprocessing hang (vllm-project#7217)
  Signed-off-by: Max de Bayser
  Co-authored-by: Robert Shaw
- 5223199
- 469b3bc [ci] Make building wheels per commit optional (vllm-project#7278)
  Signed-off-by: kevin
- 311f743
- fc1493a
- 6d94420
- e53dfd3
Commits on Aug 8, 2024
- 7467096 [Misc] Fix typos in scheduler.py (vllm-project#7285)
  Signed-off-by: Rui Qiao
- 48abee9
- 6dffa4b
- 757ac70
- 5fb4a3f
- 21b9c49 [Frontend] Kill the server on engine death (vllm-project#6594)
  Signed-off-by: Joe Runde
- 782e53a
- e14fb22
- e904576
- 8334c39
- a049b10
- 5923532
Commits on Aug 9, 2024
- 0fa1490
- 7eb4a51
- 73388c0
- e02ac55
- 99b4cf5 [Bugfix] Fix speculative decoding with MLPSpeculator with padded vocabulary (vllm-project#7218)
  Signed-off-by: Travis Johnson
- 57b7be0 [Speculative decoding] [Multi-Step] decouple should_modify_greedy_probs_inplace (vllm-project#6971)
- b4e9528
- 07ab160 [Model][Jamba] Mamba cache single buffer (vllm-project#6739)
  Co-authored-by: Mor Zusman
- 67abdbb
- fc7b8d1
- 74af2bb
- 249b882 [Frontend] Support embeddings in the run_batch API (vllm-project#7132)
  Co-authored-by: Simon Mo
- 70d268a
- 933790c
- 5c6c54d
- 999ef0b
- baa2402
Commits on Aug 10, 2024
- 4c5d8e8
Commits on Aug 11, 2024
- 90bab18
- 4fb7b52
- c08e2b3
- 3860879
- 02b1988
- 6c8e595
Commits on Aug 12, 2024
- f020a62
- 86ab567
- ec2affa
- e6e42e4
- 24154f8
- d2bc451
- cfba4de
- 65950e8 [ci] Entrypoints run upon changes in vllm/ (vllm-project#7423)
  Signed-off-by: kevin
- 9b3e2ed [ci] Cancel fastcheck run when PR is marked ready (vllm-project#7427)
  Signed-off-by: kevin
- 1137f34 [ci] Cancel fastcheck when PR is ready (vllm-project#7433)
  Signed-off-by: kevin
- 6aa33cb
- 4ddc474
- a046f86 [Core/Bugfix] Add FP8 K/V Scale and dtype conversion for prefix/prefill Triton Kernel (vllm-project#7208)
  Co-authored-by: Cody Yu
- 91294d5
- 774cd1d
Commits on Aug 13, 2024
- 198d6a2 [Core] Shut down aDAG workers with clean async llm engine exit (vllm-project#7224)
  Signed-off-by: Rui Qiao
- 9ba85bc
- 97a6be9
- 5469146
- 7025b11
- 4d2dc50
- d6e634f
- e20233d
- f328349
- 212e87e