Actions: vllm-project/llm-compressor

PR Reminder Comment Bot

101 workflow runs

Check for config hidden size
PR Reminder Comment Bot #79: Pull request #840 opened by kylesayrs
October 11, 2024 20:37 2m 42s check-hidden_size
Only untie word embeddings
PR Reminder Comment Bot #78: Pull request #839 opened by kylesayrs
October 11, 2024 20:32 26s kylesayrs/fix-tied-tensors-patch
[Bugfix] Use weight parameter of linear layer
PR Reminder Comment Bot #76: Pull request #836 opened by kylesayrs
October 10, 2024 00:52 15s kylesayrs/fix-hessian-linear-weight
[Bugfix] DisableKVCache Context
PR Reminder Comment Bot #75: Pull request #834 opened by kylesayrs
October 9, 2024 19:45 10s kylesayrs/fix-use-cache
typo
PR Reminder Comment Bot #74: Pull request #833 opened by horheynm
October 9, 2024 18:00 12s fix-abs-path
Remove SparseAutoModelForCausalLM
PR Reminder Comment Bot #73: Pull request #832 opened by horheynm
October 9, 2024 16:46 12s remove-sparseAutoModelForCausalLM
Typehint nits
PR Reminder Comment Bot #70: Pull request #826 opened by kylesayrs
October 7, 2024 17:58 12s kylesayrs/fix-typehint
Install compressed-tensors after llm-compressor
PR Reminder Comment Bot #69: Pull request #825 opened by dbarbuzzi
October 7, 2024 14:14 12s dbarbuzzi:reorder-ct-install
Awq re implementation
PR Reminder Comment Bot #68: Pull request #824 opened by rahul-tuli
October 7, 2024 13:54 12s awq-re-implementation
Set Sparse compression to save_compressed
PR Reminder Comment Bot #67: Pull request #821 opened by rahul-tuli
October 6, 2024 21:50 11s set-sparse-compression-true
Fix import of ModelCompressor
PR Reminder Comment Bot #66: Pull request #776 opened by rahul-tuli
October 4, 2024 13:53 10s fix-import
[WIP] Example for 2:4 sparsity with w8a8
PR Reminder Comment Bot #65: Pull request #775 opened by mgoin
October 4, 2024 00:55 12s sparse-24-w8a8-example
Update workflows/actions
PR Reminder Comment Bot #64: Pull request #774 opened by dbarbuzzi
October 3, 2024 14:46 10s dbarbuzzi:update-workflow-actions
update test
PR Reminder Comment Bot #63: Pull request #773 opened by dsikka
October 3, 2024 01:47 17s update_test
Fix 2/4 GPTQ Model Tests
PR Reminder Comment Bot #62: Pull request #769 opened by dsikka
October 2, 2024 20:49 10s fix_gptq_oneshot
e2e tests
PR Reminder Comment Bot #59: Pull request #742 opened by horheynm
October 1, 2024 16:16 13s kv-cache-e2e
Rename to quantization config
PR Reminder Comment Bot #58: Pull request #730 opened by kylesayrs
September 29, 2024 20:44 11s kylesayrs/rename_to_quantization_config
Add AutoModelForCausalLM example
PR Reminder Comment Bot #56: Pull request #698 opened by dsikka
September 27, 2024 20:09 11s add_automodel_example
Model Initialization Context
PR Reminder Comment Bot #55: Pull request #695 opened by kylesayrs
September 27, 2024 15:14 11s kylesayrs/fast-load-context
Move wrapper definition
PR Reminder Comment Bot #54: Pull request #694 opened by kylesayrs
September 27, 2024 14:48 16s kylesayrs/move-hf_wrap
Increase Sparsity Threshold for compressors
PR Reminder Comment Bot #52: Pull request #679 opened by rahul-tuli
September 26, 2024 12:50 14s update-sparsity-threshold