[Bugfix] DisableKVCache Context #834

Open · wants to merge 9 commits into main

Conversation

kylesayrs (Collaborator) commented on Oct 9, 2024

Purpose

Changes

  • Add a DisableKVCache context which checks for use_cache on the config and, for nested configs such as MllamaConfig, for text_config.use_cache (a rough sketch follows below)
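
For reference, a minimal sketch of what such a context could look like, written as a standalone helper. This is not the PR's actual code (which lives in src/llmcompressor/utils/helpers.py); the function name and exact attribute checks below are illustrative assumptions:

import contextlib

from transformers import PreTrainedModel


@contextlib.contextmanager
def disable_kv_cache(model: PreTrainedModel):
    """Temporarily disable KV caching on `model`, restoring the original value on exit."""
    # Locate the config object that owns `use_cache`: flat configs
    # (e.g. LlamaConfig) expose it directly, while nested multimodal
    # configs (e.g. MllamaConfig) keep it on `text_config`.
    if hasattr(model.config, "use_cache"):
        config = model.config
    elif hasattr(model.config, "text_config") and hasattr(
        model.config.text_config, "use_cache"
    ):
        config = model.config.text_config
    else:
        # unknown config structure
        raise NotImplementedError(
            f"Cannot find `use_cache` for config of type {type(model.config)}"
        )

    restore_value = config.use_cache
    try:
        config.use_cache = False
        yield
    finally:
        config.use_cache = restore_value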

Testing

  • Previously, attempting a forward pass with meta-llama/Llama-3.2-11B-Vision-Instruct raised an AttributeError. It now runs normally with model.config.text_config.use_cache == False
  • Regression tested cache disabling with meta-llama/Meta-Llama-3-8B-Instruct (see the usage sketch below)
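
A usage sketch along the lines of the regression test above, assuming the context is importable as DisableKVCache from llmcompressor.utils.helpers and takes the model as its only argument (both are assumptions; the merged API may differ):

from transformers import AutoModelForCausalLM, AutoTokenizer

from llmcompressor.utils.helpers import DisableKVCache  # assumed import path and name

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)
inputs = tokenizer("Hello", return_tensors="pt")

with DisableKVCache(model):
    # caching is disabled for the duration of the context
    assert model.config.use_cache is False
    outputs = model(**inputs)
    # no KV cache is returned while caching is disabled
    assert outputs.past_key_values is None

# the original setting is restored once the context exits
assert model.config.use_cache is True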


github-actions bot commented Oct 9, 2024

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

@kylesayrs changed the title from Use Cache to DisableKVCache Context on Oct 9, 2024
@kylesayrs self-assigned this on Oct 9, 2024
@kylesayrs marked this pull request as ready for review on October 10, 2024 18:09
@dsikka (Collaborator) left a comment:
test?

@kylesayrs changed the title from DisableKVCache Context to [Bugfix] DisableKVCache Context on Oct 17, 2024
@rahul-tuli previously approved these changes on Oct 17, 2024
src/llmcompressor/utils/helpers.py (review thread, outdated and resolved)
Comment on lines +1071 to +1075
# unknown config structure
else:
raise NotImplementedError(
f"Cannot find `use_cache` for config of type {type(model.config)}"
)
A collaborator left a comment:
Could you try this on other models that don't have use_cache in their config, such as Llava (https://huggingface.co/llava-hf/llava-1.5-7b-hf)?

@kylesayrs (Collaborator, Author) commented:
There are likely many ways to achieve this, including passing the use_cache argument during the forward pass (a rough sketch of that alternative follows). However, the config-level approach is known to at least work on the models we've tested.
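
For illustration only, the forward-argument route mentioned above would look roughly like this; it leaves the config untouched but has to be threaded through every call site:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # any causal LM works here
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)
inputs = tokenizer("Hello", return_tensors="pt")

with torch.no_grad():
    # disable the cache for this call only
    outputs = model(**inputs, use_cache=False)

assert outputs.past_key_values is None  # no KV cache was built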

@kylesayrs (Collaborator, Author) commented:
Pending testing with Llava models.

Development

Successfully merging this pull request may close these issues.

AttributeError: 'MllamaConfig' object has no attribute 'use_cache'
4 participants