
[Bug] AWQ quantization of InternVL2 20B produces meaningless garbled output #2650

Open
diandianliu opened this issue Oct 24, 2024 · 7 comments

@diandianliu
diandianliu commented Oct 24, 2024

Checklist

  • 1. I have searched related issues but cannot get the expected help.
  • 2. The bug has not been fixed in the latest version.
  • 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.

Describe the bug

After fine-tuning, the model (InternVL2-20B) quantized with AWQ outputs meaningless garbled text.

Reproduction

Quantization (the development environment has no internet access, so ptb_text_only was downloaded beforehand with a script)
Download script:
from datasets import load_dataset
traindata = load_dataset('ptb_text_only', 'penn_treebank', split='train')
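Since the quantization machine is offline, one common pattern (my suggestion, not something stated in the thread; paths are illustrative) is to populate the Hugging Face cache on a connected machine, copy it over, and then force offline mode so `load_dataset` resolves purely from the local cache:

```shell
# On a machine with internet access: download once so the cache is populated
python -c "from datasets import load_dataset; load_dataset('ptb_text_only', 'penn_treebank', split='train')"

# Copy the cache to the offline box (host and paths are illustrative)
rsync -a ~/.cache/huggingface/ offline-host:/root/mydataset/

# On the offline box: point the libraries at the copied cache and forbid network lookups
export HF_HOME=/root/mydataset
export HF_DATASETS_OFFLINE=1
export TRANSFORMERS_OFFLINE=1
```

With `HF_DATASETS_OFFLINE=1` set, `load_dataset('ptb_text_only', ...)` will fail fast with a clear error if the cache copy is incomplete, instead of hanging on a network call.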

lmdeploy lite auto_awq \
  /root/models/internvl2-26B \
  --calib-dataset 'ptb' \
  --calib-samples 128 \
  --calib-seqlen 2048 \
  --w-bits 4 \
  --w-group-size 128 \
  --work-dir /root/models/internvl2-26B_awq_4bit

Run
lmdeploy serve api_server \
  /root/models/internvl2-26B_awq_4bit \
  --server-name 0.0.0.0 \
  --server-port 23333 \
  --tp 2

Environment

Python 3.9.19
NVIDIA V100
PyTorch 2.2.2+cu121
TorchVision 0.17.2+cu121
LMDeploy 0.5.1+unknown
transformers 4.43.3

-- the machine is on an isolated network, so I cannot paste the full environment output

Error traceback

A warning appeared during quantization; I am not sure whether it matters:
Using the latest cached version of the module from /root/mydataset/modules/datasets_modules/datasets/ptb_text_only/8d1b97746fb9765d140e569ec5ddd35e20af4d37761f5e1bf357ea0b081f2c1f (last modified on Sat Feb 10 16:50:50 2024) since it couldn't be found locally at ptb_text_only
Token indices sequence length is longer than the specified maximum sequence length for this model (1085165> 4096). Running this sequence through the model will result in indexing errors
@sjzhou4

sjzhou4 commented Oct 24, 2024

My understanding is that quantizing a VL model differs from AWQ-quantizing a plain LM. AWQ calibration runs a dataset through the model, but a VL model's input is image features plus the query (including embeddings), so calibrating with text queries alone should give quite poor results.

@diandianliu
Author

> My understanding is that quantizing a VL model differs from AWQ-quantizing a plain LM. AWQ calibration runs a dataset through the model, but a VL model's input is image features plus the query (including embeddings), so calibrating with text queries alone should give quite poor results.

The answers are not even sentences. The tutorial I followed only uses lmdeploy lite auto_awq for quantization; I don't know which step went wrong.


@AllentDan
Collaborator

I don't follow: the issue says it is an internvl model, but the commands all referenced internlm. Two variables worth isolating:

  1. The model was fine-tuned
  2. The dataset was downloaded manually

@diandianliu
Author

> I don't follow: the issue says it is an internvl model, but the commands all referenced internlm. Two variables worth isolating:
>   1. The model was fine-tuned
>   2. The dataset was downloaded manually

Sorry, the paths were written wrong; they are corrected now. I am about to try the original (un-finetuned) model. Is the procedure itself correct, i.e. only running lmdeploy lite auto_awq with no other steps?

@diandianliu
Author

> I don't follow: the issue says it is an internvl model, but the commands all referenced internlm. Two variables worth isolating:
>   1. The model was fine-tuned
>   2. The dataset was downloaded manually

Does this message during quantization have any impact?
Token indices sequence length is longer than the specified maximum sequence length for this model (1085165> 4096). Running this sequence through the model will result in indexing errors

@AllentDan
Collaborator

Could the tokenizer of your fine-tuned model be the problem? 1085165 > 4096 is a very large gap. Other models also print this warning, yet they quantize without issues.
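For context, this warning is usually benign during calibration: the calibration corpus is tokenized as one long stream, which trips the tokenizer's model-max-length check, and the calibrator then slices that stream into fixed windows of --calib-seqlen tokens, so no single sample ever exceeds the context limit. A minimal sketch of that slicing idea (my illustration of the general pattern, not lmdeploy's actual code):

```python
# Illustrative sketch: a calibration corpus that tokenizes to far more tokens
# than the model's max length is still usable, because it is cut into fixed
# windows before being fed through the model.
def make_calib_samples(token_ids, calib_samples=128, calib_seqlen=2048):
    """Slice one long token stream into up to `calib_samples` windows of `calib_seqlen`."""
    windows = []
    for i in range(calib_samples):
        start = i * calib_seqlen
        chunk = token_ids[start:start + calib_seqlen]
        if len(chunk) < calib_seqlen:  # not enough tokens left for a full window
            break
        windows.append(chunk)
    return windows

# A fake "corpus" far longer than a 4096-token context window:
stream = list(range(1_000_000))
samples = make_calib_samples(stream)
print(len(samples), len(samples[0]))  # 128 2048
```

So a huge total token count by itself does not explain garbled output; a mismatch between the fine-tuned model's tokenizer and its weights would.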


3 participants