
Evaluation reports an extremely large value when quantizing to 4-bit #105

Open

JiachuanDENG opened this issue Jun 7, 2023 · 1 comment

@JiachuanDENG

I followed the steps to produce a 4-bit version of llama-7b using the command `python -m llama.llama_quant decapoda-research/llama-7b-hf c4 --wbits 4 --groupsize 128 --save pyllama-7B4b.pt`. The script runs to completion, but at the evaluation stage it reports a very large perplexity: 251086.96875.

[Screenshot: evaluation output showing the value 251086.96875]
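For comparison, here is a minimal sketch of a perplexity sanity check on the unquantized FP16 model, assuming the checkpoint still loads with plain `transformers` (the decapoda checkpoint is old and may need tokenizer fixes). A healthy llama-7b should land somewhere in the single digits on ordinary English text, so a value in the hundreds of thousands points at broken quantization rather than a noisy measurement:

```python
# Hedged sketch: baseline perplexity of the FP16 model, for comparison
# against the 4-bit evaluation number. Assumes the checkpoint loads via
# transformers; model_id and the probe text are illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "decapoda-research/llama-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)
model.eval()

text = "The quick brown fox jumps over the lazy dog. " * 50
enc = tokenizer(text, return_tensors="pt").to(model.device)

with torch.no_grad():
    # Passing labels=input_ids makes the model return mean cross-entropy
    out = model(**enc, labels=enc["input_ids"])

print(f"baseline perplexity: {torch.exp(out.loss).item():.2f}")
```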

And when I test with the quantized .pt file, the model returns unreadable results.

[Screenshot: generation output from the quantized model, showing garbled text]
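One quick way to narrow this down is to inspect the saved checkpoint directly for NaN/Inf or wildly scaled tensors, which would explain both the huge perplexity and the garbled generations. A hedged sketch, assuming `pyllama-7B4b.pt` is (or wraps) a plain state dict of tensors:

```python
# Hedged debugging sketch: scan the saved 4-bit checkpoint for NaN/Inf
# or suspiciously large values. The "model" key lookup is a guess at the
# checkpoint layout; adjust if torch.load returns something else.
import torch

ckpt = torch.load("pyllama-7B4b.pt", map_location="cpu")
state = ckpt.get("model", ckpt) if isinstance(ckpt, dict) else ckpt

for name, t in state.items():
    if not torch.is_tensor(t) or not t.is_floating_point():
        continue
    if torch.isnan(t).any() or torch.isinf(t).any():
        print(f"{name}: contains NaN/Inf")
    elif t.abs().max() > 1e4:
        print(f"{name}: suspicious max abs value {t.abs().max().item():.3e}")
```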

Has anyone run into the same problem?

@rapidAmbakar

Yes, same issue, exactly the same.
