You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
环境依赖安装的没问题,操作系统是windows server2022,显卡NVIDIA A40,模型可以加载,使用chatglm3-6b模型和chatglm3-6b-128k模型都会提示警告:“1torch was not compiled with flash attention.”,怀疑是系统问题,安装了wsl,用ubuntu20.04系统报错消失。chatglm3-6b模型可以正常使用,可是chatglm3-6b-128k模型不管在哪个系统都会提示AssertionError:tensor([[[模型加载进去了,可以交互一次,但是之后就报这个断言错误。实在不知道问题怎么解决,请求大家能否帮助我。
(中间是wsl下Ubuntu系统运行结果,其余为winserver 2022结果,模型都是chatglm3-6b-128k)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
环境依赖安装的没问题,操作系统是windows server2022,显卡NVIDIA A40,模型可以加载,使用chatglm3-6b模型和chatglm3-6b-128k模型都会提示警告:“1torch was not compiled with flash attention.”,怀疑是系统问题,安装了wsl,用ubuntu20.04系统报错消失。chatglm3-6b模型可以正常使用,可是chatglm3-6b-128k模型不管在哪个系统都会提示AssertionError:tensor([[[模型加载进去了,可以交互一次,但是之后就报这个断言错误。实在不知道问题怎么解决,请求大家能否帮助我。
(中间是wsl下Ubuntu系统运行结果,其余为winserver 2022结果,模型都是chatglm3-6b-128k)
Beta Was this translation helpful? Give feedback.
All reactions