关于32K训练咨询 #542
关于32K训练咨询
#542
-
你好,我看代码里面没有32K的代码,我想问下是不是我只需要把代码和32K的模型参数(https://huggingface.co/THUDM/chatglm3-6b-32k)下载下来,比如说我要微调chatmodel的,我就执行修改./scripts/finetune_pt.sh里面的模型参数路径到本地的32K的模型参数,启动训练就好了? |
Beta Was this translation helpful? Give feedback.
Answered by
zRzRzRzRzRzRzR
Dec 6, 2023
Replies: 2 comments 5 replies
-
32K模型代码仅能对话,没有Agent能力,所以也没有工具调用这些,其他可以直接替换 |
Beta Was this translation helpful? Give feedback.
2 replies
Answer selected by
zRzRzRzRzRzRzR
-
微调32K模型的时候,发现loss训练一小时后不再下降反而上升了,一开始是下降到1点多,然后最后又慢慢上升到6点多 |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
32K模型代码仅能对话,没有Agent能力,所以也没有工具调用这些,其他可以直接替换