Add huggingface back-end #14

jepler · 2023-09-29T15:18:22Z

Tested with mistral-7b-instruct model.

Also fix prompting to be "llama 2 instruct" style, see https://github.com/facebookresearch/llama/blob/v2/llama/generation.py

Fix prompt exclusion to work right/better

add stop tokens to llama.cpp requests and increase the first token timeout -- it was too short for pure-CPU inference.

this also works well with mistral-7b-instruct See https://github.com/facebookresearch/llama/blob/v2/llama/generation.py

defaults to mistral 7b instruct

jepler added 6 commits September 29, 2023 08:39

Add ability to toggle off history context in tui

9fe01de

Use llama2-instruct style prompting

ea03aa0

this also works well with mistral-7b-instruct See https://github.com/facebookresearch/llama/blob/v2/llama/generation.py

set some stop tokens

6792eb0

increase first-token timeout

90a4f17

Improve display of default string params with special chars

2c04964

Add huggingface back-end

b6fa44f

defaults to mistral 7b instruct

jepler merged commit f3bf17c into main Sep 29, 2023
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add huggingface back-end #14

Add huggingface back-end #14

jepler commented Sep 29, 2023

Add huggingface back-end #14

Add huggingface back-end #14

Conversation

jepler commented Sep 29, 2023