Skip to content

Commit

Permalink
llama 3.2 1b instruct q40 model.
Browse files Browse the repository at this point in the history
  • Loading branch information
b4rtaz committed Oct 13, 2024
1 parent 3353d56 commit bf2de45
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,7 @@ Python 3 and C++ compiler required. The command will download the model and the
| Llama 3 8B Instruct Q40 | Chat, API | 6.32 GB | `python launch.py llama3_8b_instruct_q40` |
| Llama 3.1 8B Instruct Q40 | Chat, API | 6.32 GB | `python launch.py llama3_1_8b_instruct_q40` |
| Llama 3.1 405B Instruct Q40 | Chat, API | 238 GB | `python launch.py llama3_1_405b_instruct_q40` |
| Llama 3.2 1B Instruct Q40 | Chat, API | 1.7 GB | `python launch.py llama3_2_1b_instruct_q40` |

### 🛠️ Convert Model Manually

Expand Down
7 changes: 7 additions & 0 deletions launch.py
Original file line number Diff line number Diff line change
Expand Up @@ -37,6 +37,11 @@ def parts(length):
'https://huggingface.co/b4rtaz/Llama-3_1-405B-Q40-Instruct-Distributed-Llama/resolve/main/dllama_tokenizer_llama_3_1.t?download=true',
'q40', 'q80', 'chat'
],
'llama3_2_1b_instruct_q40': [
['https://huggingface.co/b4rtaz/Llama-3_2-1B-Q40-Instruct-Distributed-Llama/resolve/main/dllama_model_llama3.2-1b-instruct_q40.m?download=true'],
'https://huggingface.co/b4rtaz/Llama-3_2-1B-Q40-Instruct-Distributed-Llama/resolve/main/dllama_tokenizer_llama3_2.t?download=true',
'q40', 'q80', 'chat', '--max-seq-len 8192'
],
}

def downloadFile(urls: str, path: str):
Expand Down Expand Up @@ -107,6 +112,8 @@ def printUsage():
else:
command = './dllama inference --steps 64 --prompt "Hello world"'
command += f' --model {modelPath} --tokenizer {tokenizerPath} --buffer-float-type {model[3]} --nthreads 4'
if (len(model) > 5):
command += f' {model[5]}'

print('To run Distributed Llama you need to execute:')
print('--- copy start ---')
Expand Down

0 comments on commit bf2de45

Please sign in to comment.