From dc0e94febf5c731361482451b42312c86a3c1e14 Mon Sep 17 00:00:00 2001
From: b4rtaz
Date: Thu, 25 Jul 2024 21:49:13 +0200
Subject: [PATCH] update readme.md.

---
 README.md | 13 +++++++------
 1 file changed, 7 insertions(+), 6 deletions(-)

diff --git a/README.md b/README.md
index 9be874f..0a33e4f 100644
--- a/README.md
+++ b/README.md
@@ -15,17 +15,18 @@ Tensor parallelism is all you need. Run LLMs on weak devices or make powerful de
 
 Python 3 and C++ compiler required. The command will download the model and the tokenizer.
 
-| Model                   | Purpose   | Size     | Command                                   |
-| ----------------------- | --------- | -------- | ----------------------------------------- |
-| TinyLlama 1.1B 3T Q40   | Benchmark | 844 MB   | `python launch.py tinyllama_1_1b_3t_q40`  |
-| Llama 3 8B Q40          | Benchmark | 6.32 GB  | `python launch.py llama3_8b_q40`          |
-| Llama 3 8B Instruct Q40 | Chat, API | 6.32 GB  | `python launch.py llama3_8b_instruct_q40` |
+| Model                     | Purpose   | Size     | Command                                     |
+| ------------------------- | --------- | -------- | ------------------------------------------- |
+| TinyLlama 1.1B 3T Q40     | Benchmark | 844 MB   | `python launch.py tinyllama_1_1b_3t_q40`    |
+| Llama 3 8B Q40            | Benchmark | 6.32 GB  | `python launch.py llama3_8b_q40`            |
+| Llama 3 8B Instruct Q40   | Chat, API | 6.32 GB  | `python launch.py llama3_8b_instruct_q40`   |
+| Llama 3.1 8B Instruct Q40 | Chat, API | 6.32 GB  | `python launch.py llama3_1_8b_instruct_q40` |
 
 ### 🛠️ Convert Model Manually
 
 Supported architectures: Llama, Mixtral, Grok
 
-* [How to Convert Llama 2, Llama 3](./docs/LLAMA.md)
+* [How to Convert Llama 2, Llama 3, Llama 3.1](./docs/LLAMA.md)
 * [How to Convert Hugging Face Model](./docs/HUGGINGFACE.md)
 
 ### 🚧 Known Limitations