Update README.md
README.md
CHANGED
@@ -5,12 +5,12 @@ tags:
 - green
 - p8
 - llmware-chat
--
+- onnx
 ---

-# llama-3.1-instruct-
+# llama-3.1-instruct-onnx

-**llama-3.1-instruct-ov
+**llama-3.1-instruct-ov** is an ONNX int4 quantized version of Llama 3.1 Instruct, providing a very fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.

 [**llama-3.1-instruct**](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) is a leading open source general foundation model from Meta.