Update README.md
README.md CHANGED
@@ -34,6 +34,25 @@ GGML files are for CPU + GPU inference using [llama.cpp](https://github.com/gger
 * [4-bit, 5-bit, and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/Nous-Hermes-13B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/NousResearch/Nous-Hermes-13b)
 
+## Prompt Template
+
+The model follows the Alpaca prompt format:
+```
+### Instruction:
+
+### Response:
+```
+
+or
+
+```
+### Instruction:
+
+### Input:
+
+### Response:
+```
+
 <!-- compatibility_ggml start -->
 ## Compatibility
 
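For readers who want to apply the template this commit adds, here is a minimal Python sketch of assembling a prompt in either variant. The `build_prompt` function and its argument names are illustrative assumptions, not part of the README or of any library.

```python
from typing import Optional

# Minimal sketch: build an Alpaca-style prompt in the two variants shown in
# the added "Prompt Template" section. Function and argument names here
# (build_prompt, instruction, input_text) are hypothetical.
def build_prompt(instruction: str, input_text: Optional[str] = None) -> str:
    if input_text:
        # Instruction + Input variant
        return (
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{input_text}\n\n"
            "### Response:\n"
        )
    # Instruction-only variant
    return (
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )

# Example usage: the model's reply is expected to follow "### Response:".
print(build_prompt("Write a haiku about quantised language models."))
```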