Update README.md
Browse files
README.md
CHANGED
@@ -106,7 +106,7 @@ uploads are small.
|
|
106 |
|
107 |
Through a combination of these ingenuous tricks:
|
108 |
|
109 |
-
1. I am not above using a low quant (e.g. Q4_K_S,
|
110 |
2. An nvme drive is "only" 25-50 times slower than RAM. I lock the first 80GB of the model in RAM, and
|
111 |
then stream the remaining data from disk for every iteration.
|
112 |
3. Patience.
|
|
|
106 |
|
107 |
Through a combination of these ingenuous tricks:
|
108 |
|
109 |
+
1. I am not above using a low quant (e.g. Q4_K_S, IQ3_XS or even Q2_K), reducing the size of the model.
|
110 |
2. An nvme drive is "only" 25-50 times slower than RAM. I lock the first 80GB of the model in RAM, and
|
111 |
then stream the remaining data from disk for every iteration.
|
112 |
3. Patience.
|