Commit 3016d0d
Author: laurentiubp
Parent(s): 025fd35

Update README.md

README.md CHANGED
@@ -27,7 +27,7 @@ CataLlama-v0.2 was trained on roughly **620 million new tokens** which is almost
 
 This new (V2) SFT Dataset was built mostly from scratch and it only retained parts of the V1.
 
-On top of the existing instructions in Catalan, **250k additional instructions were translated
+On top of the existing instructions in Catalan, **250k additional instructions were translated for this model.**
 
 All the English instructions existing in the V1 of the dataset were discarded and replaced with high quality instructions scored with [RLHFlow/ArmoRM-Llama3-8B-v0.1](https://huggingface.co/RLHFlow/ArmoRM-Llama3-8B-v0.1) reward model.
 