BramVanroy
commited on
Commit
·
8e395ff
1
Parent(s):
b960df1
Update README.md
Browse files
README.md
CHANGED
@@ -24,15 +24,18 @@ See the original [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/L
|
|
24 |
|
25 |
## Model description
|
26 |
|
27 |
-
I could not get Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
|
28 |
-
|
29 |
-
step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets).
|
|
|
30 |
|
31 |
- https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
|
32 |
- https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
|
33 |
- https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
|
34 |
- https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
|
35 |
|
|
|
|
|
36 |
|
37 |
## Intended uses & limitations
|
38 |
|
|
|
24 |
|
25 |
## Model description
|
26 |
|
27 |
+
I could not get the original Llama 2 13B to produce much Dutch, even though the description paper indicates that it was trained on a (small) portion of Dutch data. I therefore
|
28 |
+
continued training the original Llama 2 13B checkpoint on Dutch data [in regular CLM](https://huggingface.co/BramVanroy/llama2-13b-ft-mc4_nl_cleaned_tiny). In a second
|
29 |
+
step I finetuned that model on a collection of synthetic (translated) instruction and chat datasets that I have [collected](https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets).
|
30 |
+
See their pages for licensing, usage, creation, and citation information.
|
31 |
|
32 |
- https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch
|
33 |
- https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch-baize
|
34 |
- https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch
|
35 |
- https://huggingface.co/datasets/BramVanroy/quora-chat-dutch
|
36 |
|
37 |
+
This model is the result of that process. While not perfect by any means, it can perform reasonably well in Dutch depending on the prompts. It is also decent at helping with programming tasks.
|
38 |
+
|
39 |
|
40 |
## Intended uses & limitations
|
41 |
|