---
datasets:
- togethercomputer/RedPajama-Data-V2
language:
- de
pipeline_tag: text-generation
library_name: transformers
license: other
---
# LLäMmlein 1B
LLäMmlein 1B is a German 1B-parameter language model trained from scratch using the [TinyLlama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2).
Find more details on our [project page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/) and in our [preprint](https://arxiv.org/abs/2411.11171)!
### Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model weights and the matching tokenizer from the Hugging Face Hub
model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")
```
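Once loaded, the model can be used like any other causal LM in `transformers`. Below is a minimal generation sketch; the German prompt and the sampling parameters are illustrative assumptions, not settings recommended by the model authors:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")

# Tokenize a German prompt (hypothetical example) and return PyTorch tensors
prompt = "Die Würzburger Residenz ist"
inputs = tokenizer(prompt, return_tensors="pt")

# Sample a continuation; max_new_tokens and top_p are illustrative choices
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.9)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```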
### Evaluation
We evaluated our model on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.