---
datasets:
- togethercomputer/RedPajama-Data-V2
language:
- de
pipeline_tag: text-generation
library_name: transformers
license: other
---

# LLäMmlein 1B

This is a German 1B language model trained from scratch using the [TinyLlama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co./datasets/togethercomputer/RedPajama-Data-V2).
Find more details on our [project page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/) and in our [preprint](https://arxiv.org/abs/2411.11171)!


### Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the pretrained weights and matching tokenizer from the Hugging Face Hub
model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")
```
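
Since the model is pretrained from scratch on raw web text (this card mentions no instruction tuning), it is used for plain text completion. Below is a minimal generation sketch; the German prompt and sampling parameters are illustrative assumptions, not settings recommended by the authors.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_1B")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_1B")

# Hypothetical German prompt; any plain-text completion prompt works
prompt = "Die Hauptstadt von Deutschland ist"
inputs = tokenizer(prompt, return_tensors="pt")

# Sampling settings below are placeholder values, not tuned recommendations
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```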


### Evaluation
We evaluated the model on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.