---
library_name: transformers
tags: [Danish, Morphological Tokenization, CerebrasGPT]
---
```
_______ ___ .___ ___. ______ .______ .______ __ __
| \ / \ | \/ | / __ \ | _ \ | _ \ | | | |
| .--. | / ^ \ | \ / | | | | | | |_) | | |_) | | |__| |
| | | | / /_\ \ | |\/| | | | | | | / | ___/ | __ |
| '--' | / _____ \ | | | | | `--' | | |\ \----.| | | | | |
|_______/ /__/ \__\ |__| |__| \______/ | _| `._____|| _| |__| |__|
```
### DA-MORPH-CEREBRAS
This experimental model is built on the CerebrasGPT-111M architecture and uses a custom morphological tokenizer designed specifically for Danish. It explores how morphology-aware tokenization affects Danish text generation and understanding.
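
To make the idea of morphology-aware tokenization concrete, here is a minimal toy sketch in Python. It is not the tokenizer shipped with this model: the suffix list, the `##` continuation marker, and the function names `segment` and `tokenize` are all assumptions chosen for illustration. It only shows the general principle of splitting a Danish word into a stem and an inflectional suffix instead of relying on purely frequency-based subword merges.

```python
from typing import List

# Hypothetical, deliberately tiny list of Danish noun endings (illustration only).
DANISH_SUFFIXES = ["erne", "ene", "en", "et", "er", "e"]


def segment(word: str, min_stem: int = 2) -> List[str]:
    """Split a word into a stem plus at most one known suffix, longest match first."""
    lowered = word.lower()
    for suffix in sorted(DANISH_SUFFIXES, key=len, reverse=True):
        # Only split when the remaining stem is long enough to be plausible.
        if lowered.endswith(suffix) and len(lowered) - len(suffix) >= min_stem:
            return [lowered[: -len(suffix)], "##" + suffix]
    return [lowered]


def tokenize(text: str) -> List[str]:
    """Whitespace-split the text and segment each word independently."""
    tokens: List[str] = []
    for word in text.split():
        tokens.extend(segment(word))
    return tokens


if __name__ == "__main__":
    print(tokenize("Husene og bilerne"))
```

With this toy rule set, `husene` ("the houses") and `huset` ("the house") share the stem token `hus`, which is the kind of regularity a morphological tokenizer is meant to expose to the language model. A real tokenizer for Danish would also need to handle compounds, stem changes, and ambiguous endings, none of which this sketch attempts.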