---
library_name: transformers
tags: [Danish, Morphological Tokenization, CerebrasGPT]
---
```
 _______       ___      .___  ___.   ______   .______      .______    __    __  
|       \     /   \     |   \/   |  /  __  \  |   _  \     |   _  \  |  |  |  | 
|  .--.  |   /  ^  \    |  \  /  | |  |  |  | |  |_)  |    |  |_)  | |  |__|  | 
|  |  |  |  /  /_\  \   |  |\/|  | |  |  |  | |      /     |   ___/  |   __   | 
|  '--'  | /  _____  \  |  |  |  | |  `--'  | |  |\  \----.|  |      |  |  |  | 
|_______/ /__/     \__\ |__|  |__|  \______/  | _| `._____|| _|      |__|  |__| 
                                                                               
```

### DA-MORPH-CEREBRAS

This experimental model, built on the CerebrasGPT-111M architecture, uses a custom morphological tokenizer specifically designed for Danish. It explores the impact of morphology-aware tokenization on Danish text generation and understanding.
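To give a rough sense of what "morphology-aware" means here, the following is a toy sketch and not the tokenizer shipped with this model. It assumes a tiny hand-written list of Danish noun endings (`SUFFIXES`), a hypothetical `segment` helper, and a `##` continuation marker borrowed from WordPiece-style notation; the model's actual segmentation rules and vocabulary are not described in this card.

```python
# Toy illustration only: NOT the tokenizer used by this model.
# SUFFIXES is a small, hand-picked (hypothetical) list of Danish noun
# endings, ordered longest first so the longest match wins.
SUFFIXES = ["erne", "ene", "er", "en", "et", "e"]


def segment(word: str, min_stem: int = 3) -> list[str]:
    """Split a word into [stem, "##suffix"] when a known ending matches.

    The stem must keep at least `min_stem` characters; otherwise the
    word is returned unsplit (e.g. the article "en").
    """
    w = word.lower()
    for suffix in SUFFIXES:
        stem = w[: len(w) - len(suffix)]
        if w.endswith(suffix) and len(stem) >= min_stem:
            return [stem, "##" + suffix]
    return [w]


if __name__ == "__main__":
    for word in ["husene", "bilerne", "hunden", "huset", "bil", "en"]:
        print(word, "->", segment(word))
```

A subword tokenizer trained purely on frequency may split `husene` at arbitrary points, whereas a morphology-aware scheme aims to keep the stem (`hus`) and the inflectional ending (`ene`) as separate, reusable units. A real system would need a far richer analysis than this suffix list, including compounds, stem changes, and verb and adjective inflection.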