meelu
/

DA-MORPH-CEREBRAS-TOKEN

Morphological Tokenization

Model card Files Files and versions Community

MikkelWK commited on 15 days ago

Commit

90685c6

·

verified ·

1 Parent(s): 2204267

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -2,7 +2,15 @@
 library_name: tokenizers
 tags: [Danish, Morphological Tokenization, CerebrasGPT]
 ---
 ### DA-MORPH-CEREBRAS-TOKEN
 This morphological tokenizer is designed for the CerebrasGPT architecture and focuses on segmenting Danish text based on linguistic principles, enabling more meaningful subword tokenization.

 library_name: tokenizers
 tags: [Danish, Morphological Tokenization, CerebrasGPT]
 ---
+```
+ _______       ___      .___  ___.   ______   .______      .______    __    __
+|       \     /   \     |   \/   |  /  __  \  |   _  \     |   _  \  |  |  |  |
+|  .--.  |   /  ^  \    |  \  /  | |  |  |  | |  |_)  |    |  |_)  | |  |__|  |
+|  |  |  |  /  /_\  \   |  |\/|  | |  |  |  | |      /     |   ___/  |   __   |
+|  '--'  | /  _____  \  |  |  |  | |  `--'  | |  |\  \----.|  |      |  |  |  |
+|_______/ /__/     \__\ |__|  |__|  \______/  | _| `._____|| _|      |__|  |__|
+```
 ### DA-MORPH-CEREBRAS-TOKEN
 This morphological tokenizer is designed for the CerebrasGPT architecture and focuses on segmenting Danish text based on linguistic principles, enabling more meaningful subword tokenization.