lightblue
/

suzume-llama-3-8B-japanese-gguf

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

ptrdvn commited on May 22

Commit

8f6a1a6

•

1 Parent(s): 22a94b7

Update README.md

Files changed (1) hide show

README.md +21 -0

README.md CHANGED Viewed

@@ -17,6 +17,8 @@ model-index:
 # Suzume
 This Suzume 8B, a Japanese finetune of Llama 3.
 Llama 3 has exhibited excellent performance on many English language benchmarks.
@@ -157,3 +159,22 @@ The following hyperparameters were used during training:
 - Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.0

 # Suzume
+[[Paper](https://arxiv.org/abs/2405.12612)] [[Dataset](https://huggingface.co/datasets/lightblue/tagengo-gpt4)]
 This Suzume 8B, a Japanese finetune of Llama 3.
 Llama 3 has exhibited excellent performance on many English language benchmarks.
 - Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.0
+# How to cite
+Please cite [this paper](https://arxiv.org/abs/2405.12612) when referencing this model.
+```tex
+@misc{devine2024tagengo,
+      title={Tagengo: A Multilingual Chat Dataset},
+      author={Peter Devine},
+      year={2024},
+      eprint={2405.12612},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
+# Developer
+Peter Devine - ([ptrdvn](https://huggingface.co/ptrdvn))