Update README.md
README.md CHANGED
@@ -6,9 +6,9 @@ language:
 - en
 ---
 
-# Model Card for Breeze-7B-Instruct-v1_0
+# Model Card for MediaTek Research Breeze-7B-Instruct-v1_0
 
-Breeze-7B is a language model family that builds on top of [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1), specifically intended for Traditional Chinese use.
+MediaTek Research Breeze-7B (hereinafter referred to as Breeze-7B) is a language model family that builds on top of [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1), specifically intended for Traditional Chinese use.
 
 [Breeze-7B-Base](https://huggingface.co/MediaTek-Research/Breeze-7B-Base-v1_0) is the base model for the Breeze-7B series.
 It is suitable for use if you have substantial fine-tuning data to tune it for your specific use case.
@@ -17,7 +17,7 @@ It is suitable for use if you have substantial fine-tuning data to tune it for your specific use case.
 
 The current release version of Breeze-7B is v1.0, which has undergone a more refined training process compared to Breeze-7B-v0_1, resulting in significantly improved performance in both English and Traditional Chinese.
 
-For details of this model, please read our [paper](https://arxiv.org/abs/).
+For details of this model, please read our [paper](https://arxiv.org/abs/2403.02712).
 
 Practicality-wise:
 - Breeze-7B-Base expands the original vocabulary with an additional 30,000 Traditional Chinese tokens. With the expanded vocabulary, and everything else being equal, Breeze-7B operates at twice the inference speed for Traditional Chinese compared to Mistral-7B and Llama 7B. (See [Inference Performance](#inference-performance).)
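The inference-speed claim in the last bullet follows from the expanded tokenizer emitting fewer tokens per Traditional Chinese sentence, which means fewer decoding steps at generation time. Below is a minimal sketch of how one might sanity-check this, assuming the `MediaTek-Research/Breeze-7B-Instruct-v1_0` and `mistralai/Mistral-7B-v0.1` checkpoints are reachable on the Hugging Face Hub; the sample sentence is illustrative and not taken from the model card:

```python
# Sketch: compare token counts for the same Traditional Chinese text.
# Fewer tokens per sentence means fewer decoding steps, which is where
# the roughly 2x inference-speed claim for Breeze-7B comes from.
from transformers import AutoTokenizer

breeze = AutoTokenizer.from_pretrained("MediaTek-Research/Breeze-7B-Instruct-v1_0")
mistral = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

text = "今天天氣很好,我們去公園散步吧。"  # illustrative Traditional Chinese sentence

breeze_ids = breeze(text)["input_ids"]
mistral_ids = mistral(text)["input_ids"]

# With the additional 30,000 Traditional Chinese tokens, expect Breeze-7B
# to produce noticeably fewer tokens than Mistral-7B on the same input.
print(f"Breeze-7B tokens:  {len(breeze_ids)}")
print(f"Mistral-7B tokens: {len(mistral_ids)}")
```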