HODACHI committed · verified
Commit beaaf42 · 1 Parent(s): 40f6f6e

Update README.md

Files changed (1): README.md (+12 -3)
README.md CHANGED
@@ -1,3 +1,11 @@
+---
+license: gemma
+library_name: transformers
+pipeline_tag: text-generation
+tags:
+- conversational
+---
+
 # EZO model card
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/657e900beaad53ff67ba84db/0OYFqT8kACowa9bY1EZF6.png)
 **Terms of Use**: [Terms](https://www.kaggle.com/models/google/gemma/license/consent/verify/huggingface?returnModelRepoId=google/gemma-2-9b-it)
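The added front matter registers the license and wires the repo into the Hub's transformers integration; `pipeline_tag: text-generation` is what routes it to the text-generation pipeline. A minimal sketch of loading it that way, using a hypothetical repo id (the diff does not name the repository):

```python
from transformers import pipeline

# "HODACHI/EZO-gemma-2-9b-it" is a hypothetical repo id; substitute the
# actual EZO repository. The front matter's pipeline_tag is what makes
# the plain "text-generation" task resolve for this repo on the Hub.
generator = pipeline("text-generation", model="HODACHI/EZO-gemma-2-9b-it")
print(generator("Write a hello world program", max_new_tokens=150)[0]["generated_text"])
```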
@@ -42,7 +50,7 @@ outputs = model.generate(input_ids=inputs.to(model.device), max_new_tokens=150)
 print(tokenizer.decode(outputs[0]))
 ```
 
-Template
+### Template
 ```
 <bos><start_of_turn>user
 Write a hello world program<end_of_turn>
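The renamed Template section documents Gemma's turn format. The same prompt can be produced with the tokenizer's built-in chat template instead of hand-writing the control tokens; a minimal sketch, assuming the EZO tokenizer inherits the template from gemma-2-9b-it and again using a hypothetical repo id:

```python
from transformers import AutoTokenizer

# Hypothetical repo id; the diff does not name the repository.
tokenizer = AutoTokenizer.from_pretrained("HODACHI/EZO-gemma-2-9b-it")

messages = [{"role": "user", "content": "Write a hello world program"}]

# tokenize=False returns the formatted string; add_generation_prompt=True
# appends "<start_of_turn>model" so generation continues as the assistant.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
# Expected shape, per the Template section above:
# <bos><start_of_turn>user
# Write a hello world program<end_of_turn>
# <start_of_turn>model
```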
@@ -52,9 +60,10 @@ XXXXXX<end_of_turn><eos>
 
 ### Model Data
 Information about the data used for model training and how it was processed.
+
 #### Training Dataset
-We extracted high-quality data from Japanese Wikipedia and FineWeb to create instruction data. This diverse dataset ensures the model can handle a wide range of topics and languages, making it suitable for global use.
-日本語のWikiデータおよび、FineWebから良質なデータのみを抽出し、Instructionデータを作成しました。この多様なデータセットにより、モデルは幅広いトピックと言語を扱うことができ、グローバルな使用に適しています。
+We extracted high-quality data from Japanese Wikipedia and FineWeb to create instruction data. Our innovative training approach allows for performance improvements across various languages and domains, making the model suitable for global use despite its focus on Japanese data.
+日本語のWikiデータおよび、FineWebから良質なデータのみを抽出し、Instructionデータを作成しました。このモデルでは日本語に特化させていますが、世界中のどんなユースケースでも利用可能なアプローチです。
 https://huggingface.co/datasets/legacy-datasets/wikipedia
 https://huggingface.co/datasets/HuggingFaceFW/fineweb
 
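The Training Dataset paragraph links the two source corpora. A minimal sketch of streaming them with the datasets library; the Wikipedia config name and the quality heuristic below are assumptions, since the README does not state the actual extraction criteria:

```python
from datasets import load_dataset

# Stream rather than download the full corpora.
fineweb = load_dataset("HuggingFaceFW/fineweb", split="train", streaming=True)

# "20220301.ja" is an assumed dump/language config for the legacy script,
# which also requires trust_remote_code.
wiki_ja = load_dataset(
    "legacy-datasets/wikipedia",
    "20220301.ja",
    split="train",
    streaming=True,
    trust_remote_code=True,
)

def looks_high_quality(text: str) -> bool:
    # Stand-in heuristic; the actual filter is not described in the README.
    return len(text) > 200

# Take a small filtered sample from the Japanese Wikipedia stream.
sample = [ex["text"] for _, ex in zip(range(100), wiki_ja) if looks_high_quality(ex["text"])]
```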