bay-llm
/

gemma-9b-SFT-180-16bit

@@ -1,22 +1,105 @@
 ---
-base_model: unsloth/gemma-2-9b-bnb-4bit
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - gemma2
 - trl
-license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
 - **Developed by:** bay-llm
-- **License:** apache-2.0
 - **Finetuned from model :** unsloth/gemma-2-9b-bnb-4bit
 This gemma2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+base_model:
+- google/gemma-2-9b
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - gemma2
 - trl
+license: gemma
 language:
 - en
+- ja
+datasets:
+- kanhatakeyama/wizardlm8x22b-logical-math-coding-sft_additional-ja
+- kanhatakeyama/AutoMultiTurnByCalm3-22B
+- kanhatakeyama/ramdom-to-fixed-multiturn-Calm3
 ---
+# Model Card for bay-llm/gemma-9b-SFT-180-16bit
+Instruction tuning
+This model is a fine-tuned Gemma 2 9B, instruction-tuned (SFT) on the datasets listed in the metadata above.
+Usage
+```python
+!pip install vllm==0.6.4.post1 --force-reinstall
+import time
+import torch
+import transformers
+from transformers import (
+    AutoTokenizer,
+    AutoModelForCausalLM,
+)
+import vllm  # NOTE: raises an error unless packaging==24.1 is installed
+print(vllm.__version__)
+MAX_LENGTH = 1000
+MODEL_NAME = "bay-llm/gemma-9b-SFT-180-16bit"  # replace with the model you want to submit to the competition
+llm = vllm.LLM(
+    model=MODEL_NAME,
+    tensor_parallel_size=1,
+    gpu_memory_utilization=0.95,
+    trust_remote_code=True,
+    max_model_len=1024,
+)
+tokenizer = llm.get_tokenizer()
+# Load ELYZA-tasks-100-TV. Upload the file beforehand.
+# Load the dataset.
+# In the omnicampus development environment, drag and drop the task jsonl into the left pane before running.
+import json
+datasets = []
+with open("../elyza-tasks-100-TV_0.jsonl", "r", encoding="utf-8") as f:
+    item = ""
+    # A record may span several lines: accumulate text until it ends with "}".
+    for line in f:
+        line = line.strip()
+        item += line
+        if item.endswith("}"):
+            datasets.append(json.loads(item))
+            item = ""
+print(datasets[0])
+messages_list = [
+    [{"role": "user", "content": datasets[i]["input"]}] for i in range(len(datasets))
+]
+prompts = [line[0]["content"] for line in messages_list]
+prompt_token_ids = [tokenizer.apply_chat_template(messages, add_generation_prompt=True) for messages in messages_list]
+sampling_params = vllm.SamplingParams(
+    temperature=0.5,
+    max_tokens=512,
+)
+outputs = llm.generate(prompt_token_ids=prompt_token_ids, sampling_params=sampling_params)
+for prompt, response in zip(prompts, outputs):
+    print("prompt:", prompt)
+    print("output:", response.outputs[0].text.strip())
+    print("-"*80)
+import json
+data = [{
+    "task_id": i,
+    "input": prompts[i],
+    "output": outputs[i].outputs[0].text.strip()
+} for i in range(len(datasets))]
+file_path = 'submmit.jsonl'
+with open(file_path, 'w', encoding='utf-8') as file:
+    for entry in data:
+        json.dump(entry, file, ensure_ascii=False)
+        file.write('\n')
+```
 # Uploaded  model
 - **Developed by:** bay-llm
+- **License:** gemma
 - **Finetuned from model :** unsloth/gemma-2-9b-bnb-4bit
 This gemma2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
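
Note on the dataset loader in the usage script: it joins lines until the accumulated text ends with `}`, which can trigger a parse too early when a record is wrapped right after a nested object. Below is a minimal sketch of a more tolerant reader, assuming every record is a single JSON object. `read_jsonl_records` is a hypothetical helper name and is not part of the model card or of any library; it simply retries `json.loads` until the buffer parses.

```python
import json
from typing import Iterable, List


def read_jsonl_records(lines: Iterable[str]) -> List[dict]:
    """Hypothetical helper: parse JSON objects that may span several lines."""
    records = []
    buffer = ""
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # blank lines carry no data
        buffer += line
        try:
            # Succeeds only once the buffer holds one complete JSON value.
            records.append(json.loads(buffer))
        except json.JSONDecodeError:
            continue  # record is still incomplete; keep accumulating
        buffer = ""
    if buffer:
        raise ValueError("input ended in the middle of a JSON record")
    return records


# Illustrative data only; the second record is wrapped after a nested object.
sample = [
    '{"task_id": 0, "input": "a"}',
    '{"task_id": 1,',
    '"meta": {"k": 1}',
    '}',
    '',
]
records = read_jsonl_records(sample)
print(len(records))
```

To use it with the script above, the loop inside the `with open(...)` block could be replaced by `datasets = read_jsonl_records(f)`, since a file object iterates over its lines.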