license: apache-2.0 | |
a LoRA fine-tuned model with Japanese dataset | |
LoRA Experiment | |
rwkv-7b-jp-280.pth is merged model with base | |
Base Model | |
https://huggingface.co./BlinkDL/rwkv-4-raven/tree/main | |
RWKV-4-Raven-7B-v10-Eng89%25-Jpn10%25-Other1%25-20230420-ctx4096.pth | |
Parameters: | |
Lora Rank 1024 | |
Lora Alpha 2048 | |
ctx length 1024 | |
Lora Size: almost 1.6GB | |
Dataset | |
https://huggingface.co./datasets/kunishou/hh-rlhf-49k-ja | |
https://huggingface.co./datasets/kunishou/databricks-dolly-15k-ja | |