license: apache-2.0
A model fine-tuned with LoRA on Japanese datasets.
LoRA Experiment
rwkv-7b-jp-280.pth is the model obtained by merging the LoRA weights into the base model.
Base Model
https://huggingface.co./BlinkDL/rwkv-4-raven/tree/main
RWKV-4-Raven-7B-v10-Eng89%-Jpn10%-Other1%-20230420-ctx4096.pth
Parameters: LoRA rank 1024, LoRA alpha 2048, context length 1024. LoRA size: almost 1.6 GB.
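To illustrate what the rank and alpha values imply when LoRA weights are merged into a base model, here is a minimal sketch of the standard LoRA merge rule, W_merged = W + (alpha / rank) * (B @ A). It is an assumption that this checkpoint was merged this way; the helper names `matmul` and `merge_lora` and the toy matrices are hypothetical and only serve the illustration. With rank 1024 and alpha 2048, the scale factor works out to 2.

```python
# Minimal sketch of the standard LoRA merge rule (assumed, not confirmed
# to be the exact procedure used for rwkv-7b-jp-280.pth):
#   W_merged = W + (alpha / rank) * (B @ A)

LORA_RANK = 1024   # from the parameters listed above
LORA_ALPHA = 2048  # from the parameters listed above


def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)]
            for row in x]


def merge_lora(w, lora_a, lora_b, alpha, rank):
    """Return w + (alpha / rank) * (lora_b @ lora_a) without mutating w."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [[wv + scale * dv for wv, dv in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


# Toy example: a 2x2 weight with rank-1 factors. The real factors would
# have inner dimension 1024; only the alpha / rank ratio is reused here.
w = [[1.0, 0.0], [0.0, 1.0]]
lora_b = [[1.0], [2.0]]    # shape (out=2, r=1)
lora_a = [[0.5, 0.25]]     # shape (r=1, in=2)

merged = merge_lora(w, lora_a, lora_b, LORA_ALPHA, LORA_RANK)
print(merged)
```

Because alpha is twice the rank, the low-rank update B @ A is added at double strength. Once merged, the checkpoint has the same shape as the base model and needs no separate LoRA file at inference time.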
Dataset
https://huggingface.co./datasets/kunishou/hh-rlhf-49k-ja
https://huggingface.co./datasets/kunishou/databricks-dolly-15k-ja