xiaol/rwkv-7B-world-novel-128k


xiaol committed on Aug 10, 2023

Commit 524a6e3 (1 parent: b394794)

Update README.md

Files changed (1)

README.md +7 -1

@@ -4,12 +4,16 @@ datasets:
 - Norquinal/claude_multiround_chat_30k
 - OpenLeecher/Teatime
 ---
-Today, 2023-08-10, we proudly announce the world's first 128k context model based on the RWKV architecture.
 With the RWKV world tokenizer, multiple languages tokenize at a 1:1 ratio: one word maps to one token.
 (https://github.com/BlinkDL/ChatRWKV/blob/2a13ddecd81f8fd615b6da3a8f1091a594689e30/tokenizer/rwkv_tokenizer.py#L163)
 This model was trained on instruction datasets, Chinese web novels, and traditional wuxia fiction;
 more training details will be added later.
@@ -21,6 +25,8 @@ https://github.com/SynthiaDL/TrainChatGalRWKV/blob/main/train_world.sh
 ![QQ图片20230810153529.jpg](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/d8ekmc4Lfhy2lYEdrRKXz.jpeg)
 Use RWKV Runner (https://github.com/josStorer/RWKV-Runner) to test this model. It needs only 16 GB of VRAM to run in fp16, or 8 GB in fp16i8. Use temperature 0.1-0.2 with top_p 0.7 for more precise answers; temperatures between 1 and 2.x give more creative output.
 ![微信截图_20230810162303.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/Ww45-WMngl4Jyt1OZDAa_.png)

 - Norquinal/claude_multiround_chat_30k
 - OpenLeecher/Teatime
 ---
+# RWKV 7B World 128k for novel writing
+Today, 2023-08-10, we proudly announce the world's first **128k context** model based on the RWKV architecture.
 With the RWKV world tokenizer, multiple languages tokenize at a 1:1 ratio: one word maps to one token.
 (https://github.com/BlinkDL/ChatRWKV/blob/2a13ddecd81f8fd615b6da3a8f1091a594689e30/tokenizer/rwkv_tokenizer.py#L163)
+# How to train an infinite-context model?
 This model was trained on instruction datasets, Chinese web novels, and traditional wuxia fiction;
 more training details will be added later.
 ![QQ图片20230810153529.jpg](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/d8ekmc4Lfhy2lYEdrRKXz.jpeg)
+# How to test?
 Use RWKV Runner (https://github.com/josStorer/RWKV-Runner) to test this model. It needs only 16 GB of VRAM to run in fp16, or 8 GB in fp16i8. Use temperature 0.1-0.2 with top_p 0.7 for more precise answers; temperatures between 1 and 2.x give more creative output.
 ![微信截图_20230810162303.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/Ww45-WMngl4Jyt1OZDAa_.png)
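
The temperature and top_p settings recommended in the README can be illustrated with a small, generic sampler. This is only a sketch of standard temperature scaling plus nucleus (top-p) filtering in plain Python; it is not taken from RWKV Runner, and the function name `sample_top_p` and the toy logits are hypothetical. It is meant to show why a low temperature with top_p 0.7 tends to give near-deterministic answers, while a temperature of 1 to 2 lets more candidate tokens through.

```python
import math
import random


def sample_top_p(logits, temperature=1.0, top_p=0.7, rng=random):
    """Pick a token index from raw logits (hypothetical helper, for illustration)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Divide logits by the temperature, then apply a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Keep the smallest set of most-likely tokens whose cumulative mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    # Draw one of the kept tokens in proportion to its probability.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]


# Toy logits for a four-token vocabulary (made-up values).
toy_logits = [2.0, 1.5, 1.0, 0.5]
precise = sample_top_p(toy_logits, temperature=0.1, top_p=0.7)
creative = sample_top_p(toy_logits, temperature=2.0, top_p=0.7)
print(precise, creative)
```

With these toy logits, temperature 0.1 concentrates almost all probability on the first token, so only that token survives the top_p cut. At temperature 2.0 the distribution flattens and the three most likely tokens remain candidates, which is the "more creative" regime the README describes.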