xiaol
/

rwkv-7B-world-novel-128k

xiaol commited on Aug 10, 2023

Commit

759a8f9

•

1 Parent(s): 76e851d

Update README.md

![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/F9unOJfhmJPXsciPHLsrl.png)

Files changed (1) hide show

README.md CHANGED Viewed

@@ -9,6 +9,8 @@ We proudly announce this is the world first 128k context model based on RWKV arc
 This model trained with instructions datasets and chinese web novel and tradition wuxia,
 more trainning details would be updated.
 Full finetuned using this repo to train 128k context model , 4*A800 40hours with 1.3B tokens.
 https://github.com/SynthiaDL/TrainChatGalRWKV/blob/main/train_world.sh
@@ -26,4 +28,7 @@ Using RWKV Runner https://github.com/josStorer/RWKV-Runner  to test this ， use
 ![QQ图片20230810143840.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/LgEjfHJ7XD7PlGM9b3RAf.png)
-![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/b_6KCBdZKW7Q7HwipxE-l.png)

 This model trained with instructions datasets and chinese web novel and tradition wuxia,
 more trainning details would be updated.
+Test input 67k tokens  to summary ,can find in example folders ,more cases are coming.
 Full finetuned using this repo to train 128k context model , 4*A800 40hours with 1.3B tokens.
 https://github.com/SynthiaDL/TrainChatGalRWKV/blob/main/train_world.sh
 ![QQ图片20230810143840.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/LgEjfHJ7XD7PlGM9b3RAf.png)
+![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/b_6KCBdZKW7Q7HwipxE-l.png)
+67k input test
+![image.png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/F9unOJfhmJPXsciPHLsrl.png)