AI & ML interests

None defined yet.

Recent Activity

RWKV's activity

BlinkDL 
posted an update 7 days ago
view post
Post
1351
RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba
SmerkyG 
in RWKV/v6-Finch-7B-HF about 1 month ago

Update README.md

#1 opened 4 months ago by
SmerkyG
BlinkDL 
posted an update about 1 month ago
view post
Post
3938
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
BlinkDL 
posted an update 3 months ago
xianbao 
posted an update 4 months ago
view post
Post
1704
With the open-weight release of CogVideoX-5B from THUDM, i.e. GLM team, the Video Generation Model (how about calling it VGM) field has officially became the next booming "LLM"

What does the landscape look like? What are other video generation models? This collection below is all your need.

xianbao/video-generation-models-66c350163c74f60f5c412af6

The above video is generated by @a-r-r-o-w with CogVideoX-5B, taken from a nice lookout for the field!
ybelkada 
posted an update 4 months ago
ybelkada 
posted an update 4 months ago