potter xu
xxllp
AI & ML interests
None yet
Organizations
None yet
xxllp's activity
ft base model
1
#2 opened about 1 month ago
by
xxllp
train code
2
#1 opened about 2 months ago
by
xxllp
different between DeepSeek-V2-Chat-0628 and Deepseek-v2-API-0628
#2 opened 2 months ago
by
xxllp
模型太耗内存了,有量化版本吗?flashatt是不是可以关闭,对显卡限制太多
8
#3 opened 3 months ago
by
fukai
Actual dataset size?
3
#4 opened 4 months ago
by
jlzhou