muziyongshixin
AI & ML interests
None yet
Recent Activity
new activity
9 days ago
cognitivecomputations/DeepSeek-R1-AWQ: Can anyone run this model with the SGLang framework?
new activity
21 days ago
cognitivecomputations/DeepSeek-R1-AWQ: Deployment framework
Organizations
None yet
muziyongshixin's activity
Can anyone run this model with the SGLang framework?
2
#13 opened 9 days ago by muziyongshixin
MLA is not supported with moe_wna16 quantization. Disabling MLA.
5
#7 opened 17 days ago by AMOSE
Deployment framework
27
#2 opened about 1 month ago by xro7
What is the difference between this model and 01-ai/Yi-1.5-34B?
3
#2 opened 9 months ago by muziyongshixin
Why do the im_start and im_end token IDs exceed tokenizer.voc_size?
1
#36 opened 11 months ago by muziyongshixin
How does MergeKit's MoE integration work?
6
#8 opened about 1 year ago by arhanovich
What is the difference between this model and bigscience/bloomz-7b1-mt?
1
#2 opened almost 2 years ago by muziyongshixin