Yikang Shen PRO
YikangS
AI & ML interests
None yet
Recent Activity
liked
a model
about 2 months ago
ibm-granite/granite-3.0-8b-instruct
Organizations
YikangS's activity
When can we have the training code as illustrated in the paper.
12
#5 opened 8 months ago
by
Shamane
why not include Qwen1.5-MoE-A2.7B in the table?
1
#4 opened 9 months ago
by
J22
Dataset?
3
#1 opened 9 months ago
by
0xbitches
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot