Cly
Akikaaa
AI & ML interests
None yet
Recent Activity
Updated a model 8 days ago: Akikaaa/ppo-LunarLander-v2
New activity 13 days ago: Qwen/Qwen2.5-0.5B-Instruct: Does this model apply SFT or SFT+RL during post-training?
Organizations
None yet
Akikaaa's activity
Does this model apply SFT or SFT+RL during post-training?
#8 opened 13 days ago by Akikaaa