QuantPanda

QuantPanda

AI & ML interests

None yet

Recent Activity

Organizations

None yet

QuantPanda's activity

reacted to lewtun's post with ๐Ÿ”ฅ 7 days ago
view post
Post
3178
I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive!

https://x.com/casper_hansen_/status/1875872309996855343

Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025!

[1] Training Large Language Models to Reason in a Continuous Latent Space (2412.06769)
[2] https://huggingface.co./blog/ganqu/prime
New activity in FreedomIntelligence/HuatuoGPT-o1-7B 7 days ago

Update

3
#1 opened 7 days ago by
QuantPanda
New activity in deepseek-ai/DeepSeek-V3 7 days ago

Update README.md

1
#37 opened 10 days ago by
TomGrc
reacted to davidberenstein1957's post with ๐Ÿš€ 8 days ago
replied to davidberenstein1957's post 8 days ago
view reply

This is the kind of write up that I needed, thank you

GGUF

4
#1 opened 27 days ago by
MikeLightheart