Djuunaa

djuna

AI & ML interests

None yet

Recent Activity

liked a model 40 minutes ago
MadeAgents/Hammer2.1-7b
liked a model 42 minutes ago
pe-nlp/R1-Qwen2.5-7B-Instruct

Organizations

Djuna Test Lab

djuna's activity

reacted to csabakecskemeti's post with 👀 43 minutes ago
I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to meta-llama/Llama-3.1-8B-Instruct. At first glance, R1 does not beat Llama overall.

If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake somewhere, or is R1 (at least this distilled version) no better than the competition?

I'll run the same on the Qwen 7B distilled version too.
  • 2 replies
New activity in arcee-ai/mergekit-gui 1 day ago
reacted to chansung's post with 👍 2 days ago
New look for AI-powered paper reviews of the list from Hugging Face Daily Papers (managed by @akhaliq)

Bookmark the webpage, check out the comprehensive reviews by Google DeepMind Gemini 1.5, and listen to the audio podcast made with the same tech used in NotebookLM.

Link: https://deep-diver.github.io/ai-paper-reviewer/

This is not an official service by Hugging Face. It is just a service developed by an individual developer using his own money :)