Djuunaa

djuna

AI & ML interests

None yet

djuna's activity

liked a model 40 minutes ago

MadeAgents/Hammer2.1-7b

Updated Dec 25, 2024 • 483 • 12

liked a model 41 minutes ago

WhiteRabbitNeo/WhiteRabbitNeo-2.5-Qwen-2.5-Coder-7B

Text Generation • Updated Oct 9, 2024 • 411 • 42

liked a model 42 minutes ago

pe-nlp/R1-Qwen2.5-7B-Instruct

Text Generation • Updated about 23 hours ago • 2 • 2

reacted to csabakecskemeti's post with 👀 43 minutes ago

Post

300

I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to meta-llama/Llama-3.1-8B-Instruct. At first glance, R1 does not beat Llama overall.

If anyone wants to double check the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake, or is this distilled version, at least, no better than the competition?

I'll run the same on the Qwen 7B distilled version too.

2 replies

updated a model about 1 hour ago

djuna/MN-Chinofun-12B-4-Q6_K-GGUF

Updated about 1 hour ago

published a model about 1 hour ago

djuna/MN-Chinofun-12B-4-Q6_K-GGUF

Updated about 1 hour ago

updated a model about 1 hour ago

djuna/MN-Chinofun-12B-3

Text Generation • Updated Dec 6, 2024 • 12 • 3

liked a model about 3 hours ago

Delta-Vector/Rei-12B

Text Generation • Updated about 15 hours ago • 15 • 7

liked 4 models about 21 hours ago

New activity in arcee-ai/mergekit-gui 1 day ago

Error: Unimplemented merge method sce

#35 opened 1 day ago by

xi0v

liked a model 1 day ago

tencent/Hunyuan-7B-Instruct

Text Generation • Updated 2 days ago • 196 • 33

upvoted a collection 1 day ago

AI4Privacy_v2

Collection

Collection for AI4Privacy Version 2 trained on PII200k • 6 items • Updated Sep 25, 2024 • 4

reacted to chansung's post with 👍 2 days ago

Post

1516

New look for AI-powered paper reviews drawn from the Hugging Face Daily Papers list (managed by @akhaliq).

Bookmark the webpage, check the comprehensive reviews by Google DeepMind Gemini 1.5, and listen to the audio podcast made with the same tech used in NotebookLM.

Link: https://deep-diver.github.io/ai-paper-reviewer/

This is not an official service by Hugging Face. It is just a service developed by an individual developer using his own money :)

updated a model 3 days ago

djuna/Q2.5-KwK-7B-Q6_K-GGUF

Updated 3 days ago • 17

published a model 3 days ago

djuna/Q2.5-KwK-7B-Q6_K-GGUF

Updated 3 days ago • 17

liked 2 models 3 days ago

rd690/rdm-animals

Text-to-Image • Updated Mar 17, 2024 • 54 • 2

EvaByte/EvaByte-SFT

Updated 5 days ago • 139 • 29