Simeon Emanuilov PRO

s-emanuilov

AI & ML interests

Software Engineer & Ph.D. candidate | Specializing in ML/DL system development & applying AI to solve real-world business problems.

Recent Activity

liked a model 7 days ago

unsloth/Mistral-Small-24B-Instruct-2501-GGUF

liked a model 7 days ago

nomic-ai/nomic-embed-text-v2-moe

upvoted an article 7 days ago

Merge Large Language Models with mergekit

s-emanuilov's activity

upvoted an article 7 days ago

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 93

upvoted a paper 9 days ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published 12 days ago • 46

upvoted a paper 10 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 96

upvoted a paper 14 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 71

upvoted an article 14 days ago

Open-source DeepResearch – Freeing our search agents

Article • 15 days ago • 1.03k

upvoted a collection 16 days ago

llama.vim

Collection

upvoted an article 17 days ago

Finally, a Replacement for BERT: Introducing ModernBERT

Article • Dec 19, 2024 • 544

upvoted an article 22 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 22 days ago • 378

upvoted a collection 24 days ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 24 days ago • 100

upvoted an article 27 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • 27 days ago • 139

upvoted a paper 27 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 28 days ago • 323

upvoted 2 papers 28 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 30 days ago • 91

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Paper • 2501.12273 • Published 29 days ago • 14

upvoted an article 30 days ago

Yay! Organizations can now publish blog Articles

Article • and 3 others • 30 days ago • 34

upvoted a collection 30 days ago

DeepSeek R1 (All Versions)

Collection

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 11 days ago • 188

upvoted a paper 30 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

upvoted a collection about 1 month ago

Jan 17 Releases ❄️

Collection

Models and datasets of the second week of Jan 2025. • 23 items • Updated Jan 17 • 11

upvoted 3 papers about 1 month ago