3 86 87

Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

upvoted an article 1 day ago

SigLIP 2: A better multilingual vision language encoder

reacted to AdinaY's post with 🔥 1 day ago

Wan2.1 🔥📹 new OPEN video model by Alibaba Wan team! Model: https://huggingface.co./Wan-AI/Wan2.1-T2V-14B Demo: https://huggingface.co./spaces/Wan-AI/Wan2.1 ✨Apache 2.0 ✨8.19GB VRAM, runs on most GPUs ✨Multi-Tasking: T2V, I2V, Video Editing, T2I, V2A ✨Text Generation: Supports Chinese & English ✨Powerful Video VAE: Encode/decode 1080P w/ temporal precision

reacted to burtenshaw's post with 🔥 2 days ago

Now the Hugging Face agent course is getting real! With frameworks like smolagents, LlamaIndex, and LangChain. 🔗 Follow the org for updates https://huggingface.co./agents-course This week we are releasing the first framework unit in the course and it’s on smolagents. This is what the unit covers: - why should you use smolagents vs another library? - how to build agents that use code - build multiagents systems - use vision language models for browser use The team has been working flat out on this for a few weeks. Led by @sergiopaniego and supported by smolagents author @m-ric.

View all activity

Organizations

theainerd's activity

upvoted an article 1 day ago

Article

SigLIP 2: A better multilingual vision language encoder

8 days ago

• 114

upvoted a paper 5 days ago

LightThinker: Thinking Step-by-Step Compression

Paper • 2502.15589 • Published 7 days ago • 25

upvoted 2 papers 6 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 8 days ago • 118

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 8 days ago • 167

upvoted 3 papers 8 days ago

upvoted a paper 12 days ago

Jailbreaking to Jailbreak

Paper • 2502.09638 • Published 19 days ago • 4

upvoted a paper 13 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 16 days ago • 142

upvoted an article 16 days ago

Article

Open-source DeepResearch – Freeing our search agents

25 days ago

• 1.11k

upvoted an article 17 days ago

Article

Open R1: Update #2

and 6 others •

18 days ago

• 191

upvoted a paper 21 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 23 days ago • 42

upvoted an article 24 days ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 400

upvoted an article 26 days ago

Article

Open-R1: Update #1

and 7 others •

27 days ago

• 289

upvoted a collection 28 days ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 12 items • Updated 8 days ago • 84

upvoted a paper 29 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 29 days ago • 56

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 782

upvoted a paper about 1 month ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 63

upvoted a collection about 1 month ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 2 days ago • 102

upvoted an article about 1 month ago

Article

We now support VLMs in smolagents!

Jan 24

• 86