34 6

Aramis

amenur

AI & ML interests

None yet

Recent Activity

Organizations

None yet

amenur's activity

upvoted an article about 11 hours ago
Article

Open-source DeepResearch – Freeing our search agents

• 307
upvoted an article about 14 hours ago
Article

Introducing smolagents: simple agents that write actions in code.

• 555
upvoted an article 1 day ago
upvoted an article 8 days ago
Article

Open-R1: a fully open reproduction of DeepSeek-R1

• 626
upvoted an article 29 days ago
Article

Superposition in Transformers: A Novel Way of Building Mixture of Experts

By BenChaliah •
• 14
reacted to lewtun's post with 🚀🔥 about 2 months ago
Post
6824
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
  • 2 replies
upvoted an article 4 months ago
Article

Llama can now see and run on your device - welcome Llama 3.2

• 182
upvoted 2 articles 5 months ago
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

• 216
Article

Scaling robotics datasets with video encoding

• 36
upvoted 2 articles 6 months ago
Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By mlabonne •
• 268
Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

• 63
upvoted 2 articles 8 months ago
Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

By m-ric •
• 26