Nielly's picture

20 35

Nielly

Nielly

·

AI & ML interests

None yet

Recent Activity

reacted to onekq's post with 👀 2 days ago

Huge disappointment to Claude Sonnet 3.7 😞 Big performance regression. Worse than the June version in 2024. 👎 https://huggingface.co./spaces/onekq-ai/WebApp1K-models-leaderboard I'm sure though this version improves on something, only not the thing my leaderboard measures. This proves the point that no model can be the best on everything.

upvoted a paper 2 days ago

Beyond Release: Access Considerations for Generative AI Systems

liked a Space 3 days ago

Wan-AI/Wan2.1

View all activity

Organizations

None yet

Nielly's activity

upvoted a paper 2 days ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published 5 days ago • 9

upvoted 4 articles 24 days ago

Article

Open-R1: Update #1

By

and 7 others •

27 days ago

• 289

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 782

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 69

Article

Open-source DeepResearch – Freeing our search agents

25 days ago

• 1.11k

upvoted a collection 30 days ago

DeepSeek-R1-abliterated

7 items • Updated 29 days ago • 92

upvoted 3 papers about 1 month ago

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 62

upvoted 2 collections about 1 month ago

2025 January Papers 🧐

10 items • Updated Jan 28 • 5

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 8 items • Updated 5 days ago • 379

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 335

upvoted a collection about 1 month ago

2025 January

33 items • Updated about 1 month ago • 13

upvoted a paper about 1 month ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

upvoted an article about 1 month ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 151

upvoted 3 papers about 2 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 84

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11 • 29

upvoted a paper 2 months ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97

upvoted a collection 2 months ago

DeepSeek-V3

3 items • Updated Jan 6 • 191