-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper โข 2501.04227 โข Published โข 81 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper โข 2501.05366 โข Published โข 80 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper โข 2501.11425 โข Published โข 74 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper โข 2501.10893 โข Published โข 22
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
upvoted
a
collection
about 8 hours ago
Qwen2.5-1M
upvoted
an
article
about 13 hours ago
We now support VLMs in smolagents!
Organizations
Collections
4
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper โข 2412.06769 โข Published โข 75 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper โข 2408.03314 โข Published โข 54 -
Evolving Deeper LLM Thinking
Paper โข 2501.09891 โข Published โข 97 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper โข 2501.12599 โข Published โข 56
models
2
datasets
None public yet