71 1367 2125

taesiri PRO

taesiri

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

updated a dataset about 3 hours ago

taesiri/BugsBunny-ManualEvaluationSet

updated a dataset about 3 hours ago

taesiri/BugsBunny-ManualEval-IntermediateSet

updated a dataset about 4 hours ago

taesiri/PhotoEditBattleResults

View all activity

Organizations

taesiri's activity

upvoted a paper about 15 hours ago

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Paper • 2502.16645 • Published 5 days ago • 14

upvoted 2 papers about 16 hours ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 2 days ago • 47

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published 1 day ago • 31

upvoted a collection about 24 hours ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated about 9 hours ago • 83

upvoted 5 papers 1 day ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published 2 days ago • 34

upvoted 3 papers 3 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 3 days ago • 54

WebGames: Challenging General-Purpose Web-Browsing AI Agents

Paper • 2502.18356 • Published 3 days ago • 8

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 3 days ago • 61

upvoted 2 papers 4 days ago

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published 4 days ago • 48

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 8 days ago • 151

upvoted 4 papers 5 days ago

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

Paper • 2401.12168 • Published Jan 22, 2024 • 27

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Paper • 2502.15657 • Published 7 days ago • 4

Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence

Paper • 2502.14905 • Published 10 days ago • 9

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Paper • 2502.14397 • Published 8 days ago • 35

upvoted a paper 7 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 8 days ago • 167

upvoted a paper 8 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 8 days ago • 56