8 7 2

Jiahang Xu

Jiahang

JiahangXu

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Organizations

Jiahang's activity

upvoted a paper 3 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 3 days ago • 176

upvoted a paper 4 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 67

upvoted 2 papers 5 months ago

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 64

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 39

authored 4 papers 5 months ago

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference

Paper • 2303.08308 • Published Mar 15, 2023 • 1

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices

Paper • 2303.09730 • Published Mar 17, 2023 • 1

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models

Paper • 2310.05015 • Published Oct 8, 2023 • 1

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 67

upvoted a paper 5 months ago

Language Models as Black-Box Optimizers for Vision-Language Models

Paper • 2309.05950 • Published Sep 12, 2023 • 4

authored a paper 9 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 254

upvoted a collection 9 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 701

liked a Space 10 months ago

Running on CPU Upgrade

12.2k

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

authored a paper 11 months ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 114

upvoted a paper 11 months ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 114

liked a Space over 1 year ago

Running

118

📝

LLMLingua

authored a paper over 1 year ago

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference

Paper • 2306.14393 • Published Jun 26, 2023

updated a Space over 2 years ago

Runtime error

🌖