Prince Canuma's picture

Prince Canuma

prince-canuma

·

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

arcee-ai/Virtuoso-Medium-v2-bf16-mlx

published a model 3 days ago

arcee-ai/Virtuoso-Medium-v2-bf16-mlx

updated a model 3 days ago

arcee-ai/Virtuoso-Medium-v2-8bit-mlx

View all activity

Organizations

prince-canuma's activity

upvoted a paper 10 days ago

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Paper • 2406.07394 • Published Jun 11, 2024 • 27

upvoted a paper 12 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 13 days ago • 63

upvoted 2 papers 14 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 18 days ago • 104

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published 17 days ago • 17

upvoted 7 papers 17 days ago

Multimodal LLMs Can Reason about Aesthetics in Zero-Shot

Paper • 2501.09012 • Published 19 days ago • 10

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published 19 days ago • 30

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 18 days ago • 36

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 18 days ago • 33

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published 18 days ago • 23

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 18 days ago • 47

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 18 days ago • 66

upvoted 6 papers 19 days ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 23 days ago • 29

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 26 days ago • 53

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 24 days ago • 80

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 21 days ago • 89

A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following

Paper • 2501.08187 • Published 20 days ago • 24

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 20 days ago • 271

upvoted a paper 20 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 24 days ago • 42

upvoted 2 papers 21 days ago

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published 27 days ago • 53

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 24 days ago • 59