Arunkumar Venkataramanan's picture

55 141

Arunkumar Venkataramanan

ArunkumarVR

·

https://arunkumarramanan.github.io

AI & ML interests

AGI Research: Reasoning, Safety & Alignment (Superalignment), Generative AI (GenAI), Multi-Modal Foundation Models (FMs), Large Language Models (LLMs), Transformers & Diffusion Models, Open LLM Training, Optimization & Finetuning, Serving & Inference

Recent Activity

upvoted a paper 1 day ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-Zero

upvoted a collection 5 days ago

View all activity

Organizations

ArunkumarVR's activity

upvoted a paper 1 day ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 3 days ago • 174

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 3 days ago • 5.42k • 431

upvoted a collection 5 days ago

DeepSeek-R1

8 items • Updated 5 days ago • 141

liked a model 5 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 3 days ago • 109k • 2.6k

upvoted a paper 7 days ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published 25 days ago • 14

liked a dataset 9 days ago

NovaSky-AI/Sky-T1_data_17k

Viewer • Updated 12 days ago • 16.4k • 3.17k • 151

liked a model 9 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 13 days ago • 11.4k • 511

liked a model 17 days ago

microsoft/phi-4-gguf

Text Generation • Updated 17 days ago • 35.8k • 70

upvoted a collection 17 days ago

Phi-4

Phi-4 small language model. • 2 items • Updated 17 days ago • 43

liked a model 17 days ago

microsoft/phi-4

Text Generation • Updated 17 days ago • 194k • 1.56k

liked a dataset about 1 month ago

HuggingFaceH4/MATH-500

Viewer • Updated Nov 15, 2024 • 500 • 12.2k • 50

liked a model about 1 month ago

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • Updated Nov 9, 2024 • 21.3k • 34

upvoted a collection about 1 month ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co./spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated 20 days ago • 22

liked 3 Spaces about 1 month ago

Running on CPU Upgrade

Anychat

Synthetic Data Generator

Build datasets using natural language

Scaling test-time compute

upvoted a collection about 1 month ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 18 days ago • 547

liked a model about 2 months ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 572k • • 1.75k

upvoted 2 collections about 2 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 115

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 19 days ago • 53