39 129 551

Sayantan Das

ucalyptus

https://ucalyptus.me/

AI & ML interests

Generative Modeling

Recent Activity

upvoted an article about 8 hours ago

Remote VAEs for decoding with HF endpoints 🤗

liked a dataset 2 days ago

allenai/olmOCR-mix-0225

liked a model 2 days ago

allenai/olmOCR-7B-0225-preview

View all activity

Organizations

ucalyptus's activity

upvoted an article about 8 hours ago

Article

Remote VAEs for decoding with HF endpoints 🤗

5 days ago

• 30

upvoted a collection 6 days ago

GemmaX2

Collection

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated 22 days ago • 20

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

25 days ago

• 1.11k

upvoted 2 articles about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 400

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Jan 16

• 69

upvoted 2 papers about 1 month ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11 • 29

3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering

Paper • 2501.05131 • Published Jan 9 • 34

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 258

upvoted a collection 2 months ago

Sana

Collection

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 18 days ago • 88

upvoted 2 papers 2 months ago

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Paper • 2412.15214 • Published Dec 19, 2024 • 15

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 135

upvoted a collection 2 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 18 items • Updated Jan 10 • 85

upvoted an article 2 months ago

Article

Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well

•

Dec 2, 2024

• 18

upvoted an article 5 months ago

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

• 60

upvoted 2 papers 5 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 137

upvoted an article 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 223

upvoted a paper 5 months ago

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 10

upvoted 2 papers 6 months ago

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Paper • 2408.14717 • Published Aug 27, 2024 • 26

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123