Arkajyoti Mitra

aeros93

AI & ML interests

Deep Learning

Recent Activity

upvoted a paper about 2 months ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

liked a Space 2 months ago

akhaliq/anychat

upvoted a paper 2 months ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

View all activity

Organizations

aeros93's activity

upvoted a paper about 2 months ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Paper • 2411.18613 • Published Nov 27, 2024 • 50

liked a Space 2 months ago

Running on CPU Upgrade

1.53k

🏢

Anychat

upvoted a paper 2 months ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 72

upvoted an article 7 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 184

upvoted an article 8 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 234

upvoted an article 9 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 246

upvoted a collection 9 months ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 35

upvoted an article 9 months ago

Article

seemore: Implement a Vision Language Model from Scratch

•

Jun 23, 2024

• 70

upvoted a paper about 1 year ago

LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes

Paper • 2311.13384 • Published Nov 22, 2023 • 51