- SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
  Paper • 2401.09340 • Published • 18
- SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
  Paper • 2401.12168 • Published • 25
- Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding
  Paper • 2401.15708 • Published • 10
Nilesh Das
Nkr3ch17
AI & ML interests
NLP, Computer Vision, LLM
Organizations
None yet
Collections
1
Models
3
Datasets
None public yet