Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper • 2412.09645 • Published Dec 10, 2024 • 35
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Paper • 2411.16856 • Published Nov 25, 2024 • 13
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation Paper • 2303.13873 • Published Mar 24, 2023
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance Paper • 2403.12409 • Published Mar 19, 2024 • 10
MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Paper • 2410.16272 • Published Oct 21, 2024
MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era Paper • 2406.09121 • Published Jun 13, 2024 • 1
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention Paper • 2406.12718 • Published Jun 18, 2024 • 1
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Paper • 2309.13505 • Published Sep 24, 2023
Color Space Learning for Cross-Color Person Re-Identification Paper • 2405.09487 • Published May 15, 2024
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining Paper • 2401.08407 • Published Jan 16, 2024
Towards smaller, faster decoder-only transformers: Architectural variants and their implications Paper • 2404.14462 • Published Apr 22, 2024 • 1
LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency Paper • 2404.12872 • Published Apr 19, 2024 • 12
InsActor: Instruction-driven Physics-based Characters Paper • 2312.17135 • Published Dec 28, 2023 • 10
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Paper • 2310.05922 • Published Oct 9, 2023 • 4