Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper • 2502.04328 • Published 7 days ago • 21
Unleashing Text-to-Image Diffusion Models for Visual Perception Paper • 2303.02153 • Published Mar 3, 2023
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting Paper • 2112.01518 • Published Dec 2, 2021
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery Paper • 2404.01424 • Published Apr 1, 2024
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models Paper • 2302.04867 • Published Feb 9, 2023
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation Paper • 2409.03755 • Published Sep 5, 2024 • 3
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution Paper • 2409.12961 • Published Sep 19, 2024 • 25
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation Paper • 2409.03755 • Published Sep 5, 2024 • 3