VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 12 days ago • 78
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances Paper • 2410.18775 • Published Oct 24, 2024 • 9
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition Paper • 2307.12493 • Published Jul 24, 2023
Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study Paper • 2305.13860 • Published May 23, 2023
Prompt Injection attack against LLM-integrated Applications Paper • 2306.05499 • Published Jun 8, 2023 • 1
Efficient Detection of Toxic Prompts in Large Language Models Paper • 2408.11727 • Published Aug 21, 2024 • 13
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Paper • 2311.14521 • Published Nov 24, 2023
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning Paper • 2311.18651 • Published Nov 30, 2023
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis Paper • 2308.11473 • Published Aug 22, 2023
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies Paper • 2403.01422 • Published Mar 3, 2024 • 27
Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation Paper • 2404.15506 • Published Mar 22, 2024
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models Paper • 2405.20853 • Published May 31, 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Paper • 2406.09162 • Published Jun 13, 2024 • 13
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers Paper • 2406.10163 • Published Jun 14, 2024 • 33
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization Paper • 2408.02555 • Published Aug 5, 2024 • 29
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages Paper • 2407.19672 • Published Jul 29, 2024 • 56
Towards smaller, faster decoder-only transformers: Architectural variants and their implications Paper • 2404.14462 • Published Apr 22, 2024 • 1