VPO Team

visual-preference's activity

RunpeiDong

authored a paper about 7 hours ago

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Paper • 2502.13143 • Published about 21 hours ago • 25

jyzhang1208

authored 4 papers 1 day ago

Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design

Paper • 2311.00462 • Published Nov 1, 2023

Offline Meta Reinforcement Learning with In-Distribution Online Adaptation

Paper • 2305.19529 • Published May 31, 2023

Symmetry-Aware Robot Design with Structured Subgroups

Paper • 2306.00036 • Published May 31, 2023 • 1

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published 6 days ago • 29

RunpeiDong

authored a paper 1 day ago

Learning Getting-Up Policies for Real-World Humanoid Robots

Paper • 2502.12152 • Published 2 days ago • 34

RunpeiDong

authored a paper 28 days ago

Taming Teacher Forcing for Masked Autoregressive Video Generation

Paper • 2501.12389 • Published 29 days ago • 10

jyzhang1208

authored a paper 4 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

RunpeiDong

authored a paper 8 months ago

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 55

RunpeiDong

authored 9 papers 11 months ago

PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection

Paper • 2205.11098 • Published May 23, 2022

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Paper • 2402.17766 • Published Feb 27, 2024

Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining

Paper • 2302.02318 • Published Feb 5, 2023

CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP

Paper • 2303.04748 • Published Mar 8, 2023

Contrastive Deep Supervision

Paper • 2207.05306 • Published Jul 12, 2022

Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks

Paper • 2112.15139 • Published Dec 30, 2021

VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation

Paper • 2307.16605 • Published Jul 28, 2023

Exploring Recurrent Long-term Temporal Fusion for Multi-view 3D Perception

Paper • 2303.05970 • Published Mar 10, 2023

Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

Paper • 2212.08320 • Published Dec 16, 2022

RunpeiDong

authored 2 papers over 1 year ago

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning

Paper • 2307.09474 • Published Jul 18, 2023 • 1

DreamLLM: Synergistic Multimodal Comprehension and Creation

Paper • 2309.11499 • Published Sep 20, 2023 • 58