MMLab@NTU

university

https://www.mmlab-ntu.com/

MMLabNTU

Activity Feed Request to join this org

AI & ML interests

Computer Vision and Deep Learning

Recent Activity

liuziwei7 authored a paper 4 days ago

WHAC: World-grounded Humans and Cameras

liuziwei7 authored a paper 22 days ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

sczhou authored a paper 24 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

View all activity

mmlab-ntu's activity

liuziwei7

authored a paper 4 days ago

WHAC: World-grounded Humans and Cameras

Paper • 2403.12959 • Published Mar 19, 2024 • 1

liuziwei7

authored a paper 22 days ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published 22 days ago • 27

sczhou

authored a paper 24 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published Jan 24 • 30

PeiqingYang

authored a paper 24 days ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published Jan 24 • 30

OAOA

authored a paper about 1 month ago

Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration

Paper • 2406.18516 • Published Jun 26, 2024 • 3

liuziwei7

authored a paper about 1 month ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 24

yumingj

authored a paper about 1 month ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 83

hongfz16

authored a paper about 1 month ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20

liuziwei7

authored a paper about 1 month ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20

FrozenBurning

authored a paper about 1 month ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20

hzxie

authored a paper about 1 month ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20

liuziwei7

authored a paper about 1 month ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

Ziqi

authored a paper about 1 month ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

liangyuch

authored a paper about 1 month ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 50

liuziwei7

authored a paper about 2 months ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 25

ldkong

authored 5 papers about 2 months ago

FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation

Paper • 2312.04484 • Published Dec 7, 2023

LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Paper • 2501.04004 • Published Jan 7 • 1

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 25

LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving

Paper • 2501.04005 • Published Jan 7

OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies

Paper • 2501.00326 • Published Dec 31, 2024 • 1