Yuxuan Wang PRO

ColorfulAI

https://patrick-tssn.github.io/

patrick-tssn

AI & ML interests

Multimodal Learning

Recent Activity

updated a dataset 1 day ago

ColorfulAI/NeedleInAVideoHaystack

authored a paper 8 days ago

VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions

authored a paper 8 days ago

Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation

Papers 13

arxiv:2501.05037

arxiv:2412.17295

arxiv:2411.17991

arxiv:2409.01151

Models 3

ColorfulAI/videollamb-llava-1.5-7b

Video-Text-to-Text • Updated Sep 9, 2024 • 55 • 4

ColorfulAI/videollamb-mem-llava-1.5-7b

Updated Aug 12, 2024 • 4

ColorfulAI/LSTP-Chat

Image-Text-to-Text • Updated Aug 2, 2024 • 4

Datasets 3

ColorfulAI/NeedleInAVideoHaystack

Viewer • Updated 1 day ago • 21 • 10

ColorfulAI/EgoPlan_test

Viewer • Updated Sep 15, 2024 • 923 • 140

ColorfulAI/VideoLLaMB-IT

Viewer • Updated Aug 12, 2024 • 1.03M • 52