Jiaming Han

csuhan

https://csuhan.com

AI & ML interests

Computer Vision

Recent Activity

liked a dataset 1 day ago

vivym/midjourney-prompts

liked a dataset 1 day ago

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

liked a dataset 1 day ago

drawthingsai/megalith-10m

Organizations

None yet

csuhan's activity

liked 3 datasets 1 day ago

upvoted a paper 25 days ago

Diffusion Adversarial Post-Training for One-Step Video Generation

Paper • 2501.08316 • Published 26 days ago • 32

upvoted a paper 27 days ago

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published about 1 month ago • 31

upvoted a paper about 2 months ago

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Paper • 2412.18597 • Published Dec 24, 2024 • 19

upvoted a paper 2 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 23

authored a paper 2 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 23

upvoted 3 papers 4 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 67

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 80

authored a paper 4 months ago

Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published Oct 17, 2024 • 8

upvoted a paper 4 months ago

Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published Oct 17, 2024 • 8

updated a model 4 months ago

csuhan/temp

Updated Oct 9, 2024

updated a model 5 months ago

csuhan/t2i

Updated Sep 27, 2024

updated a model 6 months ago

csuhan/LLaVA_EF

Updated Aug 14, 2024

liked 2 models 7 months ago

stabilityai/stable-diffusion-2-1-unclip

Text-to-Image • Updated Apr 12, 2023 • 15.9k • 280

Intel/llava-gemma-2b

Image-Text-to-Text • Updated Jun 11, 2024 • 4.91k • 43

liked a dataset 7 months ago

UCSC-VLAA/Recap-DataComp-1B

Viewer • Updated Jan 9 • 1.88B • 3.78k • 165

updated a model 8 months ago

csuhan/OneLLM-7B-x-text

Updated Jun 27, 2024