Po Hsiang Yu

EasyMoneySniper66

AI & ML interests

None yet

Organizations

None yet

Collections 4

Multi-modality LVM

VoCo-LLaMA: Towards Vision Compression with Large Language Models
Paper • 2406.12275 • Published Jun 18, 2024 • 31

TroL: Traversal of Layers for Large Language and Vision Models
Paper • 2406.12246 • Published Jun 18, 2024 • 35

Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Paper • 2406.15334 • Published Jun 21, 2024 • 9

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Paper • 2406.12742 • Published Jun 18, 2024 • 15

Multi-modality LVM Datasets

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs
Paper • 2406.11833 • Published Jun 17, 2024 • 63

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Paper • 2406.11230 • Published Jun 17, 2024 • 34

Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models
Paper • 2406.14035 • Published Jun 20, 2024 • 13

Needle In A Multimodal Haystack
Paper • 2406.07230 • Published Jun 11, 2024 • 53

Models

None public yet

Datasets

None public yet
