-
SLM: Bridge the thin gap between speech and text foundation models
Paper • 2310.00230 • Published -
LLaSM: Large Language and Speech Model
Paper • 2308.15930 • Published • 31 -
SALMONN: Towards Generic Hearing Abilities for Large Language Models
Paper • 2310.13289 • Published • 18 -
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models
Paper • 2310.05863 • Published • 1
Tomasz Ziętkiewicz
TomaszZietkiewicz
AI & ML interests
NLP
Recent Activity
liked
a model
3 days ago
jinaai/ReaderLM-v2
upvoted
a
collection
8 months ago
mHuBERT-147 models
updated
a collection
8 months ago
Multimodal-Speech-LLMs
Organizations
Collections
1
Papers
1
models
None public yet
datasets
None public yet