-
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Paper • 2403.09622 • Published • 17 -
TableGPT2: A Large Multimodal Model with Tabular Data Integration
Paper • 2411.02059 • Published • 5 -
POINTS1.5: Building a Vision-Language Model towards Real World Applications
Paper • 2412.08443 • Published • 38
Ming
nodejs
AI & ML interests
None yet
Recent Activity
liked
a model
23 days ago
NovaSky-AI/Sky-T1-32B-Preview
liked
a model
24 days ago
hexgrad/Kokoro-82M
liked
a model
about 1 month ago
jianzongwu/DiffSensei
Organizations
None yet
Collections
1
spaces
1
models
None public yet
datasets
None public yet