Image-to-Text
Transformers
PyTorch
phi3_v
text-generation
latex
custom_code

Model Summary

Cephalo is a series of multimodal materials science focused vision large language models (V-LLMs) designed to integrate visual and linguistic data for advanced understanding and interaction in human-AI or multi-agent AI frameworks.

image/png

Model Capabilities

This version of Cephalo, lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha, is trained to convert images of equations to LaTeX code.

Downloads last month
37
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.

Datasets used to train lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha

Collection including lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha