Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
jonathanjordan21
/
mos-mamba-18x130m-trainer-dgx-pile
like
0
Text Generation
Transformers
TensorBoard
Safetensors
MoSMamba
conversational
custom_code
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
No model card
New: Create and edit this model card directly on the website!
Contribute a Model Card
Downloads last month
18
Safetensors
Model size
180M params
Tensor type
F32
·
Inference Examples
Text Generation
Inference API (serverless) does not yet support model repos that contain custom code.