Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AndreaUnibo
/
JetMoE_rank_lstm_full_trained_depth3_n2
like
0
Text Generation
Transformers
Safetensors
jetmoe
trl
sft
Inference Endpoints
4-bit precision
bitsandbytes
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
JetMoE_rank_lstm_full_trained_depth3_n2
Commit History
Upload tokenizer
3424b4e
verified
AndreaUnibo
commited on
Sep 17, 2024
Upload JetMoEForCausalLM
fb94a79
verified
AndreaUnibo
commited on
Sep 17, 2024
initial commit
6f368b0
verified
AndreaUnibo
commited on
Sep 17, 2024