Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
JuncaiL
/
llama-8x265m-moe
like
2
Text Generation
Transformers
PyTorch
wikipedia
allenai/c4
English
llama_moe
MoE
custom_code
arxiv:
2305.09781
Model card
Files
Files and versions
Community
Train
Use this model
main
llama-8x265m-moe
Commit History
Update README.md
8ebafba
verified
JuncaiL
commited on
Mar 25
Update README.md
78ec593
verified
JuncaiL
commited on
Mar 25
Upload README.md
b4d8e93
verified
JuncaiL
commited on
Mar 25
fix state_dict loading in MoE model
d8d97b0
verified
JuncaiL
commited on
Mar 25
update config.json
6c4b1d0
verified
JuncaiL
commited on
Mar 25
upload llama-8x265m-moe model checkpoint
1ffa590
verified
JuncaiL
commited on
Mar 24
initial commit
5ad6a7e
verified
JuncaiL
commited on
Mar 24