Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
fla-hub
/
gsa-1.3B-100B
like
0
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
nielsr
HF staff
commited on
Sep 21
Commit
dfecf91
•
1 Parent(s):
2c8b93b
Link model to paper
Browse files
This PR links the model to the paper page.
Files changed (1)
hide
show
README.md
+1
-0
README.md
ADDED
Viewed
@@ -0,0 +1 @@
1
+
Model of the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).