assafbk commited on
Commit
f9825e5
1 Parent(s): 18093cc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -5,4 +5,23 @@ tags:
5
  - text-generation
6
  - mamba
7
  - long context
8
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  - text-generation
6
  - mamba
7
  - long context
8
+ ---
9
+
10
+ # DeciMamba Checkpoint
11
+
12
+ The official checkpoint of DeciMamba-130m, finetuned for the Passkey Retrieval task as presented in [DeciMamba: Exploring the Length Extrapolation Potential of Mamba](https://arxiv.org/abs/2406.14528).
13
+
14
+ See our [Github Repo](https://assafbk.github.io/mocha/) for evalution and training scripts.
15
+
16
+ Bibtex:
17
+ ```
18
+ @misc{benkish2024decimambaexploringlengthextrapolation,
19
+ title={DeciMamba: Exploring the Length Extrapolation Potential of Mamba},
20
+ author={Assaf Ben-Kish and Itamar Zimerman and Shady Abu-Hussein and Nadav Cohen and Amir Globerson and Lior Wolf and Raja Giryes},
21
+ year={2024},
22
+ eprint={2406.14528},
23
+ archivePrefix={arXiv},
24
+ primaryClass={cs.LG},
25
+ url={https://arxiv.org/abs/2406.14528},
26
+ }
27
+ ```