Update README.md
README.md CHANGED
@@ -22,6 +22,21 @@ Initial subjective testing has shown that this model can chat reasonably well in
- **Context length:** 256K
- **Knowledge cutoff date:** March 5, 2024

## Prerequisites

Jamba requires that you use `transformers` version 4.39.0 or higher:

```bash
pip install "transformers>=4.39.0"
```

In order to run optimized Mamba implementations, you first need to install `mamba-ssm` and `causal-conv1d`:

```bash
pip install mamba-ssm "causal-conv1d>=1.2.0"
```

You also need the model to be on a CUDA device.

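
For example, loading the model onto a GPU might look like this (a minimal sketch; the `ai21labs/Jamba-v0.1` checkpoint name and `bfloat16` dtype are illustrative assumptions, not taken from this README):

```python
import torch
from transformers import AutoModelForCausalLM

# With mamba-ssm and causal-conv1d installed, the optimized Mamba kernels
# are used by default; the model must be on a CUDA device for them to run.
# NOTE: the checkpoint name below is an assumption for illustration.
model = AutoModelForCausalLM.from_pretrained(
    "ai21labs/Jamba-v0.1", torch_dtype=torch.bfloat16
).to("cuda")
```
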
You can run the model without the optimized Mamba kernels, but this is **not** recommended, as it will result in significantly higher latency. To do so, specify `use_mamba_kernels=False` when loading the model.
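
For example (again a sketch, with the same assumed checkpoint name):

```python
from transformers import AutoModelForCausalLM

# Fall back to the pure-PyTorch Mamba implementation: slower, but it does
# not require mamba-ssm / causal-conv1d or a CUDA device.
# NOTE: the checkpoint name below is an assumption for illustration.
model = AutoModelForCausalLM.from_pretrained(
    "ai21labs/Jamba-v0.1", use_mamba_kernels=False
)
```
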
## How to use
*Note:* this code automatically appends the `<|startoftext|>` special token to any input.
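
The usage snippet this note refers to is not reproduced in this hunk; as a rough sketch, basic generation with `transformers` typically looks like the following (the checkpoint name, prompt, and generation settings are illustrative assumptions):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# NOTE: the checkpoint name below is an assumption for illustration.
model = AutoModelForCausalLM.from_pretrained("ai21labs/Jamba-v0.1")
tokenizer = AutoTokenizer.from_pretrained("ai21labs/Jamba-v0.1")

# The tokenizer prepends the "<|startoftext|>" special token automatically,
# so it does not need to be added to the prompt by hand.
input_ids = tokenizer("A pangram is a sentence that", return_tensors="pt").input_ids

output = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.batch_decode(output, skip_special_tokens=True)[0])
```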