CobraMamba/mamba-gpt-3b-v4

CobraMamba committed on Sep 13, 2023

Commit 25fd7b2 • 1 Parent(s): 900f740

Update README.md

Files changed (1)

README.md +40 -0

@@ -46,3 +46,43 @@ The training code and data will be open sourced later on Github(https://github.c
 We fine-tuned the OpenLLaMA model and surpassed the original model on multiple evaluation subtasks, making it currently the best-performing 3B model, with performance comparable to llama-7b
 - Base model: [openlm-research/open_llama_3b_v2](https://huggingface.co/openlm-research/open_llama_3b_v2)

+## Usage
+To use the model with the `transformers` library on a machine with GPUs, first make sure you have the `transformers`, `accelerate` and `torch` libraries installed.
+```bash
+pip install transformers==4.29.2
+pip install accelerate==0.19.0
+pip install torch==2.0.0
+```
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("CobraMamba/mamba-gpt-3b-v4")
+model = AutoModelForCausalLM.from_pretrained("CobraMamba/mamba-gpt-3b-v4", trust_remote_code=True, torch_dtype=torch.float16, device_map="auto")
+# The model expects Alpaca-style prompts, so format your text accordingly
+input_context = "Your text here"
+input_ids = tokenizer.encode(input_context, return_tensors="pt").to(model.device)
+output = model.generate(input_ids, max_length=128, do_sample=True, temperature=0.7)  # temperature only applies when sampling
+print(tokenizer.decode(output[0], skip_special_tokens=True))
+```
+## Citation
+If you find this work helpful, please cite it as:
+```bibtex
+@Misc{mamba-gpt-3b-v4,
+  title = {Mamba-GPT-3b-v4},
+  author = {chiliu},
+  howpublished = {\url{https://huggingface.co/CobraMamba/mamba-gpt-3b-v4}},
+  year = {2023}
+}
+```
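
The usage snippet in this diff says the model uses the Alpaca prompt, yet the example passes raw text to the tokenizer. As a minimal sketch, the helper below wraps an instruction in the widely used Stanford Alpaca template. The README does not state the exact template used for fine-tuning, so the template wording and the helper name `build_alpaca_prompt` are assumptions rather than the author's confirmed format.

```python
from typing import Optional

# Assumption: the standard Stanford Alpaca templates. The model card does not
# spell out the exact wording used during fine-tuning.
ALPACA_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
ALPACA_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)


def build_alpaca_prompt(instruction: str, input_text: Optional[str] = None) -> str:
    """Hypothetical helper: format an instruction (and optional input) as an Alpaca prompt."""
    if input_text:
        return ALPACA_WITH_INPUT.format(instruction=instruction, input=input_text)
    return ALPACA_NO_INPUT.format(instruction=instruction)


if __name__ == "__main__":
    # The resulting string would take the place of "Your text here".
    print(build_alpaca_prompt("Name three primary colors."))
```

The returned string can be assigned to `input_context` in the usage snippet. Because the prompt ends with `### Response:`, the model's continuation is the answer, and the decoded output will still include the prompt text unless it is sliced off.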