grimulkan
/

llama2_70b_longlora_fp16_32k_ROPE8

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

grimulkan commited on Jan 19

Commit

201946d

•

1 Parent(s): dab2902

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 license: llama2
 ---
-This is the same as Yukang's [Llama-2-70b-longlora-32k](https://huggingface.co/Yukang/Llama-2-70b-longlora-32k), except that the extra pad token has been stripped from the tokenizer to make it similar to the base Llama model. Please refer to that page for more details.
 It was created by merging [LongAlpaca-70B-lora](https://huggingface.co/Yukang/LongAlpaca-70B-lora) into [Llama-2-70b](https://huggingface.co/meta-llama/Llama-2-70b), replacing the embed and norm layers as described in the [LongLoRA repo](https://github.com/dvlab-research/LongLoRA), and removing the extra row and pad token.

 ---
 license: llama2
 ---
+This is the same as Yukang's [Llama-2-70b-longlora-32k](https://huggingface.co/Yukang/Llama-2-70b-longlora-32k), except that the extra pad token has been stripped from the tokenizer to make it similar to the base Llama model (and it has been merged into the base model). Please refer to that page for more details.
 It was created by merging [LongAlpaca-70B-lora](https://huggingface.co/Yukang/LongAlpaca-70B-lora) into [Llama-2-70b](https://huggingface.co/meta-llama/Llama-2-70b), replacing the embed and norm layers as described in the [LongLoRA repo](https://github.com/dvlab-research/LongLoRA), and removing the extra row and pad token.