chargoddard committed ebf85fc (parent 26a8b94): Create README.md

README.md ADDED
---
datasets:
- EleutherAI/wikitext_document_level
tags:
- llama
---

LLaMA 33b finetuned on `wikitext_document_level` with combined linear and NTK-aware RoPE scaling (alpha=4, scale=2).

This model is coherent up to at least 8k context length and may work beyond that.

This is a merged version of [llama33b-s2a4-qlora](https://huggingface.co/chargoddard/llama33b-s2a4-qlora).

Note that this is *not* an instruct model - this is base LLaMA with an extended sequence length.
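As a rough illustration of what "combined linear and NTK-aware" scaling means, here is a minimal sketch of how the rotary angles could be computed. This is not the training code for this model; the helper name and defaults are hypothetical, and it assumes the standard RoPE formulation (NTK-aware scaling stretches the rotary base, linear scaling compresses the positions).

```python
import math

def rope_angles(dim=128, base=10000.0, alpha=4.0, scale=2.0, max_pos=8192):
    """Rotary angles with combined linear + NTK-aware scaling (hypothetical helper)."""
    # NTK-aware scaling: stretch the rotary base by alpha^(dim / (dim - 2)),
    # which slows the low-frequency components more than the high-frequency ones.
    ntk_base = base * alpha ** (dim / (dim - 2))
    # Per-pair inverse frequencies, as in standard RoPE (one per pair of dims).
    inv_freq = [1.0 / ntk_base ** (i / dim) for i in range(0, dim, 2)]
    # Linear (position-interpolation) scaling: divide positions by `scale`,
    # so position 8192 lands where position 4096 sat during pretraining.
    return [[(pos / scale) * f for f in inv_freq] for pos in range(max_pos)]
```

In the actual rotary embedding these angles would then be fed through cos/sin and applied to the query and key vectors; the sketch stops at the angle computation since that is where both scaling tricks act.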