Update README.md
Llama-3.1-8B-ArliAI-RPMax-v1.1 is a variant of the Meta-Llama-3.1-8B model.

v1.1 is just a small fix that no longer trains and saves the embeddings layer, since v1.0 unnecessarily trained the lm_head by accident.

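In PEFT terms, the difference is roughly the one sketched below. This is a minimal illustration, not the published training code: the use of Hugging Face PEFT and the choice of target modules are assumptions, but `modules_to_save` is the mechanism that would cause full copies of the embedding and lm_head weights to be trained and saved alongside the LoRA adapter.

```python
from peft import LoraConfig

# Minimal sketch of the v1.0 -> v1.1 fix. The actual RPMax training code
# is not public, so PEFT itself and the target module list are assumptions.
LLAMA_LINEAR_MODULES = [
    "q_proj", "k_proj", "v_proj", "o_proj",
    "gate_proj", "up_proj", "down_proj",
]

# v1.0 behavior: modules_to_save trains and saves full copies of the
# token embeddings and lm_head on top of the LoRA adapter weights.
config_v1_0 = LoraConfig(
    r=64,
    lora_alpha=128,
    target_modules=LLAMA_LINEAR_MODULES,
    modules_to_save=["embed_tokens", "lm_head"],  # the accidental part
)

# v1.1 behavior: leave modules_to_save unset so only the LoRA adapter
# weights are trained and saved.
config_v1_1 = LoraConfig(
    r=64,
    lora_alpha=128,
    target_modules=LLAMA_LINEAR_MODULES,
)
```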
You can access the model at https://arliai.com and ask questions at https://www.reddit.com/r/ArliAI/
### Training Details
* **Sequence Length**: 8192
* **Training Duration**: Approximately 1 day on 2x3090Ti
* **Epochs**: 1 epoch of training to minimize repetition sickness
* **LoRA**: rank 64, alpha 128, resulting in ~2% trainable weights
* **Learning Rate**: 0.00001
* **Gradient Accumulation**: A very low 32 for better learning (see the sketch below)

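As a rough illustration of how these numbers fit together, here is a hypothetical transformers + PEFT setup. The dataset, per-device batch size, and target modules are not stated in this card and are assumptions, though targeting all linear projections is consistent with the ~2% trainable figure.

```python
import torch
from transformers import AutoModelForCausalLM, TrainingArguments
from peft import LoraConfig, get_peft_model

# Sketch only: mirrors the Training Details list above; anything not in
# that list (target modules, batch size, output_dir) is an assumption.
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B",
    torch_dtype=torch.bfloat16,
)

lora = LoraConfig(
    r=64,            # rank 64
    lora_alpha=128,  # alpha 128
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora)
model.print_trainable_parameters()  # reports roughly 2% trainable weights

args = TrainingArguments(
    output_dir="rpmax-v1.1-lora",    # hypothetical
    num_train_epochs=1,              # 1 epoch against repetition sickness
    learning_rate=1e-5,              # 0.00001
    gradient_accumulation_steps=32,  # the low accumulation noted above
    per_device_train_batch_size=1,   # assumed; not stated in the card
    bf16=True,
)
# The 8192 sequence length would be applied when tokenizing/packing the data.
```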
## Quantization