Update README.md
README.md CHANGED
@@ -11,14 +11,14 @@ license: llama3.3
---
# about

-The Teaz series is my third attempt at making merges, after the Kostume and Kermes series.
+The Teaz series is my third attempt at making merges, this time on L3.x 70b, after the L3.2 3b Kostume and Kermes series.

This time, the goal was to make a smart model with low perplexity, in accordance with the principles of the Kermes series, but built as a merge of 3 merged models, as in the Kostume series.

Huihui's abliterated models were used:
- Llama 3.3 70b as the pivot of the first/main model.
- Nemotron 3.1 70b and Deepseek R1 Distill 70b as the pillars.
-- and Tulu 3 70b as the
+- and Tulu 3 70b as the backer of the 2nd and 3rd models.

Bingo again. I hit 3.45 ppl512 wikieng, 62+ on ARC-C, and 82+ on ARC-E. Absolute top of the class for L3.x 70b, like Kermes is for L3.2 3b.
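The commit itself does not include the merge recipe, so the following is only a rough sketch of what a mergekit config for the first/main model (Llama 3.3 70b as pivot, Nemotron and Deepseek R1 Distill as pillars) might look like. The choice of `model_stock` as the merge method and the exact huihui-ai repository names are my assumptions, not taken from this README.

```yaml
# Sketch only: a mergekit "model_stock" recipe matching the pivot/pillars
# layout described above. Method and repo names are assumed, not confirmed.
merge_method: model_stock
base_model: huihui-ai/Llama-3.3-70B-Instruct-abliterated          # pivot
models:
  - model: huihui-ai/Llama-3.1-Nemotron-70B-Instruct-abliterated  # pillar
  - model: huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated    # pillar
dtype: bfloat16
```

With mergekit installed, a recipe like this would be run with `mergekit-yaml config.yml ./output-model`. Presumably the 2nd and 3rd models would use analogous configs with Tulu 3 70b as the backer, and the three resulting merges would then be merged once more into the final model.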