Steelskull
/

L3-MS-Astoria-70b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Steelskull commited on May 8

Commit

9aa31ec

•

1 Parent(s): 1249846

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -119,7 +119,11 @@ code {
       <p>This is my first foray into 70b models, so this is more or less an experiment, please let me know your thoughts on the model and where their can be improvements.<br>
          L3-MS-Astoria-70b combines the strengths of multiple models to deliver a well-rounded, capable assistant. It is aimed at performing general tasks, storytelling, roleplay, and more mature content.<br>
          The model stock merging method attempts to make the model remain focused, tailored, and high-quality.
-      <h2>Config:</h2>
       <pre><code>MODEL_NAME = "L3-MS-Astoria-70b"
 yaml_config = """
 base_model: failspy/llama-3-70B-Instruct-abliterated
@@ -131,7 +135,7 @@ models:
   - model: NeverSleep/Llama-3-Lumimaid-70B-v0.1-alt
 """
 </code></pre>
-      <h3>Source Model Details:</h3>
       <p><strong>migtissera/Tess-2.0-Llama-3-70B-v0.2:</strong><br>
         Tess, short for Tesoro (Treasure in Italian), is a general purpose Large Language Model series. Tess-2.0-Llama-3-70B-v0.2 was trained on the meta-llama/Meta-Llama-3-70B base. The change between v0.1 and this version, v0.2 is that v0.2 has undergone an additional step of uncensoring.
       </p>

       <p>This is my first foray into 70b models, so this is more or less an experiment, please let me know your thoughts on the model and where their can be improvements.<br>
          L3-MS-Astoria-70b combines the strengths of multiple models to deliver a well-rounded, capable assistant. It is aimed at performing general tasks, storytelling, roleplay, and more mature content.<br>
          The model stock merging method attempts to make the model remain focused, tailored, and high-quality.
+      <h2>Quants:</h2>
+    <p>(Thanks to <a href="https://huggingface.co/mradermacher">@Mradermacher!</a>, please send them likes and follows!)</p>
+    <p><a href="https://huggingface.co/mradermacher/L3-MS-Astoria-70b-GGUF">L3-Arcania-4x8b-GGUF (GGUFs)</a></p>
+    <p></p>
+      <h3>Config:</h3>
       <pre><code>MODEL_NAME = "L3-MS-Astoria-70b"
 yaml_config = """
 base_model: failspy/llama-3-70B-Instruct-abliterated
   - model: NeverSleep/Llama-3-Lumimaid-70B-v0.1-alt
 """
 </code></pre>
+      <h4>Source Model Details:</h4>
       <p><strong>migtissera/Tess-2.0-Llama-3-70B-v0.2:</strong><br>
         Tess, short for Tesoro (Treasure in Italian), is a general purpose Large Language Model series. Tess-2.0-Llama-3-70B-v0.2 was trained on the meta-llama/Meta-Llama-3-70B base. The change between v0.1 and this version, v0.2 is that v0.2 has undergone an additional step of uncensoring.
       </p>