jsfs11
/

meta-LLama3-8b-PruneME-TEST-22_30

Text Generation

meta-llama/Meta-Llama-3-8B-Instruct

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jsfs11 commited on Apr 26, 2024

Commit

688126a

·

verified ·

1 Parent(s): 5bfa402

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -11,14 +11,14 @@ base_model:
 - meta-llama/Meta-Llama-3-8B-Instruct
 ---
-# PruneMELLama8bTEST-22_30
 This model was pruned after being analyzed with [PruneMe](https://github.com/arcee-ai/PruneMe)
 *INFO:root:Layer 22 to 30 has the minimum average distance of 0.26598974609375. Consider examining this layer more closely for potential optimization or removal.*
-PruneMELLama8bTEST-22_30 is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
 * [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
@@ -45,7 +45,7 @@ from transformers import AutoTokenizer
 import transformers
 import torch
-model = "jsfs11/PruneMELLama8bTEST-22_30"
 messages = [{"role": "user", "content": "What is a large language model?"}]
 tokenizer = AutoTokenizer.from_pretrained(model)

 - meta-llama/Meta-Llama-3-8B-Instruct
 ---
+# meta-LLama3-8b-PruneME-TEST-22_30
 This model was pruned after being analyzed with [PruneMe](https://github.com/arcee-ai/PruneMe)
 *INFO:root:Layer 22 to 30 has the minimum average distance of 0.26598974609375. Consider examining this layer more closely for potential optimization or removal.*
+meta-LLama3-8b-PruneME-TEST-22_30 is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
 * [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
 import transformers
 import torch
+model = "jsfs11/meta-LLama3-8b-PruneME-TEST-22_30"
 messages = [{"role": "user", "content": "What is a large language model?"}]
 tokenizer = AutoTokenizer.from_pretrained(model)