Update README.md
README.md (CHANGED)
@@ -58,18 +58,16 @@ You can run the smashed model with these steps:
 pip install autoawq
 ```
 2. Load & run the model.
-```python
-
-
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from awq import AutoAWQForCausalLM
 
-
-
-
-input_ids = tokenizer("Quelle est la couleur des pruneaux?", return_tensors='pt').to(next(model.parameters()).device)["input_ids"]
-
-outputs = model.generate(input_ids, max_new_tokens=216)
-tokenizer.decode(outputs[0])
+model = AutoAWQForCausalLM.from_quantized("PrunaAI/OpenLLM-France-Lucie-7B-AWQ-4bit-smashed", trust_remote_code=True, device_map='auto')
+tokenizer = AutoTokenizer.from_pretrained("OpenLLM-France/Lucie-7B")
 
+input_ids = tokenizer("Quelle est la couleur des pruneaux?", return_tensors='pt').to(next(model.parameters()).device)["input_ids"]
+outputs = model.generate(input_ids, max_new_tokens=216)
+tokenizer.decode(outputs[0])
 ```
 
 ## Configurations
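Put together, the added lines form the updated usage snippet in the README. Below is a minimal end-to-end sketch of that snippet; it follows the diff except that the unused `AutoModelForCausalLM` import is dropped and the final `print(...)` is an illustrative addition.

```python
from transformers import AutoTokenizer
from awq import AutoAWQForCausalLM

# Load the AWQ 4-bit "smashed" model and the original base tokenizer
model = AutoAWQForCausalLM.from_quantized(
    "PrunaAI/OpenLLM-France-Lucie-7B-AWQ-4bit-smashed",
    trust_remote_code=True,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("OpenLLM-France/Lucie-7B")

# Tokenize the prompt and move it to the device the model was placed on
input_ids = tokenizer(
    "Quelle est la couleur des pruneaux?", return_tensors="pt"
).to(next(model.parameters()).device)["input_ids"]

# Generate up to 216 new tokens and decode the completion
outputs = model.generate(input_ids, max_new_tokens=216)
print(tokenizer.decode(outputs[0]))  # print added here for illustration
```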