Update README.md
---
inference: False
---

# ethzanalytics/gpt-j-6B-8bit-sharded

This is a version of `hivemind/gpt-j-6B-8bit` sharded for low-RAM loading.

Please refer to the [original model card](https://huggingface.co/hivemind/gpt-j-6B-8bit) for all details.

## Usage

> **NOTE:** Prior to loading the model, you need to "patch" it to be compatible with loading 8-bit weights. See the original model card above for details on how to do this.

```python
from transformers import AutoTokenizer, GPTJForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ethzanalytics/gpt-j-6B-8bit-sharded")

model = GPTJForCausalLM.from_pretrained(
    "ethzanalytics/gpt-j-6B-8bit-sharded",
    low_cpu_mem_usage=True,
    max_shard_size="1000MB",
)
```