jacobfulano
committed
Commit af1b522
Parent(s): aa53cd9
Update README.md
README.md CHANGED
@@ -49,11 +49,12 @@ model = transformers.AutoModelForCausalLM.from_pretrained('mosaicml/mpt-7b-story
 model.to(device='cuda:0', dtype=torch.bfloat16)
 ```
 
-Although the model was trained with a sequence length of 2048
+Although the model was trained with a sequence length of 2048 and finetuned with a sequence length of 65536,
+ALiBi enables users to increase the maximum sequence length during finetuning and/or inference. For example:
 
 ```python
 config = transformers.AutoConfig.from_pretrained('mosaicml/mpt-7b-storywriter', trust_remote_code=True)
-config.update({"max_seq_len":
+config.update({"max_seq_len": 83968})
 model = transformers.AutoModelForCausalLM.from_pretrained('mosaicml/mpt-7b-storywriter', config=config, trust_remote_code=True)
 ```
 
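For context, here is a minimal end-to-end sketch of the pattern this commit documents: override `max_seq_len` in the config so ALiBi can extrapolate to a longer context, then load the model with that config. The tokenizer choice (`EleutherAI/gpt-neox-20b`) and the generation call are illustrative assumptions and are not part of this diff.

```python
import torch
import transformers

# Override the maximum sequence length before loading the model;
# 83968 is the value used in the updated README.
config = transformers.AutoConfig.from_pretrained('mosaicml/mpt-7b-storywriter', trust_remote_code=True)
config.update({"max_seq_len": 83968})

model = transformers.AutoModelForCausalLM.from_pretrained(
    'mosaicml/mpt-7b-storywriter',
    config=config,
    trust_remote_code=True,
)
model.to(device='cuda:0', dtype=torch.bfloat16)

# Assumed tokenizer and generation step, shown only to make the sketch runnable.
tokenizer = transformers.AutoTokenizer.from_pretrained('EleutherAI/gpt-neox-20b')
inputs = tokenizer("Once upon a time", return_tensors='pt').to('cuda:0')
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```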