Update README.md
README.md CHANGED
@@ -1,5 +1,5 @@
 ---
-license:
+license: apache-2.0
 tags:
 - Composer
 - MosaicML
@@ -15,7 +15,7 @@ MPT-7B-StoryWriter-65k+ is a model designed to read and write fictional stories
 It was built by finetuning MPT-7B with a context length of 65k tokens on a filtered fiction subset of the [books3 dataset](https://huggingface.co/datasets/the_pile_books3).
 At inference time, thanks to [ALiBi](https://arxiv.org/abs/2108.12409), MPT-7B-StoryWriter-65k+ can extrapolate even beyond 65k tokens.
 We demonstrate generations as long as 84k tokens on a single node of 8 A100-80GB GPUs in our [blogpost](https://www.mosaicml.com/blog/mpt-7b).
-* License:
+* License: Apache 2.0
 
 This model was trained by [MosaicML](https://www.mosaicml.com) and follows a modified decoder-only transformer architecture.
 
@@ -25,7 +25,7 @@ May 5, 2023
 
 ## Model License
 
-
+Apache 2.0
 
 ## Documentation
 
@@ -167,6 +167,10 @@ This model was finetuned by Alex Trott and the MosaicML NLP team
 
 If you're interested in [training](https://www.mosaicml.com/training) and [deploying](https://www.mosaicml.com/inference) your own MPT or LLMs on the MosaicML Platform, [sign up here](https://forms.mosaicml.com/demo?utm_source=huggingface&utm_medium=referral&utm_campaign=mpt-7b).
 
+## Disclaimer
+
+The license on this model does not constitute legal advice. We are not responsible for the actions of third parties who use this model. Please consult an attorney before using this model for commercial purposes.
+
 
 ## Citation
 
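The context lines quoted in the second hunk make a concrete technical claim: because of ALiBi, the model can extrapolate past the 65k-token context it was finetuned with, and the blogpost demonstrates 84k-token generations. A minimal sketch of how a reader might exercise that is below. It assumes the standard Hugging Face `transformers` auto classes with `trust_remote_code=True` and an MPT config field named `max_seq_len`, both taken from MosaicML's published usage examples rather than from this diff, so check the model card's Documentation section for the exact API.

```python
import transformers

# Hypothetical usage sketch; model id and config attribute follow the
# MosaicML MPT pattern and should be verified against the model card.
model_name = 'mosaicml/mpt-7b-storywriter'

# MPT's custom config (loaded via trust_remote_code) exposes max_seq_len.
# ALiBi has no fixed positional limit, so raising it lets the model attend
# over contexts longer than the 65k tokens used during finetuning.
config = transformers.AutoConfig.from_pretrained(model_name, trust_remote_code=True)
config.max_seq_len = 83968  # roughly the 84k tokens demonstrated in the blogpost

model = transformers.AutoModelForCausalLM.from_pretrained(
    model_name,
    config=config,
    trust_remote_code=True,
)

# The MPT cards document the EleutherAI/gpt-neox-20b tokenizer.
tokenizer = transformers.AutoTokenizer.from_pretrained('EleutherAI/gpt-neox-20b')
```

Contexts this long are memory-hungry; per the card, the 84k-token demonstration ran on a single node of 8 A100-80GB GPUs.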