mpt-7b-instruct-sharded / generation_config.json

Commit History

Replace model with mpt-7b-instruct, loaded in f16 and sharded to 2GB chunks
8d8911a

João Rafael committed on

increase max_new_tokens default
5e7cdb3

pszemraj committed on

better generation params
8267bf4

pszemraj committed on

add sharded checkpoint
7ab236e

peter szemraj committed on