on MPT-30B gradient checkpointing

#7
by DanielTTY - opened

Could you do the same for the 30B version? or could you explain to us how it should be done here? thanks

Sign up or log in to comment