Llama3.1-Mamba2-8B-distill / trainer_state.json
Junxiong Wang
add models
9224e3e
File too large to display, you can check the raw version instead.