trl-lib/qwen1.5-1.8b-sft