eugrug-60/medical-o1-reasoning-SFT-it_f10_incremental
datatrove
for all things web-scale data preparation: https://github.com/huggingface/datatrove
nanotron
for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotron
lighteval
for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval