Spaces:

SmolTuners
/

README

Running

App Files Files Community

Datasets

by s3nh - opened Dec 20, 2024

Discussion

s3nh

Smol Community org Dec 20, 2024

Hello SmolTuners!

As in description, our main mission is to focus on creating 'small llms' which can be usable to more specific tasks. To do this, we definitely have to focus on dataset which are capable to give as an additive value. I opened this discussion to gather and noted some datasets worth to look for, cause it hase to be starter point to ft (despite of quantization and model merging). Have a great day <3

Delta-Vector

Smol Community org Dec 20, 2024

https://huggingface.co./datasets/HuggingFaceTB/smoltalk

was used to finetune SmolLM2 - could be worth a look at, I'd probably filter this for math though.

Delta-Vector

Smol Community org Dec 22, 2024

A thing i've noticed using alot of smaller models is that most often then not, new pretrains of smaller models are not usually the way to go

Instead it's better to finetune upon distilled models such as nvidia/Llama-3.1-Minitron-4B-Width-Base or google/gemma-2-2b-it

s3nh

Smol Community org Dec 22, 2024

Ill have some writeups on my way, ill post an update today evening, lets go !

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment