Provide fine-tuning example notebook using hf transformers

#22
by MakerMotion - opened

Can anyone provide an example fine-tuning notebook with custom data using HF transformers? Specifically, I wonder whether the 'labels' are shifted automatically, as in the GPT-2 model, or how I should provide the 'labels' to the model at training time.

@MakerMotion Did you find an answer to this?

@zachblank I think so. Because MPT is not fully integrated into this version of HF, I took a look at their model repo, and in this file https://huggingface.co./mosaicml/mpt-7b-instruct/blob/main/modeling_mpt.py, in the forward() function, it looks like the labels are shifted automatically if you pass a labels argument. [line 244]
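To make that concrete, here is a minimal sketch (not an official example) of passing unshifted labels to MPT through HF transformers. It assumes the repo's own modeling code is loaded with trust_remote_code and that the prompt text is just a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "mosaicml/mpt-7b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,  # MPT ships its own modeling code (modeling_mpt.py)
)

# Placeholder instruction-style prompt; replace with your own formatted example.
text = "### Instruction:\nSummarize the article below.\n...\n### Response:\n..."
batch = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)

# Key point: labels are an unshifted copy of input_ids.
# forward() shifts them internally (like GPT-2), so no manual shifting is needed.
batch["labels"] = batch["input_ids"].clone()

outputs = model(**batch)
print(outputs.loss)  # causal language-modeling loss over the shifted targets
```

The same pattern carries over to a Trainer setup: have your dataset or collator emit a `labels` field equal to `input_ids` and let the model handle the shift.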

@MakerMotion Thanks! Do you have an example notebook you could share? I'm new at this and still trying to wrap my head around it. Thanks!

Closing as stale

abhi-mosaic changed discussion status to closed

Is there a concrete notebook example of taking the MPT-7B-Instruct model and fine-tuning it with an HF dataset, for example the multi_news dataset for news summarization?

  • How to prepare the dataset / prompt
  • How to freeze layers and have a small number of trainable parameters (given that you don't have LoRA support yet)? A rough sketch covering both points follows below.
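Here is a rough sketch of both pieces, not an official recipe: it assumes the `datasets` multi_news config (columns `document` and `summary`), an ad-hoc instruction-style prompt template, and the MPT module layout from modeling_mpt.py (`model.transformer.blocks`). Check `model.named_parameters()` before relying on the exact names:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "mosaicml/mpt-7b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

# Ad-hoc prompt template for summarization; adjust to taste.
PROMPT = (
    "### Instruction:\nSummarize the following articles.\n\n"
    "{document}\n\n### Response:\n{summary}"
)

def to_features(example):
    text = PROMPT.format(document=example["document"], summary=example["summary"])
    enc = tokenizer(text, truncation=True, max_length=1024)
    enc["labels"] = enc["input_ids"].copy()  # unshifted; the model shifts internally
    return enc

# Small slice just to illustrate the preprocessing step.
dataset = load_dataset("multi_news", split="train[:1%]")
dataset = dataset.map(to_features, remove_columns=["document", "summary"])

# Freeze everything, then unfreeze only the last two transformer blocks as a crude
# stand-in for parameter-efficient tuning (no LoRA here).
for param in model.parameters():
    param.requires_grad = False
for block in model.transformer.blocks[-2:]:
    for param in block.parameters():
        param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable params: {trainable:,} / {total:,}")
```

For actual training you would still need padding or packing, a data collator, and a Trainer (or Composer) loop on top of this; the sketch only covers the two points asked about.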
