Spaces:

babelbox
/

README

Running

App Files Files Community

README / README.md

bjelkenhed

Update README.md

d3912e6 about 2 years ago

preview code

raw

history blame contribute delete

2.89 kB

	---
	title: README
	emoji: 👁
	colorFrom: gray
	colorTo: pink
	sdk: static
	pinned: false
	---

	# Whisper Fine-Tuning Event 🤗

	----------------------------------------------------------------
	Whisper Fine-Tuning Event
	----------------------------------------------------------------

	https://github.com/huggingface/community-events/blob/main/whisper-fine-tuning-event/README.md

	----------------------------------------------------------------
	Sign in to Lambdalabs
	----------------------------------------------------------------

	https://cloud.lambdalabs.com/instances

	and create a new instance with a A100 GPU without disk storage

	----------------------------------------------------------------
	Log in to Huggingface
	----------------------------------------------------------------

	```
	pip install git+https://github.com/huggingface/transformers
	```

	```
	git config --global credential.helper store
	huggingface-cli login
	```

	#### Provide Huggingface access token for your account

	https://huggingface.co./settings/tokens

	----------------------------------------------------------------
	Clone Whisper train project for run scripts
	----------------------------------------------------------------

	```
	git clone https://huggingface.co./bjelkenhed/whisper-train-ts
	```

	----------------------------------------------------------------
	Install requirements and python libraries
	----------------------------------------------------------------

	```
	sudo apt-get install git-lfs
	```

	```
	pip install --pre --force-reinstall torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/cu117

	pip install -r whisper-medium/requirements.txt

	python -c "import torch; print(torch.cuda.is_available())"

	```


	----------------------------------------------------------------
	Create and clone Huggingface model repository
	----------------------------------------------------------------

	Create a new model named whisper-small-sv (change to your selected name in following sections)

	```
	huggingface-cli repo create whisper-medium-ts-sv
	```

	```
	git clone https://huggingface.co./bjelkenhed/whisper-medium-ts-sv

	cd whisper-medium-ts-sv
	git lfs install
	```

	----------------------------------------------------------------
	Copy runscripts to model
	----------------------------------------------------------------

	```
	cp ~/whisper-small/run.sh ~/whisper-small/run_speech_recognition_seq2seq_streaming.py .
	```

	or

	```
	cp ~/whisper-medium/run.sh ~/whisper-medium/run_speech_recognition_seq2seq_streaming.py .
	```

	or

	```
	cp ~/whisper-large/run.sh ~/whisper-large/run_speech_recognition_seq2seq_streaming.py .
	```

	----------------------------------------------------------------
	Start training
	----------------------------------------------------------------

	```
	tmux new -s mysession
	```

	```
	bash run.sh
	```


	# To resume session

	```
	tmux a -t mysession
	```