Zephyr 7B Beta Llamafiles

See here for a guide on how to use llamafiles!

Both the server and the CLI are based on TheBloke's Zephyr 7B Beta GGUF Q4_K_M model.

Usage

NOTE: Due to the executable being greater than 4GB, it is currently not compatible with Windows. I will update with a Windows friendly version of Zephyr 7B Beta when I can.

# replace with the CLI if you prefer
wget https://huggingface.co./TimeSurgeLabs/zephyr-7b-beta-llamafile/resolve/main/zephyr-beta-server.llamafile
chmod +x zephyr-beta-server.llamafile
./zephyr-beta-server.llamafile
Downloads last month
26
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Collection including TimeSurgeLabs/zephyr-7b-beta-llamafile