--- license: apache-2.0 --- This is a llamafile for [WizardLM-2-7B](https://huggingface.co./microsoft/WizardLM-2-7B). Converted and tested on 4/15/2024. Safetensors came from microsoft's hf, quantized with llama.cpp, zipaligned with llamafile. The q3-k-l sized quant is under 4gb if you want something to share with your windows-only users. Instructions to run q3-k-l on Windows: Just download, add '.exe' to the filename, and open it. Bypass all friendly Microsoft warnings about using your own computer. It doesn't need network access, completely local. Put it on a keychain! Share with friends! Perfect gift for significant other! -= Llamafile =- Llamafiles are a standalone executable that run an LLM server locally on a variety of operating systems including FreeBSD, Windows, Windows via WSL, Linux, and Mac. The same file works everywhere, I've tested several of these on FreeBSD, Windows, Windows via WSL, and Linux. You just download the .llamafile, (chmod +x or rename to .exe as needed), run it, open the chat interface in a browser, and interact. Options can be passed in to expose the api etc. See their [docs](https://github.com/Mozilla-Ocho/llamafile) for details. [Mozilla Blog Announcement for Llamafile](https://hacks.mozilla.org/2023/11/introducing-llamafile/) - Windows: I tried the q3-k-l, it works. - FreeBSD note: Yes, it actually works on a fresh install of FreeBSD.