Build error
base_model in config is unnecessary; remove it and fix grammar
chat.py +1 -1
config.yml +0 -2
chat.py
CHANGED
@@ -86,7 +86,7 @@ with blocks:
     gr.Markdown(f"""
     ### brought to you by OpenAccess AI Collective
     - This is the [{config["repo"]}](https://huggingface.co/{config["repo"]}) model file [{config["file"]}](https://huggingface.co/{config["repo"]}/blob/main/{config["file"]})
-    - This Space uses GGML with GPU support, so it can run larger models on smaller GPUs & VRAM
+    - This Space uses GGML with GPU support, so it can quickly run larger models on smaller GPUs & VRAM.
     - This is running on a smaller, shared GPU, so it may take a few seconds to respond.
     - [Duplicate the Space](https://huggingface.co/spaces/openaccess-ai-collective/ggml-ui?duplicate=true) to skip the queue and run in a private space or to use your own GGML models.
     - When using your own models, simply update the [config.yml](https://huggingface.co/spaces/openaccess-ai-collective/ggml-ui/blob/main/config.yml)")
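For orientation, here is a minimal runnable sketch of the app around this hunk, assuming only what the hunk header shows (a `with blocks:` context from `gr.Blocks()`); the elided chat components and the config loader are stand-ins, not the Space's actual code:

```python
import gradio as gr
import yaml

# Assumed loader: chat.py reads the same config.yml shown in the next diff.
with open("config.yml") as f:
    config = yaml.safe_load(f)

blocks = gr.Blocks()
with blocks:
    # ... chatbot, textbox, and submit handlers elided ...
    gr.Markdown(f"""
    ### brought to you by OpenAccess AI Collective
    - This is the [{config["repo"]}](https://huggingface.co/{config["repo"]}) model file [{config["file"]}](https://huggingface.co/{config["repo"]}/blob/main/{config["file"]})
    """)

blocks.launch()
```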
config.yml
CHANGED
@@ -1,8 +1,6 @@
 ---
 repo: TheBloke/wizard-vicuna-13B-GGML
 file: wizard-vicuna-13B.ggml.q5_1.bin
-# if the repo above doesn't include the tokenizer set the base repo it was based on with a valid tokenizer model
-base_model: junelee/wizard-vicuna-13b
 llama_cpp:
   n_ctx: 2048
   n_gpu_layers: 40 # llama 13b has 40 layers
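With base_model gone, everything the app needs is in this file. A minimal sketch of how a Space like this can consume it, assuming llama-cpp-python and huggingface_hub are installed; the wiring below is an illustration, not necessarily the Space's actual chat.py:

```python
import yaml
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Read the Space's config.yml shown above.
with open("config.yml") as f:
    config = yaml.safe_load(f)

# Fetch the quantized GGML weights named in the config from the Hub.
model_path = hf_hub_download(repo_id=config["repo"], filename=config["file"])

# Pass the llama_cpp block through as keyword arguments:
# n_ctx sets the context window, n_gpu_layers offloads layers to the GPU.
llm = Llama(model_path=model_path, **config["llama_cpp"])

print(llm("Q: What is GGML? A:", max_tokens=64)["choices"][0]["text"])
```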