Spaces:

zero-gpu-explorers
/

README

Running

App Files Files Community

129

Responses from model are showing to other users

#18

by djstrong - opened Apr 18

Discussion

djstrong

ZeroGPU Explorers org Apr 18

•

edited Apr 18

We have LLM chat https://huggingface.co./spaces/speakleash/Bielik-7B-Instruct-v0.1 When popular youtuber published video about our model, many users started using this space. We and users observed that the interface shows responses for questions from other users.
I don't think it is a bug in our code - it is quite standard, similar to other chat spaces.

One of the comments from YouTube: "they have a bug with this model. I noticed that sometimes replies are sent to the wrong recipients. I had a situation where I sat and watched the replies generated by people for 15 minutes (in the place where the reply to me should have been)."

codelion

ZeroGPU Explorers org Apr 18

@djstrong why do you have https://huggingface.co./spaces/speakleash/Bielik-7B-Instruct-v0.1/blob/main/app.py#L132 in your code in the GPU function? Try removing any stateful operation that doesn't require the GPU outside the function that is decorated with @spaces.GPU

djstrong

ZeroGPU Explorers org Apr 18

@codelion Thank you! How to escape from the @spaces.GPU function? Run it in a thread or it still will be inside GPU scope? We need to save answers from the model.

djstrong

ZeroGPU Explorers org Apr 18

So, the main function predictinvoke without @spaces.GPU and inside call model with @spaces.GPU.

@spaces.GPU # the only function with the decorator
def generate_response():
  ...

def predict(message, history, system_prompt, temperature, max_new_tokens, top_k, repetition_penalty, top_p):
  prepare_data()
  yield from generate_response()
  save_results()

djstrong

ZeroGPU Explorers org Apr 18

@codelion Thank you! I have optimized the code.

nxphi47

ZeroGPU Explorers org Apr 23

@djstrong May I check what was at L132 that caused the problem? I saw the print statement and wonder if we cannot print anything in the GPU function.

mrfakename

ZeroGPU Explorers org Apr 23

No you can print stuff it will be routed to the logs

djstrong

ZeroGPU Explorers org Apr 24

@djstrong May I check what was at L132 that caused the problem? I saw the print statement and wonder if we cannot print anything in the GPU function.

There was sending logs to Discord and a repository. However, I don't think it was causing the main problem and @codelion comment was "by the way".

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment