Trick for less token usage and less hallucination

#19
by gopi87 - opened

In text-generation-webui, just update llama.cpp manually, then add this in the chat UI:

Start reply with

lets plan the steps and review the steps

for a better response.
*Don't use flash attention.

  • Big thanks to the Qwen team.

Even in this way, you can make the model follow your own custom prompt.
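If you want the same trick outside the web UI, here is a minimal sketch of the idea using llama-cpp-python: prefill the assistant turn with the "Start reply with" text so the model continues from it. The model path, context size, generation settings, and the hand-written ChatML-style template are assumptions for illustration, not something from the original post (recent llama-cpp-python versions expose a `flash_attn` flag, which is left off per the note above).

```python
# Sketch: reproduce the "Start reply with" trick by prefilling the assistant turn.
from llama_cpp import Llama

llm = Llama(
    model_path="qwen-instruct-q4_k_m.gguf",  # hypothetical local GGUF path
    n_ctx=8192,
    flash_attn=False,  # the post recommends not using flash attention
)

user_question = "How do I migrate a SQLite database to PostgreSQL?"
reply_prefix = "lets plan the steps and review the steps"

# Qwen instruct models use a ChatML-style template; appending the prefix
# right after the assistant tag forces the model to continue from it.
prompt = (
    "<|im_start|>user\n"
    f"{user_question}<|im_end|>\n"
    "<|im_start|>assistant\n"
    f"{reply_prefix}"
)

out = llm.create_completion(
    prompt=prompt,
    max_tokens=512,
    stop=["<|im_end|>"],
)
print(reply_prefix + out["choices"][0]["text"])
```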


Start reply with

lets plan it one by one

Gives a very organized response, like o1.
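The sketch above covers this variant too; only the prefix string changes (purely illustrative):

```python
reply_prefix = "lets plan it one by one"  # alternate "Start reply with" text from this reply
```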
