CUDA extension not installed (even after manual compile and pip install)
1
#26 opened 5 months ago
by
markemicek
GGUF format
#25 opened about 1 year ago
by
giladgd
TypeError: 'NoneType' object is not iterable in .../models_settings.py
#24 opened about 1 year ago
by
thinktink
Calibration dataset used to perform GPTQ
#23 opened about 1 year ago
by
ht-rohit
A weird bug
11
#22 opened over 1 year ago
by
XceptDev
Offloading to cpu not working?
1
#21 opened over 1 year ago
by
fahadh4ilyas
May I ask why the GPTQ version is slow?
#20 opened over 1 year ago
by
lynngao815
can you upload a falcon-40b-GPTQ?
2
#18 opened over 1 year ago
by
Gian-hf
Update README.md
#17 opened over 1 year ago
by
saattrupdan
OOM when running the simple code again in jupyter notebook
2
#16 opened over 1 year ago
by
becks2000
Issues with Auto
3
#15 opened over 1 year ago
by
Devonance
What is the difference between GPTQ and QLoRA?
2
#12 opened over 1 year ago
by
Ichsan2895
Error when prompting simple text after successful loading
19
#11 opened over 1 year ago
by
joseph3553
Custom 4-bit Finetuning 5-7 times faster inference than QLoRA
#7 opened over 1 year ago
by
rmihaylov
Error when attempting to run.. Appears model files are missing or configuration issue
20
#6 opened over 1 year ago
by
jdc4429
cuda extension not installed
2
#5 opened over 1 year ago
by
becks2000
3bit quantization
1
#3 opened over 1 year ago
by
deleted
GGML?
7
#2 opened over 1 year ago
by
creative420
Unfortunately I can't run on text-generation-webui
11
#1 opened over 1 year ago
by
Suoriks