Interview request: genAI evaluation & documentation
#61 opened 6 months ago
by
meggymuggy
language dependency
#60 opened 8 months ago
by
Jay369
[AUTOMATED] Model Memory Requirements
#59 opened 10 months ago
by
model-sizer-bot
Deployments to Azure and Inference Endpoints
#55 opened 11 months ago
by
mo2024
Very sensitive to any repetition penalty!
#52 opened 11 months ago
by
jukofyork

Text2SQL2Output
#51 opened 11 months ago
by
Sudipta179002
The generated response cannot stop.
1
#50 opened 11 months ago
by
shaohuay
Saving dbrx model and tokenizer in dbfs
5
#49 opened 11 months ago
by
pro-shep

OSError: Unable to load vocabulary from file
7
#47 opened 11 months ago
by
khurramnaseem
TypeError: __init__() got an unexpected keyword argument 'bias'
2
#46 opened 11 months ago
by
dainesn1
[DO NOT REVIEW] Mixtral like config
#45 opened 11 months ago
by
Pernekhan
Why clamp qkv_states, is it common?
#44 opened 11 months ago
by
jay68
Chat template
9
#43 opened 11 months ago
by
ehartford

GGUF quants?
1
#41 opened 11 months ago
by
Iommed
Does the tokenizer of this model have a network to load successfully?
3
#40 opened 11 months ago
by
Rnake
VRAM Requirements?
8
#39 opened 11 months ago
by
dounykim
How to get hands-on experience as a newbie
1
#38 opened 11 months ago
by
kimsia
Text2sql template and examples
3
#34 opened 11 months ago
by
daxiongshu
Continuation of the Discussion: More than 10 minutes the status is in Setting `pad_token_id` to `eos_token_id`:100257 for open-end generation. #28
7
#31 opened 11 months ago
by
Madhugraj
Errors During Training for the Original Implementation and the Fixes for the Errors
2
#24 opened 11 months ago
by
v2ray

Instruct dataset
#23 opened 11 months ago
by
Andriy
How to Fine Tune DBRX-Instruct?
7
#18 opened 11 months ago
by
elysiia

Bug on AMD MI 250 with flash-attention
3
#13 opened 11 months ago
by
PierreColombo
The fused expert parameters mean load_in_4bit doesn't work properly, nor does LoRA
31
#10 opened 11 months ago
by
tdrussell