Phyo Arkar Lwin
v3ss0n
·
AI & ML interests
None yet
Recent Activity
new activity
18 days ago
mistralai/Mistral-Small-24B-Instruct-2501:Remove gated access?
new activity
18 days ago
unsloth/README:I can't run any of the bnb-4bit quants with TextGenerationInference
new activity
25 days ago
Qwen/Qwen2.5-Max-Demo:Request to Release Qwen2.5-Max as Open Source Model
Organizations
None yet
v3ss0n's activity
Remove gated access?
2
#25 opened 22 days ago
by
davidmezzetti

I can't run any of the bnb-4bit quants with TextGenerationInference
1
#6 opened 18 days ago
by
v3ss0n
Request to Release Qwen2.5-Max as Open Source Model
3
#8 opened about 1 month ago
by
quantflex

fix: strftime_now is unknown (in <string>:1)
8
#17 opened 29 days ago
by
v3ss0n
Why increase censorship?
21
#20 opened 28 days ago
by
notafraud

Request access to the model
1
#22 opened 26 days ago
by
klydekushy
Adding tool call support in chat template
26
#13 opened 29 days ago
by
Navanit-AI

Commit #e969dcf155adde0b0654770948d93d1b2646d3f4 Introduced `strftime_now` and it is unknown in TGI.
3
#8 opened 29 days ago
by
v3ss0n
chat template doesn't include tools
9
#3 opened 29 days ago
by
copasseron
Add system message to chat template
1
#6 opened 29 days ago
by
Rocketknight1

chat template
1
#9 opened 29 days ago
by
lucyknada

Getting error when trying to infernce using example , or lmdeploy.
2
#7 opened 8 months ago
by
v3ss0n
llama.cpp / gguf?
3
#3 opened 9 months ago
by
nacs
How much VRam does it need?
1
#6 opened 8 months ago
by
v3ss0n
Run inference in CPU
3
#1 opened 9 months ago
by
hythythyt3
Quantized model coming?
8
#3 opened 10 months ago
by
dnhkng

GGUF file request
3
#14 opened 10 months ago
by
MicFizzy