asdf
ewre324
AI & ML interests
None yet
Recent Activity
upvoted an article 3 days ago
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
updated a model 4 days ago
ewre324/ewre324-R1-Minueza-32M-Distill
updated a collection 4 days ago
R1 Distill
Organizations
ewre324's activity
SmolLm2-135 R1 Distill
#5 opened 4 days ago by ewre324
What is the required GPU size to run? Is a 4090 possible, and does it support ollama?
10
#5 opened 26 days ago by sminbb
I'm a newbie. How to use?
1
#4 opened 26 days ago by huangkk
34 hours for fine-tuning?
4
#7 opened 9 months ago by dad1909
Finetuning
3
#1 opened 8 months ago by ewre324
How to finetune?
#4 opened 8 months ago by ewre324
How to use the output model as an LLM
2
#3 opened 9 months ago by narsisfa
Thank you for this nice model. Could you make a q8 gguf, please?
7
#2 opened 10 months ago by NikolayKozloff
Quick question: how to load ckpt and do inference
1
#2 opened 10 months ago by KevinYi94
Instruct-finetuning dataset
5
#43 opened 10 months ago by Andriy
Missing Chat Template
5
#1 opened 10 months ago by dfrank
is this the llama-3-8b model clone?
13
#1 opened 10 months ago by malhajar
This is the greatest AI chat model yet
6
#30 opened 10 months ago by JJJJJPSYCHIC
Excellent model, fine-tuning resources
#5 opened 10 months ago by ewre324
Seeking information about smashing
3
#2 opened 10 months ago by ewre324
Link to code repository
1
#3 opened 10 months ago by ewre324
What is the model architecture?
1
#2 opened 10 months ago by ewre324
Is the model available to use?
#1 opened 10 months ago by ewre324