asdf's picture

19 1 7

asdf

ewre324

·

ewre324

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

updated a model 4 days ago

ewre324/ewre324-R1-Minueza-32M-Distill

updated a collection 4 days ago

View all activity

Organizations

ewre324's activity

New activity in open-r1/README 4 days ago

SmolLm2-135 R1 Distill

#5 opened 4 days ago by

New activity in unsloth/DeepSeek-V3-GGUF 26 days ago

What is the required GPU size to run Is a 4090 possible and does it support ollama

#5 opened 26 days ago by

I'm a newbie. How to use?

#4 opened 26 days ago by

New activity in unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit 6 months ago

RuntimeError: Unsloth: `unsloth/Meta-Llama-3.1-8B-bnb-4bit` is not a base model or a PEFT model.

#3 opened 6 months ago by

New activity in unsloth/llama-3-8b-bnb-4bit 8 months ago

34 hour for file tunning ?

#7 opened 9 months ago by

New activity in Felladrin/Minueza-32M-Deita 8 months ago

Finetuning

#1 opened 8 months ago by

New activity in CausalLM/35b-beta-long 8 months ago

How to finetune?

#4 opened 8 months ago by

New activity in unsloth/llama-3-8b-bnb-4bit 9 months ago

how to use output model as llm

#3 opened 9 months ago by

New activity in unsloth/llama-3-8b-bnb-4bit 10 months ago

Thank you for this nice model. Could you make a q8 gguf, please?

#2 opened 10 months ago by

New activity in unsloth/llama-3-70b-bnb-4bit 10 months ago

Quick question: how to load ckpt and do inference

#2 opened 10 months ago by

New activity in CohereForAI/c4ai-command-r-v01 10 months ago

Instruct-finetuning dataset

#43 opened 10 months ago by

New activity in unsloth/llama-3-8b-bnb-4bit 10 months ago

Missing Chat Template

#1 opened 10 months ago by

New activity in unsloth/llama-3-8b 10 months ago

is this the llama-3-8b model clone?

#1 opened 10 months ago by

New activity in CohereForAI/c4ai-command-r-plus 10 months ago

This is the greatest AI chat model yet

#30 opened 10 months ago by

New activity in CohereForAI/c4ai-command-r-plus-4bit 10 months ago

Excellant model, fine tuning resources

#5 opened 10 months ago by

New activity in PrunaAI/Locutusque-TinyMistral-248M-bnb-8bit-smashed 10 months ago

Seeking information about smashing

#2 opened 10 months ago by

New activity in BEE-spoke-data/smol_llama-101M-GQA 10 months ago

Link to code repository

#3 opened 10 months ago by

New activity in BEE-spoke-data/smol_llama-220M-openhermes 10 months ago

What is the model architecture?

#2 opened 10 months ago by

New activity in qnguyen3/Mixtral-4x400M 10 months ago

Is the model available to use?

#1 opened 10 months ago by