NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation โข Updated 10 days ago โข 9.38k โข 264
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi โข 13 items โข Updated Sep 18, 2024 โข 227
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation โข Updated 26 days ago โข 771k โข โข 832
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 โข 8 items โข Updated 5 days ago โข 379
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper โข 2501.11873 โข Published Jan 21 โข 63
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข Updated 5 days ago โข 1.26M โข โข 1.2k
cognitivecomputations/Wizard-Vicuna-30B-Uncensored Text Generation โข Updated May 20, 2024 โข 3.11k โข 155
lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF Text Generation โข Updated Jan 20 โข 928k โข 67