bartowski/cognitivecomputations_Dolphin3.0-R1-Mistral-24B-GGUF Text Generation • Updated 2 days ago • 16.3k • 37
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 17 days ago • 7
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 12 days ago • 32
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published 11 days ago • 8
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 11 days ago • 51
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 10 days ago • 79
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 10 days ago • 51
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 10 days ago • 22
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 9 days ago • 33