UltraFeedback: Boosting Language Models with High-quality Feedback Paper • 2310.01377 • Published Oct 2, 2023 • 5
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model Paper • 2310.15477 • Published Oct 24, 2023
Sparse Low-rank Adaptation of Pre-trained Language Models Paper • 2311.11696 • Published Nov 20, 2023 • 2
KoLA: Carefully Benchmarking World Knowledge of Large Language Models Paper • 2306.09296 • Published Jun 15, 2023 • 19
OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models Paper • 2307.03084 • Published Jul 5, 2023 • 1
OpenPrompt: An Open-source Framework for Prompt-learning Paper • 2111.01998 • Published Nov 3, 2021 • 1
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models Paper • 2403.08281 • Published Mar 13, 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following Paper • 2403.03129 • Published Mar 5, 2024
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 44
Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process Paper • 2405.11870 • Published May 20, 2024
UltraMedical: Building Specialized Generalists in Biomedicine Paper • 2406.03949 • Published Jun 6, 2024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding Paper • 2406.12295 • Published Jun 18, 2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published Dec 23, 2024 • 40
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding Paper • 2501.18362 • Published 4 days ago • 17
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding Paper • 2501.18362 • Published 4 days ago • 17