Holmes: Benchmark the Linguistic Competence of Language Models Paper • 2404.18923 • Published Apr 29, 2024
JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published about 1 month ago • 19
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 25 days ago • 18