Running 113 113 Open-LLM performances are plateauing, letβs make the leaderboard steep again π Update leaderboard for fair model evaluation
Running 931 931 Can You Run It? LLM version π Determine GPU requirements for large language models
stacked-summaries/flan-t5-large-stacked-samsum-1024 Summarization β’ Updated Sep 23, 2023 β’ 58 β’ 10
Running on CPU Upgrade 4.92k 4.92k MTEB Leaderboard π₯ Select benchmarks and languages for text embeddings evaluation
MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli Zero-Shot Classification β’ Updated Apr 11, 2024 β’ 1.05M β’ β’ 199