A set of tools to enable finetuning, evaluations, prototyping, agentic workflows etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!!

Causaly LTD
Enterprise
company
AI & ML interests
None defined yet.
Collections
2
Most commonly used leaderboards to check model capabilities
-
12.7k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
289
LLM Performance Leaderboard
🐨View LLM Performance Leaderboard
-
4.14k
Chatbot Arena Leaderboard
🏆Display chatbot leaderboard and statistics
-
5.02k
MTEB Leaderboard
🥇Select benchmarks and languages for text embeddings evaluation
models
None public yet
datasets
None public yet