12.4k
Open LLM Leaderboard
π
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Determine GPU requirements for large language models
Calculate memory needed to train AI models
Merge Lora adapters with a base model
VLMEvalKit Evaluation Results Collection
Request evaluation results for a speech model
More advanced and challenging multi-task evaluation
Add results to model card from Open LLM Leaderboard
VLMEvalKit Eval Results in video understanding benchmark
Compare Open LLM Leaderboard results
Explore and submit LLM benchmark evaluations