OpenCompass

community

https://opencompass.org.cn/

Recent Activity

KennyUTC authored a paper 7 days ago

OCSampler: Compressing Videos to One Clip with Single-step Sampling

nebulae09 authored a paper 7 days ago

Redundancy Principles for MLLMs Benchmarks

KennyUTC authored a paper 7 days ago

Redundancy Principles for MLLMs Benchmarks

Organization Card

OpenCompass Website | OpenCompass Toolkit

👋 Join us on Discord and WeChat

Follow us on GitHub

OpenCompass is a platform focused on the evaluation of AGI, including large language models and multi-modality models. We aim to:

develop high-quality libraries that reduce the difficulty of evaluation
provide convincing leaderboards that improve the understanding of large models
create powerful toolchains targeting a variety of abilities and tasks
build solid benchmarks to support large model research

spaces 10

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Compass Academic Leaderboard

MMBench Leaderboard

Open LMM Reasoning Leaderboard

A Leaderboard that demonstrates LMM reasoning capabilities

Open VLM Video Leaderboard

VLMEvalKit evaluation results on video understanding benchmarks

CompassJudger Subjective Evaluation Leaderboard

models 8

opencompass/anah-v2

Text Generation • Updated Dec 11, 2024 • 13 • 2

opencompass/CompassJudger-1-14B-Instruct

Text Generation • Updated Oct 30, 2024 • 124 • 2

opencompass/CompassJudger-1-32B-Instruct

Text Generation • Updated Oct 30, 2024 • 151 • 14

opencompass/CompassJudger-1-1.5B-Instruct

Updated Oct 22, 2024 • 24 • 1

opencompass/CompassJudger-1-7B-Instruct

Updated Oct 22, 2024 • 3.13k • 6

opencompass/anah-7b

Text Generation • Updated Jul 3, 2024 • 11

opencompass/anah-20b

Text Generation • Updated Jul 3, 2024 • 3

opencompass/mixtral-8x7b-32k

Updated Dec 10, 2023 • 1

datasets 8

opencompass/LiveMathBench

Viewer • Updated 11 days ago • 283 • 187 • 4

opencompass/mmmlu_lite

Viewer • Updated Nov 1, 2024 • 20k • 45 • 2

opencompass/MMBench-Video

Preview • Updated Oct 9, 2024 • 351 • 7

opencompass/NeedleBench

Viewer • Updated Jul 26, 2024 • 524 • 522 • 3

opencompass/anah

Viewer • Updated Jul 3, 2024 • 783 • 70 • 2

opencompass/flames

Viewer • Updated Apr 22, 2024 • 537 • 43

opencompass/CriticBench

Updated Feb 23, 2024 • 135 • 4

opencompass/MMBench

Updated Sep 13, 2023 • 31 • 1