Zhimin Zhao PRO

zhiminy

AI & ML interests

SE4AI, AI4SE, LLMOps, LLM4Code

Recent Activity

updated a dataset 3 days ago
SE-Arena/conversations
updated a dataset 3 days ago
SE-Arena/votes
updated a Space 3 days ago
SE-Arena/Software-Engineering-Arena

Organizations

sparse-generative-ai, Software Engineering Arena

zhiminy's activity

reacted to their post with 🤗 3 months ago
posted an update 3 months ago
reacted to their post with 😎 8 months ago
posted an update 8 months ago
reacted to their post with 👀 8 months ago
Hey everyone!

Our team just dropped something cool! 🎉 We've published a new paper on arXiv diving into foundation model leaderboards across different platforms. We analyzed the content, operational workflows, and common issues of these leaderboards. From this, we came up with two new concepts: Leaderboard Operations (LBOps) and leaderboard smells.

We also put together an awesome list with nearly 300 of the latest leaderboards, development tools, and publishing organizations. You can check it out here: https://github.com/SAILResearch/awesome-foundation-model-leaderboards

If you find it useful or interesting, give us a follow or drop a comment. We'd love to hear your thoughts and get your support! ✨

Link to the paper: https://arxiv.org/abs/2407.04065