Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
158
8
20
Shivalika Singh
shivi
Follow
thesagardevkate's profile picture
emrullahsekerci's profile picture
surajp's profile picture
76 followers
·
32 following
shivalikasingh95
AI & ML interests
None yet
Recent Activity
reacted
to
dvilasuero
's
post
with ❤️
4 days ago
🌐 Announcing Global-MMLU: an improved MMLU Open dataset with evaluation coverage across 42 languages, built with Argilla and the Hugging Face community. Global-MMLU is the result of months of work with the goal of advancing Multilingual LLM evaluation. It's been an amazing open science effort with collaborators from Cohere For AI, Mila - Quebec Artificial Intelligence Institute, EPFL, Massachusetts Institute of Technology, AI Singapore, National University of Singapore, KAIST, Instituto Superior Técnico, Carnegie Mellon University, CONICET, and University of Buenos Aires. 🏷️ +200 contributors used Argilla MMLU questions where regional, dialect, or cultural knowledge was required to answer correctly. 85% of the questions required Western-centric knowledge! Thanks to this annotation process, the open dataset contains two subsets: 1. 🗽 Culturally Agnostic: no specific regional, cultural knowledge is required. 2. ⚖️ Culturally Sensitive: requires dialect, cultural knowledge or geographic knowledge to answer correctly. Moreover, we provide high quality translations of 25 out of 42 languages, thanks again to the community and professional annotators leveraging Argilla on the Hub. I hope this will ensure a better understanding of the limitations and challenges for making open AI useful for many languages. Dataset: https://huggingface.co./datasets/CohereForAI/Global-MMLU
reacted
to
dvilasuero
's
post
with 🔥
4 days ago
🌐 Announcing Global-MMLU: an improved MMLU Open dataset with evaluation coverage across 42 languages, built with Argilla and the Hugging Face community. Global-MMLU is the result of months of work with the goal of advancing Multilingual LLM evaluation. It's been an amazing open science effort with collaborators from Cohere For AI, Mila - Quebec Artificial Intelligence Institute, EPFL, Massachusetts Institute of Technology, AI Singapore, National University of Singapore, KAIST, Instituto Superior Técnico, Carnegie Mellon University, CONICET, and University of Buenos Aires. 🏷️ +200 contributors used Argilla MMLU questions where regional, dialect, or cultural knowledge was required to answer correctly. 85% of the questions required Western-centric knowledge! Thanks to this annotation process, the open dataset contains two subsets: 1. 🗽 Culturally Agnostic: no specific regional, cultural knowledge is required. 2. ⚖️ Culturally Sensitive: requires dialect, cultural knowledge or geographic knowledge to answer correctly. Moreover, we provide high quality translations of 25 out of 42 languages, thanks again to the community and professional annotators leveraging Argilla on the Hub. I hope this will ensure a better understanding of the limitations and challenges for making open AI useful for many languages. Dataset: https://huggingface.co./datasets/CohereForAI/Global-MMLU
updated
a dataset
5 days ago
CohereForAI/Global-MMLU-Lite
View all activity
Articles
A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality
Oct 24
•
60
Universal Image Segmentation with Mask2Former and OneFormer
Jan 19, 2023
•
9
Organizations
shivi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
CohereForAI/Global-MMLU
14 days ago
Quality Issues with JA dev set
1
#5 opened 15 days ago by
akkikiki
Librarian Bot: Add language metadata for dataset
#3 opened 18 days ago by
librarian-bot
Include argilla tag
2
#2 opened 18 days ago by
dvilasuero
Duplicates for NL
2
#4 opened 17 days ago by
BramVanroy
New activity in
CohereForAI/Global-MMLU
19 days ago
[bot] Conversion to Parquet
#1 opened 19 days ago by
parquet-converter
New activity in
CohereForAI/aya_expanse
about 2 months ago
refactor: simplify text validation
2
#2 opened about 2 months ago by
takarajordan
Single T4
5
#1 opened about 2 months ago by
Taylor658
New activity in
CohereForAI/aya-expanse-8b
about 2 months ago
ValueError: Tokenizer class CohereTokenizer does not exist or is not currently imported.
3
#9 opened about 2 months ago by
lucasjin
how did you use merging?
1
#7 opened about 2 months ago by
Hanaimei
First impression: Impressed
1
#2 opened 2 months ago by
urtuuuu
New activity in
CohereForAI/aya-expanse-32b
about 2 months ago
Context Length Set to 8K?
2
#4 opened about 2 months ago by
Mazyod
GQA?
1
#3 opened 2 months ago by
PlatonicSkeptic
New activity in
CohereForAI/aya-expanse-8b
about 2 months ago
Thank you for implement Czech language!
2
#11 opened about 2 months ago by
Frostymisa
New activity in
CohereForAI/aya-expanse-32b
about 2 months ago
Comparing with Command R-plus-08-2024
2
#2 opened 2 months ago by
muratowski
New activity in
CohereForAI/aya-expanse-8b
about 2 months ago
Adding Evaluation Results
#3 opened 2 months ago by
leaderboard-pr-bot
Any chance of a 1B/2B/3B/4B model?
2
#5 opened 2 months ago by
ZeroWw
apply_chat_template with tools not working
1
#4 opened 2 months ago by
inkor
Models appears to be 'too safe', likes to lecture.
1
#1 opened 2 months ago by
QuantPanda
New activity in
CohereForAI/aya-expanse-32b
2 months ago
Wrong "max_position_embeddings" value?
2
#1 opened 2 months ago by
AaronFeng753
New activity in
CohereForAI/aya_collection_language_split
4 months ago
A lot of german examples from MLQA-en TEST dataset don't have a question with its passage
2
#5 opened 4 months ago by
mnauf
Load more