mfajcik commited on
Commit
ba40e06
β€’
1 Parent(s): 2c4907f

Update content.py

Browse files
Files changed (1) hide show
  1. content.py +1 -1
content.py CHANGED
@@ -20,7 +20,7 @@ Here, you can compare models on tasks in the Czech language or submit your own m
20
  - The first step is "pre-submission." After this is complete (significance tests may take up to 2 hours), you can choose to submit the results if you wish.
21
  - NEWS:
22
  - 1.10.2024: Find out more about πŸ‡¨πŸ‡Ώ BenCzechMark in our [Huggingface blogpost](https://huggingface.co/blog/benczechmark)!
23
-
24
 
25
  """
26
  LEADERBOARD_TAB_TITLE_MARKDOWN = """
 
20
  - The first step is "pre-submission." After this is complete (significance tests may take up to 2 hours), you can choose to submit the results if you wish.
21
  - NEWS:
22
  - 1.10.2024: Find out more about πŸ‡¨πŸ‡Ώ BenCzechMark in our [Huggingface blogpost](https://huggingface.co/blog/benczechmark)!
23
+ - 7.11.2024: We acknowledge that one of the Qwen2.5 models correctly predicted our (& Bigbench's) canary string. This confirms the contamination, it was trained on benchmark data. Other [studies](https://arxiv.org/pdf/2409.01790) also suggest the contamination issues of the Qwen family.
24
 
25
  """
26
  LEADERBOARD_TAB_TITLE_MARKDOWN = """