MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published 10 days ago • 31
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co./datasets?other=sentence-transformers • 68 items • Updated 3 days ago • 109