Massive Text Embedding Benchmark

non-profit

https://github.com/embeddings-benchmark

embeddings-benchmark

AI & ML interests

Massive Text Embeddings Benchmark

Recent Activity

orionweller updated a Space about 3 hours ago

mteb/leaderboard

Muennighoff updated a dataset about 5 hours ago

mteb/arena-results

KennethEnevoldsen new activity 2 days ago

mteb/IndicQARetrieval:Librarian Bot: Add language metadata for dataset

View all activity

mteb's activity

orionweller

updated a Space about 3 hours ago

Running on CPU Upgrade

MTEB Leaderboard

Muennighoff

updated a dataset about 5 hours ago

mteb/arena-results

Viewer • Updated about 5 hours ago • 2.98k • 1.92k • 1

KennethEnevoldsen

in mteb/IndicQARetrieval 2 days ago

Librarian Bot: Add language metadata for dataset

#2 opened 2 days ago by

orionweller

updated a dataset 2 days ago

mteb/results

Updated 2 days ago • 2.26k • 1

Muennighoff

updated a Space 2 days ago

MTEB Arena

orionweller

authored 7 papers 6 days ago

NevIR: Negation in Neural Information Retrieval

Paper • 2305.07614 • Published May 12, 2023

Learning from Task Descriptions

Paper • 2011.08115 • Published Nov 16, 2020

MegaWika: Millions of reports and their sources across 50 diverse languages

Paper • 2307.07049 • Published Jul 13, 2023

Defending Against Poisoning Attacks in Open-Domain Question Answering

Paper • 2212.10002 • Published Dec 20, 2022

Learning to Reason via Program Generation, Emulation, and Search

Paper • 2405.16337 • Published May 25

CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation

Paper • 2406.17186 • Published Jun 24 • 1

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17 • 21

tomaarsen

authored a paper 6 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 7 days ago • 103

orionweller

authored a paper 6 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 7 days ago • 103

swj0419

authored a paper 19 days ago

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published 23 days ago • 21

rbroc

authored a paper 20 days ago

Automated speech- and text-based classification of neuropsychiatric conditions in a multidiagnostic setting

Paper • 2301.06916 • Published Jan 13, 2023

mmhamdy

authored a paper 20 days ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published 21 days ago • 12

rbroc

authored 3 papers 20 days ago

Large language models surpass human experts in predicting neuroscience results

Paper • 2403.03230 • Published Mar 4 • 4

$S^3$ -- Semantic Signal Separation

Paper • 2406.09556 • Published Jun 13

Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions

Paper • 2409.15323 • Published Sep 6 • 2