ArthurCamara commited on
Commit
3e6076b
1 Parent(s): b24c154

Update Readme to add blog post and NanoBEIR

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -4973,10 +4973,11 @@ tags:
4973
  ## Zeta-Alpha-E5-Mistral
4974
 
4975
  We introduce Zeta Alpha's first public embedding model, a retrieval-specialized, 7B parameter embedding model trained on top of [E5-mistral-7b-instruct](https://huggingface.co/intfloat/e5-mistral-7b-instruct).
4976
- This model marks the first published model from Zeta Alpha's open science embedding models. The full dataset and detailed training recipe will be made available (very) soon.
4977
 
4978
- We will also make available our evaluation dataset (Called `Mini` and `Nano` versions of `BEIR`), which allowed us to quickly evaluate the model's performance while training.
4979
 
 
4980
 
4981
  ### Lora Weights
4982
 
 
4973
  ## Zeta-Alpha-E5-Mistral
4974
 
4975
  We introduce Zeta Alpha's first public embedding model, a retrieval-specialized, 7B parameter embedding model trained on top of [E5-mistral-7b-instruct](https://huggingface.co/intfloat/e5-mistral-7b-instruct).
4976
+ This model marks the first published model from Zeta Alpha's open science embedding models.
4977
 
4978
+ Check out our blog post for a complete breakdown of the training set we used and all the training details: [Zeta Alpha blog](https://www.zeta-alpha.com/post/fine-tuning-an-llm-for-state-of-the-art-retrieval-zeta-alpha-s-top-10-submission-to-the-the-mteb-be)
4979
 
4980
+ We are also making available our internal evaluation set, called [NanoBEIR](https://huggingface.co/collections/zeta-alpha-ai/nanobeir-66e1a0af21dfd93e620cd9f6), a collection of Nano (i.e., 50 queries+~10k documents) per BEIR dataset.
4981
 
4982
  ### Lora Weights
4983