Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
11
Adrian
adriansanz
Follow
jbarbera99's profile picture
rgaricano's profile picture
2 followers
·
24 following
AI & ML interests
None yet
Recent Activity
reacted
to
singhsidhukuldeep
's
post
with 🔥
about 17 hours ago
O1 Embedder: Transforming Retrieval Models with Reasoning Capabilities Researchers from University of Science and Technology of China and Beijing Academy of Artificial Intelligence have developed a novel retrieval model that mimics the slow-thinking capabilities of reasoning-focused LLMs like OpenAI's O1 and DeepSeek's R1. Unlike traditional embedding models that directly match queries with documents, O1 Embedder first generates thoughtful reflections about the query before performing retrieval. This two-step process significantly improves performance on complex retrieval tasks, especially those requiring intensive reasoning or zero-shot generalization to new domains. The technical implementation is fascinating: - The model integrates two essential functions: Thinking and Embedding - It uses an "Exploration-Refinement" data synthesis workflow where initial thoughts are generated by an LLM and refined by a retrieval committee - A multi-task training method fine-tunes a pre-trained LLM to generate retrieval thoughts via behavior cloning while simultaneously learning embedding capabilities through contrastive learning - Memory-efficient joint training enables both tasks to share encoding results, dramatically increasing batch size The results are impressive - O1 Embedder outperforms existing methods across 12 datasets in both in-domain and out-of-domain scenarios. For example, it achieves a 3.9% improvement on Natural Questions and a 3.0% boost on HotPotQA compared to models without thinking capabilities. This approach represents a significant paradigm shift in retrieval technology, bridging the gap between traditional dense retrieval and the reasoning capabilities of large language models. What do you think about this approach? Could "thinking before retrieval" transform how we build search systems?
updated
a Space
5 days ago
adriansanz/test
published
a Space
5 days ago
adriansanz/test
View all activity
Organizations
adriansanz
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
4 months ago
adriansanz/final_g_o_v2-tmp_balanced
Viewer
•
Updated
Nov 14, 2024
•
5.17k
•
50
•
1
liked
4 models
4 months ago
alinia/salamandra-7b-aligned-EADOP
Text Generation
•
Updated
Oct 9, 2024
•
16
•
5
luzalbaposse/HelloBERT
Text Classification
•
Updated
Jul 24, 2024
•
136
•
1
Cohere/rerank-multilingual-v3.0
Updated
Apr 11, 2024
•
12
llm-blender/PairRM
Text Generation
•
Updated
Jan 22, 2024
•
11.8k
•
196
liked
a model
5 months ago
BSC-LT/salamandra-2b
Text Generation
•
Updated
9 days ago
•
6.36k
•
22
liked
2 models
6 months ago
adriansanz/ST-tramits-sitges-002-5ep
Sentence Similarity
•
Updated
Aug 30, 2024
•
3
•
1
adriansanz/sqv-v2
Sentence Similarity
•
Updated
Sep 12, 2024
•
5
•
1
liked
2 models
7 months ago
BAAI/bge-m3
Sentence Similarity
•
Updated
Jul 3, 2024
•
2.67M
•
•
1.8k
adriansanz/fm-tc-hybrid_VIC_final
Text Classification
•
Updated
Jul 31, 2024
•
109
•
1
liked
a model
10 months ago
adriansanz/080524_15ep_02
Zero-Shot Classification
•
Updated
May 14, 2024
•
19
•
1