Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
36.4
TFLOPS
15
68
346
alkinun
AtAndDev
Follow
Aryanne's profile picture
lucianosb's profile picture
thermal666's profile picture
25 followers
·
75 following
alkinun
alkinun
AI & ML interests
LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..
Recent Activity
reacted
to
singhsidhukuldeep
's
post
with 👀
about 17 hours ago
I just came across a groundbreaking paper titled "Hypencoder: Hypernetworks for Information Retrieval" by researchers from the University of Massachusetts Amherst that introduces a fundamentally new paradigm for search technology. Most current retrieval models rely on simple inner product calculations between query and document vectors, which severely limits their expressiveness. The authors prove theoretically that inner product similarity functions fundamentally constrain what types of relevance relationships can be captured. Hypencoder takes a radically different approach: instead of encoding a query as a vector, it generates a small neural network (called a "q-net") that acts as a learned relevance function. This neural network takes document representations as input and produces relevance scores. Under the hood, Hypencoder uses: - Attention-based hypernetwork layers (hyperhead layers) that transform contextualized query embeddings into weights and biases for the q-net - A document encoder that produces vector representations similar to existing models - A graph-based greedy search algorithm for efficient retrieval that can search 8.8M documents in under 60ms The results are impressive - Hypencoder significantly outperforms strong dense retrieval models on standard benchmarks like MS MARCO and TREC Deep Learning Track. The performance gap widens even further on complex retrieval tasks like tip-of-the-tongue queries and instruction-following retrieval. What makes this approach particularly powerful is that neural networks are universal approximators, allowing Hypencoder to express far more complex relevance relationships than inner product similarity functions. The framework is also flexible enough to replicate any existing neural retrieval method while adding the ability to learn query-dependent weights.
liked
a Space
5 days ago
mteb/leaderboard
replied
to
their
post
13 days ago
@nroggendorff is that you sama?
View all activity
Organizations
Posts
5
view post
Post
2380
@
nroggendorff
is that you sama?
See translation
view post
Post
1881
everywhere i go i see his face
See translation
View all Posts
spaces
3
Sort: Recently updated
Sleeping
DeepSense.ai
🐢
Bicycle and E-Bike Detection Model
Sleeping
marco-qwq-7B
💻
Sleeping
AIDC AI Marco O1
💻
Generate responses for AI chat
models
7
Sort: Recently updated
AtAndDev/marco-qwq-7B
Text Generation
•
Updated
Dec 8, 2024
•
13
AtAndDev/Ogno-Monarch-Neurotic-9B-Passthrough
Text Generation
•
Updated
Mar 1, 2024
•
14
AtAndDev/Ogno-Monarch-Neurotic-7B-Dare-Ties
Text Generation
•
Updated
Mar 1, 2024
•
19
AtAndDev/Marcoro14-7B-Slerp
Text Generation
•
Updated
Mar 1, 2024
•
10
AtAndDev/CapybaraMarcoroni-7B
Text Generation
•
Updated
Jan 7, 2024
•
1.92k
AtAndDev/ShortKing-3b-v0.2
Text Generation
•
Updated
Oct 2, 2023
•
78
•
2
AtAndDev/ShortKing-1.4b-v0.1
Text Generation
•
Updated
Sep 29, 2023
•
2.09k
•
2
datasets
12
Sort: Recently updated
AtAndDev/symbolm
Viewer
•
Updated
Jan 23
•
20k
•
74
AtAndDev/symlm
Viewer
•
Updated
Jan 16
•
10.1k
•
74
AtAndDev/chain-of-diffusion
Viewer
•
Updated
Jan 7
•
6.45k
•
75
AtAndDev/clip-bicycle-e-bike
Viewer
•
Updated
Jan 2
•
6k
•
85
AtAndDev/QwQ-LongCoT-59k-cleaned
Viewer
•
Updated
Dec 6, 2024
•
59.2k
•
129
AtAndDev/sedir-clean
Viewer
•
Updated
Dec 5, 2024
•
11.8k
•
54
AtAndDev/sedir-unclean
Viewer
•
Updated
Dec 5, 2024
•
19.9k
•
71
AtAndDev/ultrachat_200k_formatted
Viewer
•
Updated
Oct 10, 2024
•
208k
•
62
AtAndDev/MedInstruct
Viewer
•
Updated
Jul 20, 2024
•
216
•
50
AtAndDev/MedRag-textbooks-stella_en_400M_v5
Viewer
•
Updated
Jul 14, 2024
•
126k
•
56
Expand 12 datasets