Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 18 days ago • 43
McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised Sentence Similarity • Updated Apr 30, 2024 • 19.4k • 48
Robust Multi-bit Text Watermark with LLM-based Paraphrasers Paper • 2412.03123 • Published Dec 4, 2024 • 5