-
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Paper • 2402.12030 • Published -
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 3.51M • • 2.62k -
meta-llama/Llama-2-7b-chat-hf
Text Generation • Updated • 1.35M • 4.16k -
EleutherAI/pythia-160m-deduped
Text Generation • Updated • 37.1k • 3
Nicolas-BZRD
Nicolas-BZRD
AI & ML interests
PhD Student | NLP - LLMs - Adaptation real-world problem
Optimization
Recent Activity
liked
a model
26 days ago
answerdotai/ModernBERT-base
liked
a Space
about 1 month ago
burtenshaw/recap
Organizations
Collections
1
models
92
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss
Text2Text Generation
•
Updated
•
6
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher
Text2Text Generation
•
Updated
•
7
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
•
8
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
7
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
•
5
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
5
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
•
6
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
4
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss
Text Generation
•
Updated
•
13
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher
Text Generation
•
Updated
•
7
datasets
33
Nicolas-BZRD/gsm8k-ar-Qwen2-72B-Instruct
Viewer
•
Updated
•
7.47k
•
43
Nicolas-BZRD/gsm8k-ar-Meta-Llama-3.1-70B-Instruct
Viewer
•
Updated
•
7.47k
•
37
Nicolas-BZRD/gsm8k-ar-gemma-2-27b-it
Viewer
•
Updated
•
7.47k
•
40
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k
Viewer
•
Updated
•
50.5k
•
38
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-squad
Viewer
•
Updated
•
87.6k
•
42
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad
Viewer
•
Updated
•
87.6k
•
41
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-dialogsum
Viewer
•
Updated
•
12.4k
•
40
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-qed
Viewer
•
Updated
•
7.62k
•
44
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-FairytaleQA
Viewer
•
Updated
•
9.57k
•
52
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-FairytaleQA
Viewer
•
Updated
•
9.57k
•
40