-
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Paper • 2402.12030 • Published -
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 1.06M • • 2.57k -
meta-llama/Llama-2-7b-chat-hf
Text Generation • Updated • 793k • • 3.97k -
EleutherAI/pythia-160m-deduped
Text Generation • Updated • 11.7k • 3
Nicolas-BZRD
Nicolas-BZRD
AI & ML interests
PhD Student | NLP - LLMs - Adaptation real-world problem
Optimization
Organizations
Collections
1
models
92
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss
Text2Text Generation
•
Updated
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher
Text2Text Generation
•
Updated
•
1
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
•
2
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
2
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss
Text2Text Generation
•
Updated
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher
Text2Text Generation
•
Updated
•
2
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss
Text Generation
•
Updated
•
4
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher
Text Generation
•
Updated
•
7
datasets
33
Nicolas-BZRD/gsm8k-ar-Qwen2-72B-Instruct
Viewer
•
Updated
•
7.47k
•
38
Nicolas-BZRD/gsm8k-ar-Meta-Llama-3.1-70B-Instruct
Viewer
•
Updated
•
7.47k
•
37
Nicolas-BZRD/gsm8k-ar-gemma-2-27b-it
Viewer
•
Updated
•
7.47k
•
43
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k
Viewer
•
Updated
•
50.5k
•
62
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-squad
Viewer
•
Updated
•
87.6k
•
48
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad
Viewer
•
Updated
•
87.6k
•
38
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-dialogsum
Viewer
•
Updated
•
12.4k
•
38
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-qed
Viewer
•
Updated
•
7.62k
•
52
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-FairytaleQA
Viewer
•
Updated
•
9.57k
•
47
Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-FairytaleQA
Viewer
•
Updated
•
9.57k
•
49