EuroBERT 🇪🇺 Scaling Multilingual Encoders for European Languages. EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 28 days ago • 75 EuroBERT/EuroBERT-210m Fill-Mask • Updated 8 days ago • 17.5k • 64 EuroBERT/EuroBERT-610m Fill-Mask • Updated 8 days ago • 7.75k • 28 EuroBERT/EuroBERT-2.1B Fill-Mask • Updated 8 days ago • 2.43k • 49
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 28 days ago • 75
LLMs Distillation The ULD loss, based on optimal transport, enables distillation across different LLM families without requiring shared tokenizers. Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Paper • 2402.12030 • Published Feb 19, 2024 mistralai/Mistral-7B-Instruct-v0.2 Text Generation • Updated Sep 27, 2024 • 3.18M • • 2.7k meta-llama/Llama-2-7b-chat-hf Text Generation • Updated Apr 17, 2024 • 1.28M • • 4.36k EleutherAI/pythia-160m-deduped Text Generation • Updated Jul 9, 2023 • 77.6k • 3
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Paper • 2402.12030 • Published Feb 19, 2024
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 2
Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 3
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 3
Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 6
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 4
Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 3
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss Text2Text Generation • Updated Feb 19, 2024 • 3
Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher Text2Text Generation • Updated Feb 19, 2024 • 2
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss Text Generation • Updated Feb 19, 2024 • 24
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher Text Generation • Updated Feb 19, 2024 • 23
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k Viewer • Updated Mar 13, 2024 • 50.5k • 44
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad Viewer • Updated Mar 13, 2024 • 87.6k • 33