🧩 Verbalized Rebus @ CLiC-it 2024 Collection Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" • 13 items • Updated 19 days ago • 3
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • 25 days ago • 31
view article Article Mixedbread 🤝 deepset: Announcing our New German/English Embedding Model By shadeMe • Jul 19 • 15
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • Jun 4 • 66
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17 • 2
abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 87
view article Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 24
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 135
Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA Paper • 2405.07101 • Published May 11 • 1
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 78
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 212 items • Updated about 19 hours ago • 19
About ORPO Collection Contains some information and experiments fine-tuning LLMs using 🤗 `trl.ORPOTrainer` • 8 items • Updated May 7 • 5
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 206
Flan-T5 release Collection The Flan-T5 covers 4 checkpoints of different sizes each time. It also includes upgrades versions trained using Universal sampling • 7 items • Updated 23 days ago • 18