- Qualitatively characterizing neural network optimization problems
  Paper • 1412.6544 • Published • 4
- Convergent Learning: Do different neural networks learn the same representations?
  Paper • 1511.07543 • Published • 2
- Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
  Paper • 1909.11299 • Published • 1
- Model Fusion via Optimal Transport
  Paper • 1910.05653 • Published • 1
Collections
Collections including paper arxiv:2403.13187
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 82
- NEFTune: Noisy Embeddings Improve Instruction Finetuning
  Paper • 2310.05914 • Published • 13
- SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
  Paper • 2312.15166 • Published • 55
- Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon
  Paper • 2401.03462 • Published • 25

- Evolutionary Optimization of Model Merging Recipes
  Paper • 2403.13187 • Published • 44
- Gemma: Open Models Based on Gemini Research and Technology
  Paper • 2403.08295 • Published • 41
- ViTAR: Vision Transformer with Any Resolution
  Paper • 2403.18361 • Published • 48
- Jamba: A Hybrid Transformer-Mamba Language Model
  Paper • 2403.19887 • Published • 98

- Evolutionary Optimization of Model Merging Recipes
  Paper • 2403.13187 • Published • 44
- Model Stock: All we need is just a few fine-tuned models
  Paper • 2403.19522 • Published • 9
- Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
  Paper • 2405.01535 • Published • 79