Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
216
93
630
Maxime Labonne
PRO
mlabonne
Follow
dhananjay2912's profile picture
CultriX's profile picture
gokyo's profile picture
1692 followers
·
60 following
https://mlabonne.github.io/blog
maximelabonne
mlabonne
AI & ML interests
Post-training, model editing, quantization
Articles
Fine-tune Llama 3 with ORPO
about 1 month ago
•
181
Create Mixtures of Experts with MergeKit
Mar 28
•
9
Merge Large Language Models with mergekit
Jan 9
•
20
Organizations
mlabonne
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mlabonne/orpo-dpo-mix-40k
1 day ago
suggestion
1
#4 opened 2 days ago by
DeepMount00
New activity in
mlabonne/model-family-tree
3 days ago
Doesn't create a tree for some pages
3
#2 opened 4 days ago by
xzuyn
New activity in
mlabonne/FrankenLlama-3-12B-Instruct
9 days ago
Сan you increase LLAMA3 8b simply by duplicating some layers?
3
#2 opened 10 days ago by
Regrin
New activity in
mlabonne/chessllm
13 days ago
chess
1
#2 opened 14 days ago by
LeroyDyer
New activity in
ucalyptus/prem-615M-chat
13 days ago
How to up-merge?
1
#1 opened 14 days ago by
ucalyptus
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
15 days ago
fix snippet
1
#8 opened 15 days ago by
philschmid
fine-tuning is needed after self-merging?
1
#7 opened 15 days ago by
oodgnas
Why did you convert to float16 and not bfloat16?
1
#6 opened 15 days ago by
PhilipMay
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
16 days ago
How to score the creative writing
1
#5 opened 16 days ago by
zhouzr
New activity in
mlabonne/FrankenLlama-3-12B-Instruct
16 days ago
How good is this model?
1
#1 opened 16 days ago by
Regrin
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
17 days ago
Attention, stupid question
2
#4 opened 17 days ago by
Debich
New activity in
mlabonne/Meta-Llama-3-225B-Instruct
18 days ago
mergekit config pls :)
4
#1 opened 18 days ago by
ehartford
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
18 days ago
Would love to try a quantized version!
27
#2 opened 21 days ago by
dillfrescott
Mention?
1
#3 opened 19 days ago by
ehartford
New activity in
mlabonne/NeuralMonarch-7B
20 days ago
Could you please share the merging config with us?
1
#3 opened 20 days ago by
PhilipMay
New activity in
mlabonne/AlphaMonarch-7B
20 days ago
Could you please share the merging config with us?
1
#7 opened 20 days ago by
PhilipMay
New activity in
HuggingFaceH4/open_llm_leaderboard
20 days ago
Resubmit mlabonne/OrpoLlama-3-8B
7
#725 opened 22 days ago by
mlabonne
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
21 days ago
C_H_U_N_K_Y-L_L_A_M_A
1
#1 opened 21 days ago by
rombodawg
New activity in
Muhammad2003/OrpoLlama3-8B
22 days ago
Base model
3
#1 opened 22 days ago by
mlabonne
New activity in
lilacai/lilac
26 days ago
Runtime error
#2 opened 26 days ago by
mlabonne
New activity in
mlabonne/arena-preferences
26 days ago
Librarian Bot: Add language metadata for dataset
#2 opened 26 days ago by
librarian-bot
New activity in
mlabonne/ChimeraLlama-3-8B-v2
29 days ago
Any plans on uploading the model itself?
2
#2 opened 29 days ago by
bartowski
New activity in
mlabonne/ChimeraLlama-3-8B
29 days ago
Create generation_config.json
1
#1 opened 29 days ago by
bartowski
New activity in
mlabonne/ChimeraLlama-3-8B-v2
29 days ago
for your consideration
4
#1 opened 30 days ago by
LaferriereJC
New activity in
mlabonne/arena-preferences
about 1 month ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
flytech/python-codes-25k
about 1 month ago
Question about dataset generation
4
#3 opened about 1 month ago by
mlabonne
New activity in
mlabonne/OrpoLlama-3-8B
about 1 month ago
Repetition from tuning via https://huggingface.co/blog/mlabonne/orpo-llama-3
4
#2 opened about 1 month ago by
Satya93
New activity in
mlabonne/Llama-3-SLERP-8B
about 1 month ago
What's the purpose of this?
4
#1 opened about 1 month ago by
xms991
New activity in
mlabonne/OrpoLlama-3-8B
about 1 month ago
Update README.md
2
#3 opened about 1 month ago by
hadraoui
Looking forward to full release!
4
#1 opened about 1 month ago by
bartowski
New activity in
mlabonne/orpo-dpo-mix-40k
about 1 month ago
Suggestion
1
#3 opened about 1 month ago by
neovalle
Great job!
3
#2 opened about 1 month ago by
alvarobartt
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
mlabonne/chatml_dpo_pairs
about 1 month ago
Add DPO tag
1
#2 opened about 1 month ago by
davanstrien
New activity in
mlabonne/Yet_Another_LLM_Leaderboard
about 1 month ago
The like metric values are not correct...
1
#11 opened about 1 month ago by
zhiminy
New activity in
automerger/YamshadowExperiment28-7B
about 1 month ago
Update README.md
#3 opened about 1 month ago by
mlabonne
Update README.md
#2 opened about 1 month ago by
mlabonne
Update README.md
#1 opened about 1 month ago by
mlabonne
New activity in
mlabonne/NeuralHermes-2.5-Mistral-7B
about 1 month ago
W&B Link Returns 404
2
#10 opened about 1 month ago by
ZennyKenny
New activity in
mlabonne/NeuralBeagle14-7B
about 2 months ago
Adding Evaluation Results
#10 opened about 2 months ago by
dragonSwing
New activity in
mlabonne/Zebrafish-7B
about 2 months ago
ty!
1
#1 opened about 2 months ago by
gate369
New activity in
mlabonne/UltraMerge-7B
about 2 months ago
Dataset
2
#2 opened about 2 months ago by
mrfakename
License
2
#3 opened about 2 months ago by
mrfakename
New activity in
mlabonne/Jambalpaca-v0.1
about 2 months ago
Jamba Notebook
2
#1 opened about 2 months ago by
Severian
New activity in
mlabonne/AlphaMonarch-7B-2bit-HQQ
about 2 months ago
Amazing model
4
#1 opened about 2 months ago by
CatUkraine
New activity in
mlabonne/UltraMerge-7B
about 2 months ago
🚩 Report
3
#1 opened about 2 months ago by
electroglyph
New activity in
mlabonne/ultrafeedback-binarized-preferences-cleaned
about 2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by
librarian-bot
New activity in
macadeliccc/Mistral-7B-v0.2-OpenHermes
about 2 months ago
Evaluation
1
#1 opened about 2 months ago by
mlabonne
New activity in
mlabonne/Beyonder-4x7B-v3
about 2 months ago
AQLM version please
2
#2 opened about 2 months ago by
AiModelsMarket
About Moe vocab extended model with non vocab extended model
1
#3 opened about 2 months ago by
ancv
New activity in
mlabonne/Beyonder-4x7B-v3-GGUF
about 2 months ago
Excellent work on this, sir!
3
#2 opened 2 months ago by
dillfrescott
New activity in
mlabonne/Beyonder-4x7B-v3
2 months ago
Add Exl2 quant link
2
#1 opened 2 months ago by
bartowski
New activity in
mlabonne/Beyonder-4x7B-v3-GGUF
2 months ago
Update README.md
2
#1 opened 2 months ago by
kgourgou
New activity in
mlabonne/FrankenMonarch-7B
2 months ago
Why merge the same model 5 times?
2
#1 opened 2 months ago by
UniversalLove333
add GGUF link
#2 opened 2 months ago by
seyf1elislam
New activity in
mlabonne/AutoMerger
2 months ago
I had a similar idea recently
2
#5 opened 2 months ago by
CultriX
New activity in
mlabonne/Yet_Another_LLM_Leaderboard
2 months ago
Reasoning behind including TruthfulQA?
1
#10 opened 2 months ago by
Phil337
New activity in
mlabonne/AutoMerger
2 months ago
allow multiple people to access automerger at once
2
#6 opened 2 months ago by
mrfakename
New activity in
mlabonne/llm-auto-eval
2 months ago
Multiple GPU's
2
#3 opened 2 months ago by
CultriX
New activity in
mlabonne/gemma-7b-it-GGUF
2 months ago
Failed to load
5
#3 opened 3 months ago by
Priderock
Load more