Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
43.9
TFLOPS
218
95
647
Maxime Labonne
PRO
mlabonne
Follow
zhenweiding's profile picture
AiModelsMarket's profile picture
saikirangorthi's profile picture
1753 followers
·
60 following
https://mlabonne.github.io/blog
maximelabonne
mlabonne
AI & ML interests
Post-training, model editing, quantization
Articles
Fine-tune Llama 3 with ORPO
Apr 22
•
191
Create Mixtures of Experts with MergeKit
Mar 28
•
9
Merge Large Language Models with mergekit
Jan 9
•
21
Organizations
mlabonne
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mlabonne/Daredevil-8B
2 days ago
Adding Evaluation Results
#1 opened 2 days ago by
leaderboard-pr-bot
New activity in
mlabonne/OrpoLlama-3-8B
3 days ago
Model does not stop generating new tokens.
5
#4 opened 3 days ago by
MuntasirHossain
New activity in
mlabonne/orpo-dpo-mix-40k
7 days ago
suggestion
1
#4 opened 8 days ago by
DeepMount00
New activity in
mlabonne/model-family-tree
9 days ago
Doesn't create a tree for some pages
3
#2 opened 10 days ago by
xzuyn
New activity in
mlabonne/FrankenLlama-3-12B-Instruct
15 days ago
Сan you increase LLAMA3 8b simply by duplicating some layers?
3
#2 opened 16 days ago by
Regrin
New activity in
mlabonne/chessllm
19 days ago
chess
1
#2 opened 20 days ago by
LeroyDyer
New activity in
ucalyptus/prem-615M-chat
19 days ago
How to up-merge?
1
#1 opened 20 days ago by
ucalyptus
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
21 days ago
fix snippet
1
#8 opened 21 days ago by
philschmid
fine-tuning is needed after self-merging?
1
#7 opened 21 days ago by
oodgnas
Why did you convert to float16 and not bfloat16?
1
#6 opened 21 days ago by
PhilipMay
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
22 days ago
How to score the creative writing
1
#5 opened 22 days ago by
zhouzr
New activity in
mlabonne/FrankenLlama-3-12B-Instruct
22 days ago
How good is this model?
1
#1 opened 22 days ago by
Regrin
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
23 days ago
Attention, stupid question
2
#4 opened 23 days ago by
Debich
New activity in
mlabonne/Meta-Llama-3-225B-Instruct
24 days ago
mergekit config pls :)
4
#1 opened 24 days ago by
ehartford
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
24 days ago
Would love to try a quantized version!
27
#2 opened 27 days ago by
dillfrescott
Mention?
1
#3 opened 24 days ago by
ehartford
New activity in
mlabonne/NeuralMonarch-7B
26 days ago
Could you please share the merging config with us?
1
#3 opened 26 days ago by
PhilipMay
New activity in
mlabonne/AlphaMonarch-7B
26 days ago
Could you please share the merging config with us?
1
#7 opened 26 days ago by
PhilipMay
New activity in
open-llm-leaderboard/open_llm_leaderboard
26 days ago
Resubmit mlabonne/OrpoLlama-3-8B
7
#725 opened 28 days ago by
mlabonne
New activity in
mlabonne/Meta-Llama-3-120B-Instruct
27 days ago
C_H_U_N_K_Y-L_L_A_M_A
1
#1 opened 27 days ago by
rombodawg
New activity in
Muhammad2003/OrpoLlama3-8B
28 days ago
Base model
3
#1 opened 28 days ago by
mlabonne
New activity in
lilacai/lilac
about 1 month ago
Runtime error
#2 opened about 1 month ago by
mlabonne
New activity in
mlabonne/arena-preferences
about 1 month ago
Librarian Bot: Add language metadata for dataset
#2 opened about 1 month ago by
librarian-bot
New activity in
mlabonne/ChimeraLlama-3-8B-v2
about 1 month ago
Any plans on uploading the model itself?
2
#2 opened about 1 month ago by
bartowski
New activity in
mlabonne/ChimeraLlama-3-8B
about 1 month ago
Create generation_config.json
1
#1 opened about 1 month ago by
bartowski
New activity in
mlabonne/ChimeraLlama-3-8B-v2
about 1 month ago
for your consideration
4
#1 opened about 1 month ago by
LaferriereJC
New activity in
mlabonne/arena-preferences
about 1 month ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
flytech/python-codes-25k
about 1 month ago
Question about dataset generation
4
#3 opened about 1 month ago by
mlabonne
New activity in
mlabonne/OrpoLlama-3-8B
about 1 month ago
Repetition from tuning via https://huggingface.co/blog/mlabonne/orpo-llama-3
4
#2 opened about 1 month ago by
Satya93
New activity in
mlabonne/Llama-3-SLERP-8B
about 1 month ago
What's the purpose of this?
4
#1 opened about 1 month ago by
xms991
New activity in
mlabonne/OrpoLlama-3-8B
about 1 month ago
Update README.md
2
#3 opened about 1 month ago by
hadraoui
Looking forward to full release!
4
#1 opened about 1 month ago by
bartowski
New activity in
mlabonne/orpo-dpo-mix-40k
about 1 month ago
Suggestion
1
#3 opened about 1 month ago by
neovalle
Great job!
3
#2 opened about 1 month ago by
alvarobartt
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
mlabonne/chatml_dpo_pairs
about 2 months ago
Add DPO tag
1
#2 opened about 2 months ago by
davanstrien
New activity in
mlabonne/Yet_Another_LLM_Leaderboard
about 2 months ago
The like metric values are not correct...
1
#11 opened about 2 months ago by
zhiminy
New activity in
automerger/YamshadowExperiment28-7B
about 2 months ago
Update README.md
#3 opened about 2 months ago by
mlabonne
Update README.md
#2 opened about 2 months ago by
mlabonne
Update README.md
#1 opened about 2 months ago by
mlabonne
New activity in
mlabonne/NeuralHermes-2.5-Mistral-7B
about 2 months ago
W&B Link Returns 404
2
#10 opened about 2 months ago by
ZennyKenny
New activity in
mlabonne/NeuralBeagle14-7B
about 2 months ago
Adding Evaluation Results
#10 opened about 2 months ago by
dragonSwing
New activity in
mlabonne/Zebrafish-7B
about 2 months ago
ty!
1
#1 opened about 2 months ago by
gate369
New activity in
mlabonne/UltraMerge-7B
about 2 months ago
Dataset
2
#2 opened about 2 months ago by
mrfakename
License
2
#3 opened about 2 months ago by
mrfakename
New activity in
mlabonne/Jambalpaca-v0.1
about 2 months ago
Jamba Notebook
2
#1 opened about 2 months ago by
Severian
New activity in
mlabonne/AlphaMonarch-7B-2bit-HQQ
about 2 months ago
Amazing model
4
#1 opened about 2 months ago by
CatUkraine
New activity in
mlabonne/UltraMerge-7B
2 months ago
🚩 Report
3
#1 opened 2 months ago by
electroglyph
New activity in
mlabonne/ultrafeedback-binarized-preferences-cleaned
2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened 2 months ago by
librarian-bot
New activity in
macadeliccc/Mistral-7B-v0.2-OpenHermes
2 months ago
Evaluation
1
#1 opened 2 months ago by
mlabonne
New activity in
mlabonne/Beyonder-4x7B-v3
2 months ago
AQLM version please
2
#2 opened 2 months ago by
AiModelsMarket
About Moe vocab extended model with non vocab extended model
1
#3 opened 2 months ago by
ancv
New activity in
mlabonne/Beyonder-4x7B-v3-GGUF
2 months ago
Excellent work on this, sir!
3
#2 opened 2 months ago by
dillfrescott
New activity in
mlabonne/Beyonder-4x7B-v3
2 months ago
Add Exl2 quant link
2
#1 opened 2 months ago by
bartowski
New activity in
mlabonne/Beyonder-4x7B-v3-GGUF
2 months ago
Update README.md
2
#1 opened 2 months ago by
kgourgou
New activity in
mlabonne/FrankenMonarch-7B
2 months ago
Why merge the same model 5 times?
2
#1 opened 2 months ago by
UniversalLove333
add GGUF link
#2 opened 2 months ago by
seyf1elislam
New activity in
mlabonne/AutoMerger
2 months ago
I had a similar idea recently
2
#5 opened 2 months ago by
CultriX
New activity in
mlabonne/Yet_Another_LLM_Leaderboard
2 months ago
Reasoning behind including TruthfulQA?
1
#10 opened 2 months ago by
Phil337
New activity in
mlabonne/AutoMerger
2 months ago
allow multiple people to access automerger at once
2
#6 opened 2 months ago by
mrfakename
Load more