Maxime Labonne PRO

mlabonne

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Recent Activity

new activity about 6 hours ago

open-r1/OpenThoughts-114k-math: 32,390 wrong math answers?

liked a dataset about 6 hours ago

mlabonne/OpenThoughts-79k-filtered

updated a dataset about 6 hours ago

mlabonne/OpenThoughts-79k-filtered

mlabonne's activity

New activity in open-r1/OpenThoughts-114k-math about 6 hours ago

32,390 wrong math answers?

#3 opened about 6 hours ago by

mlabonne

New activity in RUC-AIBOX/STILL-2 1 day ago

fix broken URL to paper

#1 opened 1 day ago by

mlabonne

New activity in mlabonne/Llama-3.1-70B-Instruct-lorablated 7 days ago

Which model to download?

#10 opened 8 days ago by

shayeryan

New activity in mlabonne/llama-2-7b-guanaco about 1 month ago

Adding `safetensors` variant of this model

#1 opened about 1 month ago by

SFconvertbot

New activity in mlabonne/FineTome-100k about 1 month ago

Any sources on methodology?

#4 opened about 1 month ago by

sadaisystems

New activity in mlabonne/codellama-2-7b about 2 months ago

Adding `safetensors` variant of this model

#5 opened about 2 months ago by

SFconvertbot

New activity in mlabonne/Yet_Another_LLM_Leaderboard about 2 months ago

Can we also submit our evaluated scores on this leaderboard

#12 opened about 2 months ago by

asharsha30

New activity in mlabonne/Meta-Llama-3-120B-Instruct 2 months ago

Is this model multimodal?

#11 opened 2 months ago by

stleandro

New activity in mlabonne/orca-agentinstruct-1M-v1-cleaned 2 months ago

Removing empty system messages

#2 opened 2 months ago by

ari9dam

New activity in mlabonne/Hermes-3-Llama-3.1-8B-lorablated 2 months ago

Update README.md

#2 opened 2 months ago by

Alex283qhrba

New activity in mlabonne/dummy-llama-2 3 months ago

Adding `safetensors` variant of this model

#1 opened 3 months ago by

SFconvertbot

New activity in mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated 3 months ago

Help Replicating

#15 opened 3 months ago by

edwards98

New activity in mlabonne/orpo-dpo-mix-40k 4 months ago

Update README.md

#7 opened 4 months ago by

DeepMount00

New activity in openai/MMMLU 4 months ago

Fix typo ZH_CH to ZH_CN

#14 opened 4 months ago by

mlabonne

New activity in mlabonne/Hermes-3-Llama-3.1-70B-lorablated 4 months ago

Adding Evaluation Results

#1 opened 4 months ago by

leaderboard-pr-bot

New activity in mlabonne/UltraLlama-3.1-8B 4 months ago

What is magpie?

#2 opened 4 months ago by

ddh0

Is the adapter merged into the weights?

#1 opened 4 months ago by

bartowski

New activity in mlabonne/BigQwen2.5-52B-Instruct 4 months ago

Adding Evaluation Results

#1 opened 4 months ago by

leaderboard-pr-bot

New activity in mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated 4 months ago

request: Llama-3.2 abliterated

#14 opened 4 months ago by

SFBAI

New activity in mlabonne/Meta-Llama-3-120B-Instruct 4 months ago

Out of memory when using mergekit

#10 opened 4 months ago by

kongzym

Articles

The Large Language Model Course

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit
