Younes Belkada

ybelkada

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Recent Activity

New activity 4 days ago

ybelkada/t5-11b-sharded:Adding `safetensors` variant of this model

New activity 8 days ago

ybelkada/mpt-7b-bf16-sharded:Adding `safetensors` variant of this model

New activity 10 days ago

mlx-community/falcon-mamba-7b-bf16:Upload folder using huggingface_hub

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 93

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 20

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 63

Posts 4

Post

2582

Falcon Mamba is now available in llama.cpp!
Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a

Post

3398

FalconMamba 7B - a new model from TII (Technology Innovation Institute) is out!

- Blogpost: https://huggingface.co/blog/falconmamba
- Link to collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Link to playground: tiiuae/falcon-mamba-playground

Collections 1

Papers 8

Spaces 25

Running

🦙

GGUF My Repo

No application file

👀

Test Zero

Sleeping

🐠

Dlai Test 2

No application file

🚀

Blip Imagecaptioning Dlai

Running

⚡

Open Source List Models

Running on Zero

🌖

Llava 1.5 Dlai

Models 143

ybelkada/t5-11b-sharded

Translation • Updated 4 days ago • 47 • 1

ybelkada/mpt-7b-bf16-sharded

Text Generation • Updated 8 days ago • 54

ybelkada/gpt-j-6b-sharded-bf16

Text Generation • Updated 15 days ago • 6.92k • 2

ybelkada/t5-3b-sharded

Text2Text Generation • Updated 30 days ago • 51 • 1

ybelkada/test-gguf-trainer-Q8_0-GGUF

Updated May 28 • 5

ybelkada/test-gguf-trainer

Text Generation • Updated May 28 • 8 • 1

ybelkada/tiny-random-llama-Q6_K-GGUF

Updated May 28 • 4

ybelkada/test-gguf-trainer-Q4_K_M-GGUF

Updated May 27 • 8

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22 • 3

ybelkada/tiny-random-llama

Text Generation • Updated May 22 • 13

Datasets 12

ybelkada/model_cards_correct_tag

Viewer • Updated Mar 19 • 54 • 42

ybelkada/model-info-library-name

Updated Jan 23 • 4

ybelkada/test-model-info-library-name

Viewer • Updated Jan 23 • 1 • 46

ybelkada/documentation-images

Viewer • Updated Jan 19 • 2 • 39.1k

ybelkada/oasst1-tiny-subset

Viewer • Updated May 11, 2023 • 44.1k • 43 • 2

ybelkada/oasst1

Viewer • Updated May 11, 2023 • 44.1k • 48 • 1

ybelkada/food101-tiny

Viewer • Updated May 5, 2023 • 100 • 38

ybelkada/test-onepiece-dataset

Viewer • Updated May 5, 2023 • 10 • 46

ybelkada/common_voice_mr_11_0_copy

Viewer • Updated Apr 4, 2023 • 10.8k • 237

ybelkada/english_quotes_copy

Viewer • Updated Apr 4, 2023 • 2.51k • 3.94k

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem