752 60 279

Younes Belkada

ybelkada

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 100

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 21

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 66

Organizations

Posts 4

Post

2867

Falcon Mamba now available now in llama.cpp !
Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a

Post

3675

FalconMamba 7B - a new model from TII (Technology Innovation Institute) is out !

- Blogpost: https://huggingface.co/blog/falconmamba
- Link to collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Link to playground: tiiuae/falcon-mamba-playground

View all posts

Collections 1

Papers 8

spaces 25

Sleeping

🦙

GGUF My Repo

No application file

👀

Test Zero

Running

🐠

Dlai Test 2

No application file

🚀

Blip Imagecaptioning Dlai

Running

⚡

Open Source List Models

Runtime error

🌖

Llava 1.5 Dlai

models 143

ybelkada/tiny-random-T5ForConditionalGeneration-calibrated

Text2Text Generation • Updated 15 days ago • 1.02M

ybelkada/t5-11b-sharded

Translation • Updated Nov 21 • 19 • 1

ybelkada/mpt-7b-bf16-sharded

Text Generation • Updated Nov 17 • 29

ybelkada/gpt-j-6b-sharded-bf16

Text Generation • Updated Nov 10 • 553 • 2

ybelkada/t5-3b-sharded

Text2Text Generation • Updated Oct 26 • 84 • 1

ybelkada/test-gguf-trainer-Q8_0-GGUF

Updated May 28

ybelkada/test-gguf-trainer

Text Generation • Updated May 28 • 17 • 1

ybelkada/tiny-random-llama-Q6_K-GGUF

Updated May 28 • 7

ybelkada/test-gguf-trainer-Q4_K_M-GGUF

Updated May 27

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22 • 2

datasets 12

ybelkada/model_cards_correct_tag

Viewer • Updated Mar 19 • 54 • 45

ybelkada/model-info-library-name

Updated Jan 23 • 20

ybelkada/test-model-info-library-name

Viewer • Updated Jan 23 • 1 • 38

ybelkada/documentation-images

Viewer • Updated Jan 19 • 2 • 35.9k

ybelkada/oasst1-tiny-subset

Viewer • Updated May 11, 2023 • 44.1k • 45 • 2

ybelkada/oasst1

Viewer • Updated May 11, 2023 • 44.1k • 46 • 1

ybelkada/food101-tiny

Viewer • Updated May 5, 2023 • 100 • 38

ybelkada/test-onepiece-dataset

Viewer • Updated May 5, 2023 • 10 • 40

ybelkada/common_voice_mr_11_0_copy

Viewer • Updated Apr 4, 2023 • 10.8k • 214

ybelkada/english_quotes_copy

Viewer • Updated Apr 4, 2023 • 2.51k • 5.6k

Younes Belkada

AI & ML interests

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem