724 58 257

Younes Belkada

ybelkada

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 98

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 21

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 65

Organizations

Posts 4

Post

2754

Falcon Mamba now available now in llama.cpp !
Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a

Post

3568

FalconMamba 7B - a new model from TII (Technology Innovation Institute) is out !

- Blogpost: https://huggingface.co/blog/falconmamba
- Link to collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Link to playground: tiiuae/falcon-mamba-playground

View all posts

Collections 1

Papers 8

spaces 25

Sleeping

🦙

GGUF My Repo

No application file

👀

Test Zero

Sleeping

🐠

Dlai Test 2

No application file

🚀

Blip Imagecaptioning Dlai

Running

⚡

Open Source List Models

Runtime error

🌖

Llava 1.5 Dlai

models 143

ybelkada/tiny-random-T5ForConditionalGeneration-calibrated

Text2Text Generation • Updated 4 days ago • 1.25M

ybelkada/t5-11b-sharded

Translation • Updated 22 days ago • 42 • 1

ybelkada/mpt-7b-bf16-sharded

Text Generation • Updated 26 days ago • 34

ybelkada/gpt-j-6b-sharded-bf16

Text Generation • Updated Nov 10 • 795 • 2

ybelkada/t5-3b-sharded

Text2Text Generation • Updated Oct 26 • 89 • 1

ybelkada/test-gguf-trainer-Q8_0-GGUF

Updated May 28 • 1

ybelkada/test-gguf-trainer

Text Generation • Updated May 28 • 13 • 1

ybelkada/tiny-random-llama-Q6_K-GGUF

Updated May 28 • 4

ybelkada/test-gguf-trainer-Q4_K_M-GGUF

Updated May 27 • 3

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22 • 3

datasets 12

ybelkada/model_cards_correct_tag

Viewer • Updated Mar 19 • 54 • 48

ybelkada/model-info-library-name

Updated Jan 23 • 16

ybelkada/test-model-info-library-name

Viewer • Updated Jan 23 • 1 • 44

ybelkada/documentation-images

Viewer • Updated Jan 19 • 2 • 40.1k

ybelkada/oasst1-tiny-subset

Viewer • Updated May 11, 2023 • 44.1k • 49 • 2

ybelkada/oasst1

Viewer • Updated May 11, 2023 • 44.1k • 52 • 1

ybelkada/food101-tiny

Viewer • Updated May 5, 2023 • 100 • 41

ybelkada/test-onepiece-dataset

Viewer • Updated May 5, 2023 • 10 • 43

ybelkada/common_voice_mr_11_0_copy

Viewer • Updated Apr 4, 2023 • 10.8k • 238

ybelkada/english_quotes_copy

Viewer • Updated Apr 4, 2023 • 2.51k • 6.3k

Younes Belkada

AI & ML interests

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem