3591 29 12

Loïck BOURDOIS

lbourdois

https://lbourdois.github.io/blog/

AI & ML interests

👀

Recent Activity

updated a collection 3 days ago

French caption datasets

updated a collection 3 days ago

French VQA datasets

updated a collection 3 days ago

French VQA datasets

View all activity

Organizations

Posts 4

Post

2152

We introduce FAT5 (Flash Attention T5) ⚡

An implementation of T5 in PyTorch with UL2 objective optimized for GPGPU for both training and inference thanks to 13 different optimizations.
The main one is that we have designed a CUDA kernel to expand the Flash Attention by @tridao with RPE biases and supports other PE such as RoPE, ALiBi or FIRE.
The result kernel is 2 times faster than a SPDA implementation.
We also use Triton kernels to optimize certain parts of the architecture, such as the cross-entropy and RMSNorm layer.

The various kernels have been carefully built to be compatible with BF16 and torch.compile to go even faster and achieve efficient pretraining.

All other optimizations are described in a 📝 subsequent blog post available on @huggingface 🤗: CATIE-AQ/FAT5-report.

This methodology enabled us to efficiently pretrain as a proof of concept a FAT5 with 147M parameters in French in a reasonable time (1,461H for 419B tokens), with limited resources (1 A100 i.e. a computational budget of ~ €1,900) and a low carbon footprint (13.5kg eq CO2).

The model's weights are also available on Hugging Face: CATIE-AQ/FAT5-small.
Not very useful in practice, it's a PoC and not an instructed model (it's planned for later).

All the code is available on GitHub if you want to pretrain your own model in your own language or for a specific domain: https://github.com/catie-aq/flashT5 ⭐

Ending by indicating that was a joint project with @BorisAlbar at hf.co/CATIE-AQ.

View all Posts

Articles 3

Article

115

Introduction to State Space Models (SSM)

View all Articles

Collections 9

spaces 2

Running

Free online AI courses in French

📚

French translations of four AI courses

Sleeping

SSM Blog Posts

📝

Blog posts about State Space Models (SSM)

models

None public yet

datasets 10

Loïck BOURDOIS

AI & ML interests

Recent Activity

Organizations

Posts 4

Articles 3

Introduction to State Space Models (SSM)

Collections 9

Free online AI courses in French

lbourdois/en-fr-nyu-dl-course-corpus

SSM Blog Posts

FAT5 (Flash Attention T5) report

Le FAT5 : Flash Attention T5

CATIE-AQ/FAT5-small

spaces 2

Free online AI courses in French

SSM Blog Posts

models

datasets 10

lbourdois/radios_et_podcasts_en_ligne_15

lbourdois/VQA-worldcuisines-vqa-clean

lbourdois/OCR-neulab-PangeaInstruct-OCR-clean

lbourdois/OCR-liboaccn-OPUS-MIT-5M-clean

lbourdois/caption-maya-multimodal-pretrain-clean

lbourdois/MTEB_leaks_and_duplications

lbourdois/panlex

lbourdois/LLE

lbourdois/language_tags

lbourdois/en-fr-nyu-dl-course-corpus

Loïck BOURDOIS

AI & ML interests

Recent Activity

Organizations

Posts 4

Articles 3

Introduction to State Space Models (SSM)

Collections 9

Free online AI courses in French

SSM Blog Posts

FAT5 (Flash Attention T5) report

Le FAT5 : Flash Attention T5

spaces 2 Sort: Recently updated

Free online AI courses in French

SSM Blog Posts

models

datasets 10 Sort: Recently updated

spaces 2

datasets 10