File size: 318 Bytes
02af082
 
05b398a
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
# Jamba

- βœ… qlora w/ deepspeed Zero-2 needs at least 2x GPUs and
  - 35GiB VRAM per GPU w minimal context length
  - 56GiB VRAM per GPU (w multipack enabled)
- βœ… qlora w/ deepspeed Zero-3 needs at least 2x GPUs and 67GiB VRAM (wtf?)
- βœ… qlora single-gpu, ~51GiB VRAM
- βœ… multipack
- ❓ FSDP
- ❓ 8-bit LoRA