Commit History

E2e device cuda (#575)
2414673
unverified

winglian commited on

Model parallel (#538)
f6060a6
unverified

winglian commited on

Add training callback to send predictions to WandB table (#521)
5b67ea9
unverified

Glavin001 commited on

Fix pretraining with iterable/streaming Dataset (#556)
2f586d1
unverified

Jan Philipp Harries Jan Philipp Harries commited on

Early stopping metric (#537)
e30f1e3
unverified

winglian commited on

recommend padding when using sample packing (#531)
3437149
unverified

winglian commited on

Add support for GPTQ using native transformers/peft (#468)
3355706
unverified

winglian commited on

move is_llama_derived_model into normalize_config (#524)
44454ae
unverified

tmm1 commited on

ReLoRA implementation (with quantization) (#322)
bde3c5a
unverified

chargoddard winglian commited on

recast loralayer, norm, lmhead + embed token weights per original qlora (#393)
96deb6b
unverified

winglian commited on

Fix(config): Update handling of deepspeed config (#404)
c01015f
unverified

Nanobit commited on

try to detect accelerate and only use device_map=None in that case (#373)
094fc2c
unverified

tmm1 commited on

improve GPU logging to break out pytorch cache and system mem
7b55fe6

tmm1 commited on

extract module for working with cfg
8cec513

tmm1 commited on