swap batch size for gradient accumulation steps to decouple it from the number of GPUs (see the config sketch below) c2a0792 winglian committed on May 31, 2023
Update wandb_log_model in llama_7B_alpaca.yml d77d736 Viktorius Suwandi committed on May 29, 2023
fix LoRA target modules, require explicit flash attention, fix min logging steps, don't use adam8bit for int4, hash prepared datasets, support HF Hub datasets 87e073d winglian committed on Apr 17, 2023
DeepSpeed doesn't work with flash-attn, and the GPU memory savings from flash attention outweigh the DeepSpeed headaches d1aed4c winglian committed on Apr 16, 2023
add llama 7b config and fix lora_fan_in_fan_out for llama (copy-paste bug) d060c80 winglian committed on Apr 15, 2023
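To make the batch-size and LoRA changes above concrete, here is a minimal sketch of what an axolotl-style llama_7B_alpaca.yml might look like after these commits. The key names and values (micro_batch_size, gradient_accumulation_steps, flash_attention, lora_target_modules, wandb_log_model, and the base model) are assumptions inferred from the commit messages, not taken verbatim from the repository at these revisions.

```yaml
# Sketch of a llama 7B alpaca config reflecting the commits above.
# All keys and values are illustrative assumptions, not the actual
# file contents at these revisions.
base_model: huggyllama/llama-7b  # assumed base model identifier

# Per-device batch size plus accumulation steps instead of a global
# batch_size, so the effective batch no longer scales with GPU count.
micro_batch_size: 4
gradient_accumulation_steps: 8

# Flash attention must now be requested explicitly.
flash_attention: true

# LoRA settings, with target modules for llama's attention projections
# and the fan_in_fan_out flag corrected for llama.
adapter: lora
lora_target_modules:
  - q_proj
  - v_proj
lora_fan_in_fan_out: false

# Log the trained model artifact to Weights & Biases.
wandb_log_model: checkpoint  # assumed value
```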