Get qlora mistral-7b fine tuning working on a single 4090 (#708) 295b266 lukemarsden committed on Oct 10, 2023 (see the QLoRA sketch after this log)
Fix: Higher vram usage for mistral and sample_packing (#691) 669f1d0 Nanobit committed on Oct 6, 2023
prepared dataset caching, other misc fixes (#665) e50a64e winglian committed on Oct 3, 2023
eval_table isn't quite stable enough to be in default llama configs (#637) d887ad8 winglian committed on Sep 26, 2023
more sane defaults for openllama 3b used for quickstarts (#602) 674c576 winglian committed on Sep 19, 2023
btlm and falcon monkey patches for flash attn (#566) 6b9b229 winglian committed on Sep 17, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 Glavin001 committed on Sep 13, 2023
recommend padding when using sample packing (#531) 3437149 winglian committed on Sep 6, 2023
Add support for GPTQ using native transformers/peft (#468) 3355706 winglian committed on Sep 5, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 Birch-san and tmm1 committed on Aug 28, 2023
Feat(cfg): Add code-llama configs for all sizes (#479) 3513071 mhenrichsen committed on Aug 27, 2023
new llama-2 default settings (#370) fdffef5 mhenrichsen (Mads Henrichsen) committed on Aug 14, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 Morgan McGuire and winglian committed on Aug 12, 2023
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum 16bb627 winglian committed on Jun 14, 2023
Merge pull request #193 from OpenAccess-AI-Collective/config-fixes-20230612 94f310c winglian committed on Jun 12, 2023
Merge pull request #132 from utensil/falcon-7b-qlora c8242de Nanobit committed on Jun 8, 2023
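
The first entry in this log (#708) is about fitting QLoRA fine-tuning of Mistral-7B onto a single 24 GB RTX 4090. As a minimal sketch of the general technique (not taken from that commit), the base model is loaded in 4-bit with bitsandbytes and wrapped with LoRA adapters via peft; the model id, LoRA rank, and target modules below are illustrative assumptions, not values from the axolotl config.

# Minimal sketch (assumptions, not the axolotl implementation): 4-bit QLoRA setup
# for Mistral-7B using transformers + peft + bitsandbytes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "mistralai/Mistral-7B-v0.1"  # illustrative choice of base model

# Quantize weights to 4-bit NF4 so the 7B model fits on a single 24 GB GPU.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

# Prepare the quantized model for training (casts norms to fp32, enables input grads).
model = prepare_model_for_kbit_training(model)

# Attach low-rank adapters; rank and target modules are illustrative.
lora_config = LoraConfig(
    r=32,
    lora_alpha=16,
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable

In axolotl itself the equivalent setup is driven by a YAML example config rather than hand-written code; the commits above track changes to those example configs.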