Commits · Dovakiins/qwerrwe

tweaks to data loading, 8 bit adam, accelerate and deepspeed

097d367

winglian committed on Apr 22, 2023

shuffle and split dataset after save/load

4f2584f

winglian committed on Apr 20, 2023

fix sharegpt handling from hf, don't worry about loading llama if using earlier transformers release

8d43785

winglian committed on Apr 20, 2023

various bugfixes

94f5e41

winglian committed on Apr 19, 2023

fix bug when model_type not explicitly passed

bb991fd

winglian committed on Apr 19, 2023

improve inference

d653859

winglian committed on Apr 19, 2023

quickstart instructions for starting from runpod (#5)

0a472e1
unverified

winglian committed on Apr 18, 2023

attempt xformers hijack attention

8746b70

winglian committed on Apr 18, 2023

WIP large refactor to make finetune script a little more manageable (#3)

6045345
unverified

winglian committed on Apr 18, 2023

add support for alpaca reflect training (#2)

81de0ef
unverified

winglian committed on Apr 18, 2023

Tokenization open assistant (#1)

87d7825
unverified

winglian committed on Apr 18, 2023

support for alpaca-like instruction datasets without inputs

e107643

winglian committed on Apr 18, 2023

casts the prepared data to int16 (doesn't help with training memory)

2db9436

winglian committed on Apr 18, 2023

4bit quantized support (wip)

77fca25

winglian committed on Apr 17, 2023

various bugfixes

80b2ed2

winglian committed on Apr 15, 2023

config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes

f2a2029

winglian committed on Apr 14, 2023

black formatting

a6028d3

winglian committed on Apr 14, 2023

make it work with pythia in the cloud

8d959a7

winglian committed on Apr 14, 2023

WIP for axolotl trainer

ce24f5e

winglian committed on Apr 14, 2023
