update outputs path so that we can mount workspace to /workspace/data (#1623) 4fde300 winglian committed on May 15, 2024
strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428) 2a1589f winglian committed on Mar 21, 2024
fix(examples): remove is_*_derived as it's parsed automatically (#1297) a7a9a14 Nanobit committed on Feb 21, 2024
set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] 782b6a4 winglian, Nanobit committed on Jan 22, 2024
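For reference, a minimal sketch of the precision block the example YAMLs converge on after this change (key names taken from the commit message; values are illustrative, not the exact diff):

    bf16: auto   # let axolotl enable bf16 only where the hardware supports it
    fp16: false  # kept off whenever bf16 is in use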
Add shifted sparse attention (#973) [skip-ci] 1d70f24 jrc, joecummings, winglian committed on Jan 18, 2024
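Shifted sparse attention is toggled per config; a hedged sketch, assuming the s2_attention flag axolotl uses for LongLoRA-style s2-attn (not the exact contents of the PR):

    s2_attention: true  # enable shifted sparse attention for long-context fine-tuning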
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) 5f79b82 winglian committed on Dec 12, 2023
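These two options schedule evaluation and checkpointing per epoch instead of by step count; a minimal sketch (values illustrative):

    evals_per_epoch: 4  # run the eval loop four times per epoch
    saves_per_epoch: 1  # write one checkpoint per epoch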
don't compile deepspeed or bitsandbytes from source (#837) f544ab2 winglian committed on Nov 9, 2023
simplify by removing duplicate base_model_config (#772) 2d8def6 winglian committed on Oct 23, 2023
prepared dataset caching, other misc fixes (#665) e50a64e winglian committed on Oct 3, 2023
eval_table isn't quite stable enough to be in default llama configs (#637) d887ad8 winglian committed on Sep 26, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 Glavin001 committed on Sep 13, 2023
recommend padding when using sample packing (#531) 3437149 winglian committed on Sep 6, 2023
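The recommendation pairs sample packing with padding out to the configured sequence length; a minimal sketch, assuming the sample_packing and pad_to_sequence_len keys from the example configs (values illustrative):

    sequence_len: 2048
    sample_packing: true
    pad_to_sequence_len: true  # recommended whenever sample_packing is on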
Add support for GPTQ using native transformers/peft (#468) 3355706 winglian committed on Sep 5, 2023
new llama-2 default settings (#370) fdffef5 mhenrichsen (Mads Henrichsen) committed on Aug 14, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 Morgan McGuire, winglian committed on Aug 12, 2023
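A minimal sketch of the wandb block after this change (wandb_entity is the key this commit adds; names are placeholders):

    wandb_project: my-axolotl-runs  # placeholder project name
    wandb_entity: my-team           # W&B team or username to log under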