Commit History
feat: validate sample packing requires flash_attention (#1465)
bf4cd67 (Nanobit)

make sure to capture non-null defaults from config validation (#1415)
601b77b (winglian)

fix for protected model_ namespace w pydantic (#1345)
6b3b271 (winglian)

more fixes 20240228 (#1342) [skip ci]
0f985e1 (winglian)

Pydantic 2.x cfg (#1239)
cc3cebf (winglian)

Peft lotfq (#1222)
4cb7900 (winglian)

ADD: warning if hub_model_id ist set but not any save strategy (#1202)
af29d81

Phi2 multipack (#1173)
814aee6 (winglian)

Deprecate max packed sequence len (#1141)
2ce5c0d (winglian)

Add `layers_to_transform` for `lora_config` (#1118)
8487b97 (xzuyn)

add gptneox embeddings, fix phi2 inputs, also fix the casting (#1083)
78c5b19 (winglian)

be more robust about checking embedding modules for lora finetunes (#1074) [skip ci]
0f10080 (winglian)

attempt to also run e2e tests that needs gpus (#1070)
788649f (winglian)

Feat: Warns to add to modules_to_save when adding tokens or switching special_tokens (#787)
1ffa386 (Nanobit)

Feat(wandb): Refactor to be more flexible (#767)
a1da39c (Nanobit)

Feat: Add warmup_ratio (#893)
fb12895 (Nanobit)

Fix: Warn when fullfinetune without adapter (#770)
44c9d01 (Nanobit)

Fix: eval table conflict with eval_sample_packing (#769)
9923b72 (Nanobit)

Fix(cfg): Add validation for save_strategy and eval_strategy (#633)
383f88d (Nanobit)

use fastchat conversations template (#578)
e7d3e2d (winglian)

Fix: Fail bf16 check when running on cpu during merge (#631)
cfbce02 (Nanobit)

recommend padding when using sample packing (#531)
3437149 (winglian)

extract module for working with cfg
8cec513 (tmm1)

Attention mask and position id fixes for packing (#285)
2bb0b78 (winglian)

params are adam_*, not adamw_*
19cf0bd (winglian)

Additional test case per pr
ad5ca4f (winglian)

add validation and tests for adamw hyperparam
cb9d3af (winglian)

Merge branch 'main' into flash-optimum
fd2c981 (winglian)

new validation for mpt w grad checkpoints
14668fa (winglian)

add streaming dataset support for pretraining datasets
eea2731 (winglian)

Validate falcon with fsdp
babf0fd (Nanobit)

Update doc for grad_accu and add validation tests for batch size
3c71c8d (Nanobit)

black formatting
6fa40bf (winglian)

add support for gradient accumulation steps
3aad5f3 (winglian)

Apply isort then black
37293dc (Nanobit)

Ignore unsupported-binary-operation
0dd35c7 (Nanobit)

Black formatting
b832a0a (Nanobit)

Lint validation
1f3c3f5 (Nanobit)

update for pr feedback
fd5f965 (winglian)

new hf_use_auth_token setting so login to hf isn't required
1c33eb8 (winglian)

Feat: Update validate_config and add tests
52dd92a (Nanobit)