Commit History
add support for rpo_alpha (#1681) · c996881 · winglian
re-enable DPO for tests in modal ci (#1374) · 1f151c0 · winglian
Fix the broken link in README (#1678) [skip ci] · 5cde065 · saeedesmaili
need to add back drop_last for sampler (#1676) · 05b0bd0 · winglian
cleanup the deepspeed proxy model at the end of training (#1675) · d4f6c65 · winglian
load explicit splits on datasets (#1652) · a944f7b · winglian
set chat_template in datasets config automatically (#1664) · 9d4225a · winglian
use mixins for orpo and kto configs so they work with axolotl customizations (#1674) · f7332ac · winglian
re-enable phi for tests in modal ci (#1373) · 16d46b7 · winglian
revert multipack batch sampler changes (#1672) · a6b37bd · winglian
handle the system role too for chat templates (#1671) · b752080 · winglian
make sure the CI fails when pytest script fails (#1669) · fe650dd · winglian
Fix README quick start example usage model dirs (#1668) · 49b967b · Abe Voelker
Correct name of MixtralBlockSparseTop2MLP (L -> l) (#1667) · 65db903 · seungduk
Fix: ensure correct handling of `val_set_size` as `float` or `int` (#1655) · 6a5a725
fix lint issue that snuck through (#1665) · f5febc7 · winglian
Fix Lora config error for Llama3 (#1659) · 230e0ac · oaishi
Generalizing the chat_template prompt strategy (#1660) [skip ci] · cc11c6b · fozziethebeat
Fix Google Colab notebook 2024-05 (#1662) [skip ci] · 5f91064 · Maciek
update deps (#1663) [skip ci] · ef22351 · winglian
document how to use `share_strategy="no"` (#1653) [skip ci] · 8a20a7b · charlesfrye
support for custom messages field in sharegpt (#1651) · bbfed31 · winglian
Update tiny-llama qlora.yml addressing eval packing error (#1638) · 84bb806 · Jaydeep Thik
enable loraplus setting for dpo trainer (#1646) · a27d5e1 · thepowerfuldeez
allow report_to for multiple providers (#1647) · 6299eb5 · winglian
Fix llama3 chat_template (extra <|eot_id|> on last turn) (#1635) · 7c2bf30
Add KTO support (#1640) · 22ae21a
fixes to save on fractional save_steps (#1643) · ba45531 · winglian
Unsloth optims for Llama (#1609) · 8a1572a · winglian
add save_only_model option (#1634) · 702a669 · emozilla
fix ray install (#1630) · 891ae8a · winglian
more fixes to work with runpod + skypilot (#1629) · 0c49ecc · winglian
cloud image w/o tmux (#1628) · 6011343 · winglian
install rsync too (#1627) · 419b2a6 · winglian
fix setting the authorized keys when there are more than one in the env var (#1626) · 2501a37 · winglian
fix symlinks for axolotl outputs (#1625) · e6937e8 · winglian
bump versions of deps (#1621) · 039e2a0 · winglian
update outputs path so that we can mount workspace to /workspace/data (#1623) · 4fde300 · winglian
update torch 2.2.1 -> 2.2.2 (#1622) · 3319780 · winglian
Fix `total_num_steps` (#1566) · 81da7d2 · bofenghuang
FIX: max_length and max_prompt_length was not being sent to ORPOTrainer (#1584) · 1e1921b
make sure to save on the last step (#1615) · 1634ac8 · winglian
fix attention mask collation (#1603) · 0298273 · winglian
feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553) · 50421c8
adding llama3 fastchat conversation monkeypatch (#1539) · b32c08f
ignore the fsdp_config section too (#1606) [skip ci] · fff06af · winglian