qwerrwe / scripts

Commit History

7b55fe6  improve GPU logging to break out pytorch cache and system mem (tmm1)
8cec513  extract module for working with cfg (tmm1)
2bb0b78  Attention mask and position id fixes for packing (#285) (winglian)
a276c9c  Fix(save): Save as safetensors (#363) (Nanobit)
289d5c4  feat(merge): save tokenizer on merge (#362) (Nanobit)
11ddccb  Merge pull request #356 from tmm1/load_model-args (tmm1)
7181022  simplify load_model signature (tmm1)
e303d64  log GPU memory usage (tmm1)
894cba0  fix FSDP save of final model (#329) (winglian)
cf62cfd  add runpod envs to .bashrc, fix bnb env (#316) (winglian)
d75adb9  misc fixes (winglian)
b1f4f7a  Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var (theobjectivedad)
16bb627  Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum (winglian)
dc77c8e  chore: Refactor inf_kwargs out (Nanobit)
fd2c981  Merge branch 'main' into flash-optimum (winglian)
8002ffb  Merge pull request #177 from NanoCode012/fix/landmark-patch (winglian)
8e568bb  Merge pull request #159 from AngainorDev/patch-1 (Nanobit)
b565ecf  Fix strict and Lint (Angainor)
974dc00  Fix set mem_id for inference and refactor (Nanobit)
572d114  Set mem cache args on inference (Nanobit)
958da70  fix formatting (winglian)
c4e4f81  pass a prompt in from stdin for inference (winglian)
759e867  Update scripts/finetune.py (winglian, Nanobit)
0c6f928  address PR feedback (winglian)
eea2731  add streaming dataset support for pretraining datasets (winglian)
1210dc8  more tweaks to do pre-training with bettertransformers (winglian)
488a67d  experimental expansion of ctx len (winglian)
8792199  add flash attn context for efficient training and attempt setting model to train mode (winglian)
1edc30c  add support for optimum bettertransformers (winglian)
79e2a6f  Merge branch 'main' into patch-1 (Angainor Development)
c250898  Remove explicit definition of cfg.inference (Angainor Development)
f36e227  formatting for linter (winglian)
fec6bcc  Add streaming inference & fix stopping at EOS (Glavin001)
bd3b537  Feed cfg.inference (Angainor Development)
52765ac  Set matmul tf32 (Nanobit)
4ac9e25  new prompters, misc fixes for output dir missing using fsdp, and changing max seq len (winglian)
74ebbf4  fix device map (winglian)
5a631b3  fix batch size calculation (winglian)
fac4600  Merge pull request #119 from NanoCode012/feat/update-inference (Nanobit)
33d4017  Increase max_new_tokens (Nanobit, winglian)
c7021e1  Merge pull request #120 from OpenAccess-AI-Collective/model-from-path (winglian)
6fa40bf  black formatting (winglian)
3aad5f3  add support for gradient accumulation steps (winglian)
39a208c  fix up tokenizer config, isort fix (winglian)
988aeb9  Feat: Swap to GenerationConfig (Nanobit)
bbc5bc5  Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq (winglian)
a1f9850  Fix security issue or ignore false positives (Nanobit)
37293dc  Apply isort then black (Nanobit)
96e8378  Delete extract_lora.py (Nanobit)