Commits · Dovakiins/qwerrwe

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

Glavin001 commited on Sep 13, 2023

reorg a bit

fc8766e

tmm1 commited on Sep 5, 2023

use flash_attn rmsnorm when available (#526)

72a6fe1
unverified

tmm1 commited on Sep 4, 2023

use flash_attn xentropy when available (#525)

5fe30b1
unverified

tmm1 commited on Sep 4, 2023

fix checkpints on multigpu (#481)

31f3e71
unverified

winglian commited on Aug 26, 2023

ReLoRA implementation (with quantization) (#322)

bde3c5a
unverified

chargoddard

winglian commited on Aug 24, 2023

fix eval regression caused in 13f7efaf74fcd3c4514277ccb71914c589873f6a

a213d99

tmm1 commited on Aug 21, 2023

is_causal fix for evals?

fbf49a4

winglian commited on Aug 21, 2023

fix evals (#447)

ee26281
unverified

winglian commited on Aug 21, 2023

standardize attn hijack patches (#381)

06edf17
unverified

tmm1

winglian commited on Aug 18, 2023

fix check for flash attn branching (#377)

343ac84
unverified

winglian commited on Aug 13, 2023

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

winglian commited on Aug 12, 2023

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)

10405b9
unverified

ssmi153 commited on Aug 6, 2023

move flash-attn monkey patch alongside the others

312a9fa

tmm1 commited on Aug 3, 2023

fix sdp attention to use the flash/mem-efficient context manaager

a032c9f

winglian commited on Jul 20, 2023

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

theobjectivedad commited on Jul 15, 2023

Adding logging enhancement

553a86b

theobjectivedad commited on Jul 14, 2023

Fix set mem_id for inference and refactor

974dc00

Nanobit commited on Jun 11, 2023

Clean up landmark patching

a6190c8

Nanobit commited on Jun 11, 2023

Refactor landmark attention patch

919727b

Nanobit commited on Jun 9, 2023

add support to extend context with xpos rope

a03a7d7

winglian commited on Jun 10, 2023

Fix grad checkpoint and outputs param

2a801b0

Nanobit commited on Jun 9, 2023

Feat: Add landmark attention

55b8542

Nanobit commited on Jun 9, 2023

don't worry about dupes

c56818b

winglian commited on May 31, 2023

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

1076bcb
unverified

winglian

Nanobit commited on May 31, 2023

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

2daa683
unverified

winglian

Nanobit commited on May 31, 2023

black formatting

ad0ea6a

winglian commited on May 31, 2023

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit

6cb2310

winglian commited on May 31, 2023

Spaces:

Dovakiins
/

qwerrwe

Build error

Commit History

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

reorg a bit

fc8766e

use flash_attn rmsnorm when available (#526)

72a6fe1
unverified

use flash_attn xentropy when available (#525)

5fe30b1
unverified

fix checkpints on multigpu (#481)

31f3e71
unverified

ReLoRA implementation (with quantization) (#322)

bde3c5a
unverified

fix eval regression caused in 13f7efaf74fcd3c4514277ccb71914c589873f6a

a213d99

is_causal fix for evals?

fbf49a4

fix evals (#447)

ee26281
unverified

standardize attn hijack patches (#381)

06edf17
unverified

fix check for flash attn branching (#377)

343ac84
unverified

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)

10405b9
unverified

move flash-attn monkey patch alongside the others

312a9fa

fix sdp attention to use the flash/mem-efficient context manaager

a032c9f

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

Adding logging enhancement

553a86b

Fix set mem_id for inference and refactor

974dc00

Clean up landmark patching

a6190c8

Refactor landmark attention patch

919727b

add support to extend context with xpos rope

a03a7d7

Fix grad checkpoint and outputs param

2a801b0

Feat: Add landmark attention

55b8542

don't worry about dupes

c56818b

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

1076bcb
unverified

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

2daa683
unverified

black formatting

ad0ea6a

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit

6cb2310

Commit History

Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified

reorg a bit fc8766e

use flash_attn rmsnorm when available (#526) 72a6fe1 unverified

use flash_attn xentropy when available (#525) 5fe30b1 unverified

fix checkpints on multigpu (#481) 31f3e71 unverified

ReLoRA implementation (with quantization) (#322) bde3c5a unverified

fix eval regression caused in 13f7efaf74fcd3c4514277ccb71914c589873f6a a213d99

is_causal fix for evals? fbf49a4

fix evals (#447) ee26281 unverified

standardize attn hijack patches (#381) 06edf17 unverified

fix check for flash attn branching (#377) 343ac84 unverified

Attention mask and position id fixes for packing (#285) 2bb0b78 unverified

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339) 10405b9 unverified

move flash-attn monkey patch alongside the others 312a9fa

fix sdp attention to use the flash/mem-efficient context manaager a032c9f

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var b1f4f7a

Adding logging enhancement 553a86b

Fix set mem_id for inference and refactor 974dc00

Clean up landmark patching a6190c8

Refactor landmark attention patch 919727b

add support to extend context with xpos rope a03a7d7

Fix grad checkpoint and outputs param 2a801b0

Feat: Add landmark attention 55b8542

don't worry about dupes c56818b

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py 1076bcb unverified

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py 2daa683 unverified

black formatting ad0ea6a

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit 6cb2310

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

reorg a bit

fc8766e

use flash_attn rmsnorm when available (#526)

72a6fe1
unverified

use flash_attn xentropy when available (#525)

5fe30b1
unverified

fix checkpints on multigpu (#481)

31f3e71
unverified

ReLoRA implementation (with quantization) (#322)

bde3c5a
unverified

fix eval regression caused in 13f7efaf74fcd3c4514277ccb71914c589873f6a

a213d99

is_causal fix for evals?

fbf49a4

fix evals (#447)

ee26281
unverified

standardize attn hijack patches (#381)

06edf17
unverified

fix check for flash attn branching (#377)

343ac84
unverified

Attention mask and position id fixes for packing (#285)

2bb0b78
unverified

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)

10405b9
unverified

move flash-attn monkey patch alongside the others

312a9fa

fix sdp attention to use the flash/mem-efficient context manaager

a032c9f

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var

b1f4f7a

Adding logging enhancement

553a86b

Fix set mem_id for inference and refactor

974dc00

Clean up landmark patching

a6190c8

Refactor landmark attention patch

919727b

add support to extend context with xpos rope

a03a7d7

Fix grad checkpoint and outputs param

2a801b0

Feat: Add landmark attention

55b8542

don't worry about dupes

c56818b

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

1076bcb
unverified

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

2daa683
unverified

black formatting

ad0ea6a

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit

6cb2310