Commit History
add support to extend context with xpos rope
a03a7d7
Add streaming inference & fix stopping at EOS
fec6bcc
Merge pull request #179 from OpenAccess-AI-Collective/fix-max_seq_len
931e606
unverified
fix for max sequence len across different model types
7f09106
Merge pull request #178 from PocketDocLabs/main
6b50200
unverified
Update README.md to reflect current gradient checkpointing support
16f9e28
unverified
Merge pull request #176 from NanoCode012/fix/peft-import
b9083a7
unverified
Fix backward compat for peft
aefb2fc
Merge pull request #169 from NanoCode012/feat/landmark
b5aa8d8
unverified
Merge pull request #171 from OpenAccess-AI-Collective/NanoCode012-falcon-lora-matrix
4d6490b
unverified
Fix falcon support lora
b242b69
unverified
Merge pull request #170 from OpenAccess-AI-Collective/NanoCode012-lambdalabs-fix
320beb2
unverified
Feed cfg.inference
bd3b537
unverified
Angainor Development
commited on
WIP: Rely on cfg.inference
813cfa4
unverified
Angainor Development
commited on
Improve lambda labs instruction
2e13cef
unverified
Fix grad checkpoint and outputs param
2a801b0
Fix patching via import instead of hijacking
e44c9e0
Feat: Add landmark attention
55b8542
Merge pull request #168 from bratao/main
febe902
unverified
Disable Wandb
f4df266
Bruno Cabral
commited on
Merge pull request #167 from NanoCode012/fix/redundant-save-eval-steps
281dc3d
unverified
Refactor out unmodified save_steps and eval_steps
2ef4634
Merge pull request #166 from NanoCode012/fix/seed
7eae903
unverified
Merge pull request #132 from utensil/falcon-7b-qlora
c8242de
unverified
Set to use cfg.seed or 42 for backward compat
2cfe9e9
Trim trailing whitespace
79a8f52
unverified
Merge pull request #164 from NanoCode012/fix/falcon-fsdp-validate
afaa0d2
unverified
Fix failing test
bfd27ba
Validate falcon with fsdp
babf0fd
Merge pull request #163 from NanoCode012/feat/matmul-tf32
81911d1
unverified
Set matmul tf32
52765ac
Merge pull request #143 from NanoCode012/fix/deprecate-prepare-8bit-training
73e9ea4
unverified
Merge pull request #162 from NanoCode012/fix/custom-prompt-readme
f8d3798
unverified
Merge pull request #161 from NanoCode012/fix/peft-setup
04a1b77
unverified
Move custom prompts out of hidden
2097a09
Add peft install for quickstart
cfff94b
Update peft and gptq instruction
2b222de
Fix future deprecate prepare_model_for_int8_training
df9528f
Fix training over existing lora
193c73b
unverified
Angainor Development
commited on