Update README.md to reflect current gradient checkpointing support 16f9e28 unverified PocketDoc commited on Jun 9, 2023
Merge pull request #169 from NanoCode012/feat/landmark b5aa8d8 unverified Nanobit commited on Jun 9, 2023
Merge pull request #132 from utensil/falcon-7b-qlora c8242de unverified Nanobit commited on Jun 8, 2023
Merge pull request #162 from NanoCode012/fix/custom-prompt-readme f8d3798 unverified Nanobit commited on Jun 8, 2023
Merge pull request #142 from NanoCode012/feat/custom-prompt-readme ecfe8d0 unverified winglian commited on Jun 2, 2023
Update doc for grad_accu and add validation tests for batch size 3c71c8d Nanobit commited on May 31, 2023
Merge pull request #130 from OpenAccess-AI-Collective/gas f94dd62 unverified winglian commited on May 31, 2023
swap batch size for gradient accumulation steps to decouple from num gpu c2a0792 winglian commited on May 31, 2023
Merge pull request #118 from NanoCode012/feat/torch-readme 0e4be62 unverified Nanobit commited on May 31, 2023
Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix 2d0ba3b unverified winglian commited on May 31, 2023
Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq bbc5bc5 unverified winglian commited on May 30, 2023
new hf_use_auth_token setting so login to hf isn't required 1c33eb8 winglian commited on May 28, 2023
Merge branch 'main' into refactor/rename-4b-to-gptq 147241c unverified winglian commited on May 27, 2023
Merge pull request #62 from OpenAccess-AI-Collective/qlora-fixes bbfc333 unverified winglian commited on May 26, 2023
qlora merge and load requires that base model isn't loaded in 4 or 8 bit 3f6017d winglian commited on May 26, 2023