eval_table isn't quite stable enough to be in default llama configs (#637) d887ad8 unverified winglian commited on Sep 26, 2023
more sane defaults for openllama 3b used for quickstarts (#602) 674c576 unverified winglian commited on Sep 19, 2023
btlm and falcon monkey patches for flash attn (#566) 6b9b229 unverified winglian commited on Sep 17, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified Glavin001 commited on Sep 13, 2023
recommend padding when using sample packing (#531) 3437149 unverified winglian commited on Sep 6, 2023
Add support for GPTQ using native transformers/peft (#468) 3355706 unverified winglian commited on Sep 5, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023
Feat(cfg): Add code-llama configs for all sizes (#479) 3513071 unverified mhenrichsen mhenrichsen commited on Aug 27, 2023
new llama-2 default settings (#370) fdffef5 unverified mhenrichsen Mads Henrichsen commited on Aug 14, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum 16bb627 unverified winglian commited on Jun 14, 2023
Merge pull request #193 from OpenAccess-AI-Collective/config-fixes-20230612 94f310c unverified winglian commited on Jun 12, 2023
Merge pull request #132 from utensil/falcon-7b-qlora c8242de unverified Nanobit commited on Jun 8, 2023
Default `wandb_project` to empty as suggested a52f481 unverified utensil Nanobit commited on Jun 8, 2023
Add comments/alternatives for falcon-qlora configs ca11ae9 unverified utensil commited on Jun 3, 2023
swap batch size for gradient accumulation steps to decouple from num gpu c2a0792 winglian commited on May 31, 2023
Merge pull request #105 from viktoriussuwandi/viktoriussuwandi-patch 4df9da7 unverified winglian commited on May 30, 2023
Merge pull request #106 from fearnworks/qlora-openllama-3b-example 2531ea2 unverified winglian commited on May 30, 2023