allow overriding of model_config parameters from the YML (#853) 1bc1186 unverified winglian commited on Nov 16, 2023
lint fix that didn't get caught by linter (#866) 332984d unverified winglian commited on Nov 15, 2023
Docs: add instructions to 1-click launching on public clouds (#862) b33c1d5 unverified zongheng commited on Nov 15, 2023
feat(doc): add more info on train_on_split (#855) 306fe19 unverified Nanobit commited on Nov 15, 2023
update table for rwkv4 support, fix process count for dataset (#822) cdc71f7 unverified winglian commited on Nov 5, 2023
Add docker advanced instruction to README (#792) 2e71ff0 unverified gordicaleksa commited on Oct 27, 2023
chore(readme): Improve documentation on conversation field (#782) 20aa4b5 unverified Nanobit commited on Oct 24, 2023
fix(doc): update default doc according to arg (#714) 5855dde unverified Nanobit commited on Oct 10, 2023
fix(doc): Add note on inference w sample packing (#712) 11c48c5 unverified Nanobit commited on Oct 10, 2023
refactor to set eval_batch_size earlier if unset, so we can warn if mismatched (#662) 2642cae unverified winglian commited on Oct 3, 2023
skip some flash attn patches unless explicitly enabled (#643) 895f0a0 unverified winglian commited on Sep 27, 2023
Added quotes to the pip install -e command to fix an incompatibility with shells that do glob expansion like zsh (#632) 5e5296a unverified Fernando Tarin Morales commited on Sep 25, 2023
Feat(data): Allow loading local csv and text (#594) 00dce35 unverified Nanobit commited on Sep 17, 2023
support custom field for completion from yml (#580) f7a2263 unverified winglian commited on Sep 15, 2023
refactor scripts/finetune.py into new cli modules (#550) 861ceca unverified winglian Nanobit commited on Sep 15, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified Glavin001 commited on Sep 13, 2023
document that packaging needs to be installed before flash-attn (#559) 9845c5e unverified winglian commited on Sep 12, 2023
ergonomic update to optimizer config doc (#548) 6d57f2f unverified The Objective Dad commited on Sep 11, 2023
update readme to point to direct link to runpod template, cleanup install instrucitons (#532) 34c0a86 unverified winglian commited on Sep 8, 2023
Fix(doc): Inform Windows users to use WSL/docker (#518) f51c9c5 unverified Nanobit commited on Sep 1, 2023
Added advanced DDP args (#515) 396a7a7 unverified Jan Philipp Harries Jan Philipp Harries commited on Aug 31, 2023
Fix(doc): Clarify no amp to full yaml docs (#496) 48c5647 unverified Nanobit commited on Aug 29, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023