workaround for transformers bug requiring do_sample for saving pretrained (#1206) ba944e6 unverified winglian committed on Jan 25
fix: switch to using the HuggingFace Transformers NEFT implementation (#941) ef24342 unverified dg-kalle committed on Dec 13, 2023
refactor neft patch to be more re-usable similar to trl's impl (#796) 827ec3d unverified winglian committed on Oct 29, 2023
set fsdp state dict (#584) be75668 unverified Jan Philipp Harries committed on Sep 15, 2023