Commit History
Add StableLM 2 Example Scripts (#1327) [skip ci]
f30d062
unverified
ncoop57
commited on
hotfix to exclude_unset from pydantic config when converting back to a dict (#1334)
269c543
unverified
winglian
commited on
hotfix for missing outputs params (#1333)
e7eed20
unverified
winglian
commited on
hotfix for lora rank (#1332)
cf00231
unverified
winglian
commited on
hotfix for capabilities loading (#1331)
7de912e
unverified
winglian
commited on
chore: update readme to be more clear (#1326) [skip ci]
c6b01e0
unverified
Nanobit
commited on
Pydantic 2.x cfg (#1239)
cc3cebf
unverified
winglian
commited on
make mlflow optional (#1317)
5894f0e
unverified
winglian
commited on
Use yaml codeblock for config.yaml field (#1303) [skip ci]
5cf226e
unverified
dg-kalle
commited on
fix(readme): Clarify doc for tokenizer_config (#1323) [skip ci]
2ed52bd
unverified
Nanobit
commited on
deprecate: pytorch 2.0.1 image (#1315) [skip ci]
a359579
unverified
Nanobit
commited on
multipack for gemma (#1313)
2752d5f
unverified
winglian
commited on
Adding Google's gemma Model (#1312)
9e300ac
unverified
aaditya
commited on
fix(readme): update inference md link (#1311) [skip ci]
3d2cd80
unverified
Nanobit
commited on
Add instructions for playing with qlora model to colab example (#1290)
6ab69ec
unverified
Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291)
3c00f40
unverified
David Meikle
commited on
fix(examples): remove is_*_derived as it's parsed automatically (#1297)
a7a9a14
unverified
Nanobit
commited on
Validation always happens on first step (#1300)
e2786cc
unverified
LeonardoEmili
commited on
Add seq2seq eval benchmark callback (#1274)
5a5d474
unverified
LeonardoEmili
commited on
Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
8430db2
unverified
jinwonkim93
commited on
allow the optimizer prune ratio for ReLoRA to be configurable (#1287)
4b997c3
unverified
winglian
commited on
Add MPS support (#1264)
fac2d98
unverified
don't use load and push together (#1284)
ea00dd0
unverified
winglian
commited on
Update README.md (#1281)
b2a4cb4
unverified
hamel
commited on
run the docker image builds and push on gh action gpu runners (#1218)
aaf54dc
unverified
winglian
commited on
add support for https remote yamls (#1277)
9bca7db
unverified
hamel
commited on
allow remote data paths (#1278)
91cf4ee
unverified
hamel
commited on
copy edits (#1276)
1daecd1
unverified
winglian
commited on
Add link to axolotl cloud image on latitude (#1275)
4a654b3
unverified
winglian
commited on
simplify haldning for newer multipack patches so they can be added in a single place (#1270)
5698943
unverified
winglian
commited on
contributor avatars (#1269)
411293b
unverified
winglian
commited on
Fix bug preventing model_kwargs being injected (#1262)
73f1bda
unverified
Zac Brannelly
commited on
lock pytorch (#1247) [skip ci]
1c7ed26
unverified
JohanWork
commited on
Add more save strategies for DPO training. (#1255)
13eea21
unverified
Philip May
commited on
Fix typo `bloat16` -> `bfloat16` (#1257)
1072f28
unverified
chiragjn
commited on
Pretrain transforms (#1261)
c7cf381
unverified
winglian
commited on
relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified
winglian
commited on
add contact info for dedicated support for axolotl [skip ci] (#1243)
dfd1885
unverified
winglian
commited on
support for true batches with multipack (#1230)
00568c1
unverified
winglian
commited on
Peft deepspeed resume (#1227)
c67fb71
unverified
winglian
commited on
Support for additional_special_tokens (#1221) [skip ci]
25e037f
unverified
Update rlhf.md (#1237) [skip ci]
52c83d3
unverified
hamel
commited on
add a helpful motd for cloud image (#1235) [skip ci]
d113331
unverified
winglian
commited on
set torch version to what is installed during axolotl install (#1234)
8f2b591
unverified
winglian
commited on
Fix and document test_datasets (#1228)
5787e1a
unverified
Fix typo (#1231) [skip ci]
8608d80
unverified
xhedit
commited on
Peft lotfq (#1222)
4cb7900
unverified
winglian
commited on