Spaces: Dovakiins / qwerrwe

qwerrwe / src / axolotl / utils

100 contributors

History: 533 commits

Latest commit by bofenghuang: Fix `total_num_steps` (#1566), 81da7d2, unverified, 7 months ago

Name                    Size       Updated         Last commit
callbacks               -          7 months ago    make sure to save on the last step (#1615)
config                  -          7 months ago    FIX: max_length and max_prompt_length was not being sent to ORPOTrainer (#1584)
data                    -          7 months ago    wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
gradient_checkpointing  -          7 months ago    Unsloth gradient checkpointing offload (#1528)
samplers                -          10 months ago   support for true batches with multipack (#1230)
__init__.py             136 Bytes  8 months ago    WIP: Support table logging for mlflow, too (#1506)
bench.py                2.44 kB    9 months ago    FDSP + QLoRA (#1378)
chat_templates.py       4.15 kB    7 months ago    feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553)
collators.py            9.73 kB    7 months ago    fix attention mask collation (#1603)
dict.py                 327 Bytes  9 months ago    Pydantic 2.x cfg (#1239)
distributed.py          7.14 kB    8 months ago    DBRX Model Support (#1462)
freeze.py               8.33 kB    8 months ago    ORPO (#1419)
lora_embeddings.py      375 Bytes  10 months ago   Phi2 multipack (#1173)
mlflow_.py              543 Bytes  9 months ago    ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci]
model_shard_quant.py    8.95 kB    8 months ago    DBRX Model Support (#1462)
models.py               37.7 kB    7 months ago    feat: exclude mamba blocks for jamba (#1578)
schedulers.py           7.66 kB    10 months ago   Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
tokenization.py         5.16 kB    7 months ago    Add debug option for RL dataset preprocessing (#1404)
trainer.py              16.1 kB    7 months ago    Fix `total_num_steps` (#1566)
wandb_.py               609 Bytes  12 months ago   Feat(wandb): Refactor to be more flexible (#767)