Spaces: Dovakiins/qwerrwe
qwerrwe/src/axolotl/utils (revision 81da7d2)
100 contributors. History: 533 commits
Latest commit: bofenghuang, Fix `total_num_steps` (#1566), 81da7d2 (unverified), 7 months ago

| Name | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| callbacks/ | | | make sure to save on the last step (#1615) | 7 months ago |
| config/ | | | FIX: max_length and max_prompt_length was not being sent to ORPOTrainer (#1584) | 7 months ago |
| data/ | | | wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548) | 7 months ago |
| gradient_checkpointing/ | | | Unsloth gradient checkpointing offload (#1528) | 7 months ago |
| samplers/ | | | support for true batches with multipack (#1230) | 10 months ago |
| __init__.py | Safe | 136 Bytes | WIP: Support table logging for mlflow, too (#1506) | 8 months ago |
| bench.py | Safe | 2.44 kB | FDSP + QLoRA (#1378) | 9 months ago |
| chat_templates.py | Safe | 4.15 kB | feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553) | 7 months ago |
| collators.py | Safe | 9.73 kB | fix attention mask collation (#1603) | 7 months ago |
| dict.py | Safe | 327 Bytes | Pydantic 2.x cfg (#1239) | 9 months ago |
| distributed.py | Safe | 7.14 kB | DBRX Model Support (#1462) | 8 months ago |
| freeze.py | Safe | 8.33 kB | ORPO (#1419) | 8 months ago |
| lora_embeddings.py | Safe | 375 Bytes | Phi2 multipack (#1173) | 10 months ago |
| mlflow_.py | Safe | 543 Bytes | ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci] | 9 months ago |
| model_shard_quant.py | Safe | 8.95 kB | DBRX Model Support (#1462) | 8 months ago |
| models.py | | 37.7 kB | feat: exclude mamba blocks for jamba (#1578) | 7 months ago |
| schedulers.py | Safe | 7.66 kB | Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273) | 10 months ago |
| tokenization.py | Safe | 5.16 kB | Add debug option for RL dataset preprocessing (#1404) | 7 months ago |
| trainer.py | Safe | 16.1 kB | Fix `total_num_steps` (#1566) | 7 months ago |
| wandb_.py | Safe | 609 Bytes | Feat(wandb): Refactor to be more flexible (#767) | 12 months ago |