Enxin/sparse
TensorBoard
Safetensors
Branch: main
Path: sparse/ms-swift/swift/trainers/rlhf_trainer
1 contributor, 1 commit in history
Latest commit: 96fe658 (verified), "Upload folder using huggingface_hub", by Enxin, about 1 month ago
Name               Size        Status  Last commit                          Updated
__pycache__                            Upload folder using huggingface_hub  about 1 month ago
__init__.py        1.32 kB             Upload folder using huggingface_hub  about 1 month ago
cpo_trainer.py     1.29 kB     Safe    Upload folder using huggingface_hub  about 1 month ago
dpo_trainer.py     6.38 kB             Upload folder using huggingface_hub  about 1 month ago
gkd_trainer.py     7.6 kB              Upload folder using huggingface_hub  about 1 month ago
grpo_trainer.py    91.9 kB             Upload folder using huggingface_hub  about 1 month ago
kto_trainer.py     2.41 kB     Safe    Upload folder using huggingface_hub  about 1 month ago
orpo_trainer.py    633 Bytes   Safe    Upload folder using huggingface_hub  about 1 month ago
ppo_trainer.py     3.6 kB              Upload folder using huggingface_hub  about 1 month ago
reward_trainer.py  3.97 kB             Upload folder using huggingface_hub  about 1 month ago
rlhf_mixin.py      5.75 kB             Upload folder using huggingface_hub  about 1 month ago
utils.py           9.61 kB             Upload folder using huggingface_hub  about 1 month ago
vllm_client.py     11.1 kB             Upload folder using huggingface_hub  about 1 month ago