Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
ivangabriele
/
trl-sandbox
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
trl-sandbox
/
examples
/
research_projects
/
stack_llama
/
scripts
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
ivangabriele
feat: initialize project
2f5127c
verified
12 days ago
README.md
Safe
1.87 kB
feat: initialize project
12 days ago
merge_peft_adapter.py
Safe
2.61 kB
feat: initialize project
12 days ago
reward_modeling.py
Safe
11.9 kB
feat: initialize project
12 days ago
rl_training.py
Safe
10.3 kB
feat: initialize project
12 days ago
supervised_finetuning.py
Safe
7.73 kB
feat: initialize project
12 days ago