Commit 61cf76d
Parent(s): bf3e402
Fix TRL version conflict: remove trl==0.12.1 pin that broke GRPOConfig import
Unsloth installs a GRPO-compatible TRL (>=0.13.0), but the explicit
trl==0.12.1 pin was downgrading it to a version without GRPOConfig/GRPOTrainer.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Dockerfile +1 -3
Dockerfile CHANGED
@@ -18,11 +18,9 @@ RUN pip install --no-cache-dir -U pip setuptools wheel
 # 1. Install Unsloth first (it's the most sensitive to environment)
 RUN pip install --no-cache-dir "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
 
-# 2.
-# We use 0.12.1 for TRL and 4.48.2 for transformers - these are stable with Torch 2.5.1
+# 2. Pin RL stack — do NOT pin trl; Unsloth already installed a GRPO-compatible version
 RUN pip install --no-cache-dir \
     "transformers==4.48.2" \
-    "trl==0.12.1" \
     "peft==0.14.0" \
     "accelerate==1.2.1" \
     "bitsandbytes==0.45.0"
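The failure described in the commit message (an installed trl without GRPOConfig/GRPOTrainer) could be caught while the image is built rather than when training starts. The following is a minimal Python sketch and is not part of this commit. The helper name missing_names is hypothetical, and the sketch assumes only that both classes are exposed at the top level of the trl package, as the commit message implies.

```python
import importlib


def missing_names(module_name, names):
    """Return the entries of `names` that `module_name` does not expose.

    If the module cannot be imported at all, every name counts as missing.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return list(names)

    missing = []
    for name in names:
        try:
            getattr(module, name)
        except Exception:
            # Deliberately broad: packages that load attributes lazily may
            # raise something other than AttributeError when a lookup fails.
            missing.append(name)
    return missing


# Hypothetical build-time usage (needs trl installed, so it is left commented out):
#   missing = missing_names("trl", ["GRPOConfig", "GRPOTrainer"])
#   if missing:
#       raise SystemExit("installed trl is missing: " + ", ".join(missing))
```

One possible use is to run such a check in a RUN step placed after the pinned installs and exit non-zero when the returned list is not empty, so that a later change which downgrades trl again fails the build instead of the training job.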