Nitishkumar-ai Claude Opus 4.6 commited on
Commit
61cf76d
·
1 Parent(s): bf3e402

Fix TRL version conflict: remove trl==0.12.1 pin that broke GRPOConfig import

Browse files

Unsloth installs a GRPO-compatible TRL (>=0.13.0), but the explicit
trl==0.12.1 pin was downgrading it to a version without GRPOConfig/GRPOTrainer.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Files changed (1) hide show
  1. Dockerfile +1 -3
Dockerfile CHANGED
@@ -18,11 +18,9 @@ RUN pip install --no-cache-dir -U pip setuptools wheel
18
  # 1. Install Unsloth first (it's the most sensitive to environment)
19
  RUN pip install --no-cache-dir "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
20
 
21
- # 2. Install specifically pinned versions of the RL stack
22
- # We use 0.12.1 for TRL and 4.48.2 for transformers - these are stable with Torch 2.5.1
23
  RUN pip install --no-cache-dir \
24
  "transformers==4.48.2" \
25
- "trl==0.12.1" \
26
  "peft==0.14.0" \
27
  "accelerate==1.2.1" \
28
  "bitsandbytes==0.45.0"
 
18
  # 1. Install Unsloth first (it's the most sensitive to environment)
19
  RUN pip install --no-cache-dir "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
20
 
21
+ # 2. Pin RL stack do NOT pin trl; Unsloth already installed a GRPO-compatible version
 
22
  RUN pip install --no-cache-dir \
23
  "transformers==4.48.2" \
 
24
  "peft==0.14.0" \
25
  "accelerate==1.2.1" \
26
  "bitsandbytes==0.45.0"