meaculpitt commited on
Commit
85c0280
·
verified ·
1 Parent(s): 308aea6

chute_config: fix param names (shutdown_after_seconds, max_hourly_price_per_gpu moved to NodeSelector). Previously silently dropped → chute deprovisioned after 5min idle.

Browse files
Files changed (1) hide show
  1. chute_config.yml +13 -11
chute_config.yml CHANGED
@@ -7,20 +7,22 @@ Image:
7
  NodeSelector:
8
  gpu_count: 1
9
  min_vram_gb_per_gpu: 16
10
- # Chutes caps exclude list at 5 items. max_hourly_price below handles
11
- # cost filtering (blocks anything >$0.50/hr); this list handles
12
- # compatibility-only concerns. Pattern copied from known-working
13
- # person/vehicle production chute (VALIDATION_GUIDE reference).
14
  exclude:
15
- - "5090" # newest Blackwell consumer; sm120 compat unverified
16
- - "b200" # Blackwell data center; expensive + sm100
17
- - "h200" # Hopper; very expensive
18
- - "mi300x" # AMD; different runtime stack
19
 
20
  Chute:
21
- timeout_seconds: 300
 
 
 
 
22
  concurrency: 4
23
  max_instances: 5
24
  scaling_threshold: 0.5
25
- shutdown_after: 288000
26
- max_hourly_price: 0.50
 
7
  NodeSelector:
8
  gpu_count: 1
9
  min_vram_gb_per_gpu: 16
10
+ # max_hourly_price_per_gpu belongs in NodeSelector, not Chute
11
+ max_hourly_price_per_gpu: 0.50
12
+ # Chutes caps exclude at 5. Pattern from production person/vehicle chute.
 
13
  exclude:
14
+ - "5090"
15
+ - "b200"
16
+ - "h200"
17
+ - "mi300x"
18
 
19
  Chute:
20
+ # Chute.__init__ only accepts these params; anything else is silently
21
+ # dropped by safe_instantiate() in the template. Previously-used keys
22
+ # `shutdown_after` (was dropped because the real name is
23
+ # `shutdown_after_seconds`) and `timeout_seconds` (not a Chute param)
24
+ # caused the chute to auto-deprovision after 5 min of idle.
25
  concurrency: 4
26
  max_instances: 5
27
  scaling_threshold: 0.5
28
+ shutdown_after_seconds: 288000 # 80h idle tolerance