Spaces:
Paused
Paused
gary-boon Claude Opus 4.6 commited on
Commit ·
0d76811
1
Parent(s): ecf49a2
Rename 'counterfactual' terminology to 'input ablation' in docstrings
Browse filesAligns with extension terminology update — these are ablation studies, not
true counterfactuals.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- backend/model_service.py +2 -2
- docs/rq1-mapping.md +1 -1
backend/model_service.py
CHANGED
|
@@ -3606,8 +3606,8 @@ class InterventionResponse(BaseModel):
|
|
| 3606 |
@app.post("/analyze/intervention")
|
| 3607 |
async def run_intervention(request: InterventionRequest, authenticated: bool = Depends(verify_api_key)):
|
| 3608 |
"""
|
| 3609 |
-
Run
|
| 3610 |
-
Re-evaluates a token position under modified conditions (masking, ablation, temperature sweep).
|
| 3611 |
"""
|
| 3612 |
if not manager.model:
|
| 3613 |
raise HTTPException(status_code=503, detail="Model not loaded")
|
|
|
|
| 3606 |
@app.post("/analyze/intervention")
|
| 3607 |
async def run_intervention(request: InterventionRequest, authenticated: bool = Depends(verify_api_key)):
|
| 3608 |
"""
|
| 3609 |
+
Run an input ablation on a cached generation run.
|
| 3610 |
+
Re-evaluates a token position under modified conditions (input masking, component ablation, temperature sweep).
|
| 3611 |
"""
|
| 3612 |
if not manager.model:
|
| 3613 |
raise HTTPException(status_code=503, detail="Model not loaded")
|
docs/rq1-mapping.md
CHANGED
|
@@ -89,7 +89,7 @@ This document maps each of the 4 visualizations (Attention, Token Size & Confide
|
|
| 89 |
**Your extension:**
|
| 90 |
- **Interactive head selection:** Developer chooses which head/layer to inspect in real-time
|
| 91 |
- **Code-specific annotations:** Highlight syntactic elements (keywords, identifiers, operators) with domain-specific color coding
|
| 92 |
-
- **
|
| 93 |
- **Task-embedded evaluation:** Developers use the tool during actual code review tasks (bug detection, prompt optimization), not just correlation studies
|
| 94 |
|
| 95 |
**Paltenghi et al. (2022): "Follow-up Attention: An Empirical Study of Developer and Neural Model Code Exploration"**
|
|
|
|
| 89 |
**Your extension:**
|
| 90 |
- **Interactive head selection:** Developer chooses which head/layer to inspect in real-time
|
| 91 |
- **Code-specific annotations:** Highlight syntactic elements (keywords, identifiers, operators) with domain-specific color coding
|
| 92 |
+
- **Input ablation queries:** "What changes if I remove this docstring? How does attention redistribute?"
|
| 93 |
- **Task-embedded evaluation:** Developers use the tool during actual code review tasks (bug detection, prompt optimization), not just correlation studies
|
| 94 |
|
| 95 |
**Paltenghi et al. (2022): "Follow-up Attention: An Empirical Study of Developer and Neural Model Code Exploration"**
|