Text Generation
Transformers
Safetensors
PEFT
English
Chinese
qwen3_5
image-text-to-text
veriloop
veriloop-coder
code
coding-agent
software-engineering
repository-understanding
tool-use
lora
harness-engineering
evidence-binding
rollback
uncertainty-calibration
long-context
open-weights
conversational
Upload README.md
Browse files
README.md
CHANGED
|
@@ -1,89 +1,64 @@
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
pipeline_tag: text-generation
|
| 4 |
-
license:
|
|
|
|
| 5 |
base_model:
|
| 6 |
- Qwen/Qwen3.6-27B
|
|
|
|
| 7 |
language:
|
| 8 |
- en
|
|
|
|
| 9 |
tags:
|
| 10 |
-
- safetensors
|
| 11 |
- qwen3_6
|
|
|
|
| 12 |
- qwen
|
|
|
|
|
|
|
| 13 |
- code
|
| 14 |
- coding-agent
|
| 15 |
-
- software-engineering
|
| 16 |
-
- harness-engineering
|
| 17 |
- agentic-coding
|
|
|
|
| 18 |
- repository-understanding
|
| 19 |
- tool-use
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
- evidence-binding
|
| 21 |
- rollback
|
| 22 |
- uncertainty-calibration
|
| 23 |
-
-
|
| 24 |
-
-
|
|
|
|
|
|
|
| 25 |
---
|
| 26 |
|
| 27 |
# VeriLoop Coder-E1
|
| 28 |
|
| 29 |
-
**VeriLoop Coder-E1** is an open-weight coding model release built on a Qwen3.6-27B backbone and
|
| 30 |
-
|
| 31 |
-
This release is designed for developers and researchers who want a coding model that is not only fluent at code generation, but also more prepared for tool-mediated, evidence-aware, rollback-safe, and uncertainty-calibrated coding pipelines.
|
| 32 |
|
| 33 |
-
|
| 34 |
|
| 35 |
-
|
| 36 |
-
|
| 37 |
-
- `toolspec_adapter/adapter`
|
| 38 |
-
- `uncertainty_adapter/adapter`
|
| 39 |
-
- `rollback_adapter/adapter`
|
| 40 |
-
- `evidence_adapter/adapter`
|
| 41 |
-
|
| 42 |
-
The public release follows the standard Hugging Face / PEFT adapter format. Internal production runtime components, private runtime heads, training data, logs, and orchestration code are not included in this public model card.
|
| 43 |
|
| 44 |
---
|
| 45 |
|
| 46 |
## Highlights
|
| 47 |
|
| 48 |
-
VeriLoop Coder-E1 is
|
| 49 |
-
|
| 50 |
-
Key capability directions include:
|
| 51 |
-
|
| 52 |
-
- **Harness-ready coding behavior** — trained to operate cleanly inside external coding runtimes, validators, tool routers, and repair loops.
|
| 53 |
-
- **Tool-spec awareness** — improves obedience to tool-call schemas, preconditions, postconditions, and execution-facing instruction formats.
|
| 54 |
-
- **Evidence-bound reasoning style** — encourages stronger alignment between claims, code changes, validation signals, and supporting context.
|
| 55 |
-
- **Rollback and revision discipline** — improves behavior around failed edits, validator feedback, worktree-sensitive repairs, and bounded correction loops.
|
| 56 |
-
- **Uncertainty calibration** — improves routing signals for answer uncertainty, evidence gaps, execution necessity, specification mismatch, and risk pressure.
|
| 57 |
-
- **Repository-scale workflow orientation** — intended for code understanding, patch drafting, iterative debugging, and agentic software engineering tasks.
|
| 58 |
-
- **Open standard artifacts** — released with `safetensors` backbone weights and PEFT-compatible adapter checkpoints for reproducible public loading.
|
| 59 |
|
| 60 |
-
|
| 61 |
-
|
| 62 |
-
--
|
| 63 |
-
|
| 64 |
-
|
| 65 |
-
|
| 66 |
-
|
| 67 |
-
|
| 68 |
-
- Qwen3.6-27B-compatible model files in the repository root.
|
| 69 |
-
- Standard `safetensors` model shards.
|
| 70 |
-
- Tokenizer and generation configuration files.
|
| 71 |
-
- Four public PEFT adapters:
|
| 72 |
-
- ToolSpec adapter
|
| 73 |
-
- Uncertainty adapter
|
| 74 |
-
- Rollback adapter
|
| 75 |
-
- Evidence Binding adapter
|
| 76 |
-
- Public adapter manifests and metric summaries.
|
| 77 |
-
|
| 78 |
-
### Not included in this public release
|
| 79 |
-
|
| 80 |
-
- Private runtime heads.
|
| 81 |
-
- Internal harness orchestration code.
|
| 82 |
-
- Training JSONL files and evaluation JSONL files.
|
| 83 |
-
- Internal logs, checkpoints, optimizer states, and scheduler states.
|
| 84 |
-
- Private routing, sandbox, memory, evidence-gate, or production-serving logic.
|
| 85 |
|
| 86 |
-
|
| 87 |
|
| 88 |
---
|
| 89 |
|
|
@@ -93,13 +68,41 @@ This separation is intentional: the public repository provides standard model as
|
|
| 93 |
|---|---|
|
| 94 |
| Model family | VeriLoop Coder-E1 |
|
| 95 |
| Backbone | Qwen3.6-27B-compatible backbone |
|
| 96 |
-
| Public release type | Open-weight backbone + PEFT adapters |
|
| 97 |
| Primary domain | Coding, software engineering, coding-agent workflows |
|
|
|
|
| 98 |
| Weight format | `safetensors` |
|
| 99 |
| Adapter format | PEFT / LoRA-style adapter checkpoints |
|
| 100 |
| Runtime target | Harness-driven coding systems, tool-mediated agents, repository workflows |
|
|
|
|
| 101 |
|
| 102 |
-
The
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 103 |
|
| 104 |
---
|
| 105 |
|
|
@@ -107,12 +110,12 @@ The backbone inherits the long-context and coding-oriented capabilities of Qwen3
|
|
| 107 |
|
| 108 |
| Adapter | Folder | Public files | Role |
|
| 109 |
|---|---|---|---|
|
| 110 |
-
| ToolSpec | `toolspec_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Tool-call discipline, schema obedience,
|
| 111 |
-
| Uncertainty | `uncertainty_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Runtime uncertainty calibration across answer, evidence, execution,
|
| 112 |
| Rollback | `rollback_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Validator-aware repair behavior, rollback discipline, bounded revision control |
|
| 113 |
| Evidence Binding | `evidence_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Stronger alignment between claims, evidence, provenance, and validation context |
|
| 114 |
|
| 115 |
-
Each adapter is published independently. Users can load one adapter at a time for focused
|
| 116 |
|
| 117 |
---
|
| 118 |
|
|
@@ -124,9 +127,7 @@ Each adapter is published independently. Users can load one adapter at a time fo
|
|
| 124 |
pip install -U transformers peft accelerate safetensors
|
| 125 |
```
|
| 126 |
|
| 127 |
-
|
| 128 |
-
|
| 129 |
-
### Load the backbone
|
| 130 |
|
| 131 |
```python
|
| 132 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
@@ -149,19 +150,29 @@ model = AutoModelForCausalLM.from_pretrained(
|
|
| 149 |
model.eval()
|
| 150 |
```
|
| 151 |
|
| 152 |
-
### Load
|
| 153 |
|
| 154 |
```python
|
|
|
|
| 155 |
from peft import PeftModel
|
|
|
|
| 156 |
|
| 157 |
repo_id = "veriloop-lab/veriloop-coder-e1"
|
|
|
|
| 158 |
|
| 159 |
-
|
| 160 |
-
|
| 161 |
repo_id,
|
| 162 |
-
|
|
|
|
|
|
|
| 163 |
)
|
| 164 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 165 |
model.eval()
|
| 166 |
```
|
| 167 |
|
|
@@ -174,25 +185,28 @@ rollback_adapter/adapter
|
|
| 174 |
evidence_adapter/adapter
|
| 175 |
```
|
| 176 |
|
| 177 |
-
###
|
| 178 |
|
| 179 |
```python
|
| 180 |
-
prompt = "Write a
|
| 181 |
|
| 182 |
messages = [
|
| 183 |
-
{"role": "user", "content": prompt}
|
| 184 |
]
|
| 185 |
|
| 186 |
-
|
| 187 |
messages,
|
|
|
|
| 188 |
add_generation_prompt=True,
|
| 189 |
-
|
| 190 |
-
).to(model.device)
|
| 191 |
|
|
|
|
| 192 |
outputs = model.generate(
|
| 193 |
-
inputs,
|
| 194 |
-
max_new_tokens=
|
| 195 |
-
|
|
|
|
|
|
|
| 196 |
)
|
| 197 |
|
| 198 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
@@ -200,106 +214,58 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
| 200 |
|
| 201 |
---
|
| 202 |
|
| 203 |
-
##
|
| 204 |
|
| 205 |
-
The
|
| 206 |
|
| 207 |
-
|
| 208 |
|
| 209 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 210 |
|
| 211 |
-
|
| 212 |
-
2. Mount one VeriLoop PEFT adapter as a LoRA module.
|
| 213 |
-
3. Route requests to the adapter that matches the task profile.
|
| 214 |
-
4. For production coding agents, add external validation, sandboxing, and tool orchestration outside the model.
|
| 215 |
|
| 216 |
---
|
| 217 |
|
| 218 |
## Recommended Use Cases
|
| 219 |
|
| 220 |
-
VeriLoop Coder-E1 is intended for:
|
| 221 |
-
|
| 222 |
-
- Repository understanding and codebase navigation.
|
| 223 |
-
- Patch drafting and bounded code revision.
|
| 224 |
-
- Tool-mediated coding workflows.
|
| 225 |
-
- Validator-aware debugging loops.
|
| 226 |
-
- Evidence-aware code explanation.
|
| 227 |
-
- Coding-agent research and runtime integration.
|
| 228 |
-
- Experiments with uncertainty-aware code generation.
|
| 229 |
|
| 230 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 231 |
|
| 232 |
---
|
| 233 |
|
| 234 |
## Limitations
|
| 235 |
|
| 236 |
-
|
| 237 |
-
|
| 238 |
-
|
| 239 |
-
|
| 240 |
-
-
|
| 241 |
-
- Users should validate generated code before using it in production.
|
| 242 |
-
- Repository-scale behavior depends heavily on retrieval, context construction, and tool execution outside the model.
|
| 243 |
-
- Adapter composition should be tested carefully; do not assume that naively merging or stacking all adapters is optimal for every task.
|
| 244 |
-
- Public benchmark results for this release will be updated after standardized external evaluation.
|
| 245 |
-
|
| 246 |
-
---
|
| 247 |
-
|
| 248 |
-
## Evaluation Status
|
| 249 |
-
|
| 250 |
-
Public benchmark results are not yet included in this release.
|
| 251 |
-
|
| 252 |
-
The current repository is a public model-asset release focused on:
|
| 253 |
-
|
| 254 |
-
- Standard weight availability.
|
| 255 |
-
- Adapter availability.
|
| 256 |
-
- Reproducible loading.
|
| 257 |
-
- Harness-oriented coding model alignment.
|
| 258 |
-
|
| 259 |
-
External leaderboard and benchmark results will be added after controlled evaluation on standardized coding and agentic software-engineering benchmarks.
|
| 260 |
|
| 261 |
---
|
| 262 |
|
| 263 |
## Safety and Responsible Use
|
| 264 |
|
| 265 |
-
VeriLoop Coder-E1 is a coding-
|
| 266 |
-
|
| 267 |
-
Recommended safeguards:
|
| 268 |
-
|
| 269 |
-
- Run generated code in a sandbox before execution on real systems.
|
| 270 |
-
- Review file-system, network, credential, and destructive-operation behavior.
|
| 271 |
-
- Use static analysis and unit tests for generated patches.
|
| 272 |
-
- Do not grant unrestricted shell, repository, or deployment permissions without external policy checks.
|
| 273 |
-
- Treat the model as an assistant for software engineering, not as an autonomous authority.
|
| 274 |
-
|
| 275 |
-
For high-risk environments, deploy VeriLoop Coder-E1 behind explicit permission controls, audit logging, validation gates, and rollback procedures.
|
| 276 |
-
|
| 277 |
-
---
|
| 278 |
-
|
| 279 |
-
## Public vs. Production Capability
|
| 280 |
-
|
| 281 |
-
This Hugging Face repository provides the **public standard model layer**:
|
| 282 |
-
|
| 283 |
-
```text
|
| 284 |
-
27B backbone weights
|
| 285 |
-
+ four public PEFT adapters
|
| 286 |
-
+ public adapter manifests
|
| 287 |
-
```
|
| 288 |
-
|
| 289 |
-
A full production coding-agent stack may additionally include:
|
| 290 |
|
| 291 |
-
|
| 292 |
-
runtime orchestration
|
| 293 |
-
sandbox validation
|
| 294 |
-
evidence management
|
| 295 |
-
memory/context systems
|
| 296 |
-
self-check and repair loops
|
| 297 |
-
policy gates
|
| 298 |
-
observability
|
| 299 |
-
external expert routing
|
| 300 |
-
```
|
| 301 |
|
| 302 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 303 |
|
| 304 |
---
|
| 305 |
|
|
@@ -309,64 +275,69 @@ The public model is useful on its own for research and development. The stronges
|
|
| 309 |
README.md
|
| 310 |
config.json
|
| 311 |
configuration.json
|
| 312 |
-
generation_config.json
|
| 313 |
model.safetensors.index.json
|
| 314 |
-
tokenizer.json
|
| 315 |
-
tokenizer_config.json
|
| 316 |
-
special_tokens_map.json
|
| 317 |
-
merges.txt
|
| 318 |
-
preprocessor_config.json
|
| 319 |
-
video_preprocessor_config.json
|
| 320 |
veriloop-coder-e1-model-00001-of-00010.safetensors
|
| 321 |
...
|
| 322 |
veriloop-coder-e1-model-00010-of-00010.safetensors
|
|
|
|
|
|
|
|
|
|
|
|
|
| 323 |
|
| 324 |
-
toolspec_adapter/
|
| 325 |
-
|
| 326 |
-
|
| 327 |
-
|
| 328 |
-
|
| 329 |
-
|
| 330 |
-
|
| 331 |
-
|
| 332 |
-
|
| 333 |
-
uncertainty_adapter/
|
| 334 |
-
|
| 335 |
-
|
| 336 |
-
|
| 337 |
-
|
| 338 |
-
|
| 339 |
-
|
| 340 |
-
|
| 341 |
-
|
| 342 |
-
rollback_adapter/
|
| 343 |
-
|
| 344 |
-
|
| 345 |
-
|
| 346 |
-
|
| 347 |
-
|
| 348 |
-
|
| 349 |
-
|
| 350 |
-
|
| 351 |
-
evidence_adapter/
|
| 352 |
-
|
| 353 |
-
|
| 354 |
-
|
| 355 |
-
|
| 356 |
-
|
| 357 |
-
|
| 358 |
-
|
| 359 |
```
|
| 360 |
|
| 361 |
---
|
| 362 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 363 |
## Citation
|
| 364 |
|
| 365 |
-
If you use VeriLoop Coder-E1 in
|
| 366 |
|
| 367 |
```bibtex
|
| 368 |
@misc{veriloop_coder_e1_2026,
|
| 369 |
-
title = {VeriLoop Coder-E1: Harness-
|
| 370 |
author = {VeriLoop Lab},
|
| 371 |
year = {2026},
|
| 372 |
howpublished = {Hugging Face model repository},
|
|
@@ -378,10 +349,4 @@ If you use VeriLoop Coder-E1 in your work, please cite this repository:
|
|
| 378 |
|
| 379 |
## Acknowledgements
|
| 380 |
|
| 381 |
-
VeriLoop Coder-E1 is built on top of the Qwen3.6-27B open-weight
|
| 382 |
-
|
| 383 |
-
---
|
| 384 |
-
|
| 385 |
-
## License
|
| 386 |
-
|
| 387 |
-
This repository includes model assets derived from an upstream open-weight backbone and VeriLoop adapter artifacts. Users are responsible for complying with the upstream base-model license and any applicable VeriLoop release terms described in this repository.
|
|
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
pipeline_tag: text-generation
|
| 4 |
+
license: apache-2.0
|
| 5 |
+
license_link: https://huggingface.co/Qwen/Qwen3.6-27B/blob/main/LICENSE
|
| 6 |
base_model:
|
| 7 |
- Qwen/Qwen3.6-27B
|
| 8 |
+
base_model_relation: finetune
|
| 9 |
language:
|
| 10 |
- en
|
| 11 |
+
- zh
|
| 12 |
tags:
|
|
|
|
| 13 |
- qwen3_6
|
| 14 |
+
- qwen3
|
| 15 |
- qwen
|
| 16 |
+
- veriloop
|
| 17 |
+
- veriloop-coder
|
| 18 |
- code
|
| 19 |
- coding-agent
|
|
|
|
|
|
|
| 20 |
- agentic-coding
|
| 21 |
+
- software-engineering
|
| 22 |
- repository-understanding
|
| 23 |
- tool-use
|
| 24 |
+
- peft
|
| 25 |
+
- lora
|
| 26 |
+
- safetensors
|
| 27 |
+
- transformers
|
| 28 |
+
- harness-engineering
|
| 29 |
- evidence-binding
|
| 30 |
- rollback
|
| 31 |
- uncertainty-calibration
|
| 32 |
+
- long-context
|
| 33 |
+
- open-weights
|
| 34 |
+
- reasoning
|
| 35 |
+
- conversational
|
| 36 |
---
|
| 37 |
|
| 38 |
# VeriLoop Coder-E1
|
| 39 |
|
| 40 |
+
**VeriLoop Coder-E1** is an open-weight coding model release built on a Qwen3.6-27B backbone and specialized for harness-driven software engineering workflows.
|
|
|
|
|
|
|
| 41 |
|
| 42 |
+
The public release provides a standard, Hugging Face-compatible model package: sharded `safetensors` backbone weights in the repository root, plus four focused PEFT adapters for coding-agent behavior shaping. It is designed for developers and researchers who want a coding model that is not only fluent at code generation, but also better prepared for tool-mediated execution, evidence-aware reasoning, rollback-safe revision, and uncertainty-calibrated coding workflows.
|
| 43 |
|
| 44 |
+
> **Release note**
|
| 45 |
+
> This is the first public VeriLoop Coder-E1 27B release package. Public benchmark results will be added after the formal evaluation run. The current repository focuses on releasing clean, standard model artifacts without exposing internal production Harness code, private runtime heads, training data, or server-side orchestration logic.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
|
| 47 |
---
|
| 48 |
|
| 49 |
## Highlights
|
| 50 |
|
| 51 |
+
VeriLoop Coder-E1 is built for coding-agent environments where a model must cooperate with external tools, validators, repository context, and iterative repair loops.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 52 |
|
| 53 |
+
- **Harness-ready coding behavior** — shaped for coding runtimes that use tool calls, validation feedback, staged execution, and bounded repair loops.
|
| 54 |
+
- **Tool-spec alignment** — improves obedience to tool schemas, argument discipline, preconditions, postconditions, and execution-facing instruction formats.
|
| 55 |
+
- **Evidence-bound coding style** — encourages stronger alignment between claims, code edits, validation signals, and supporting repository context.
|
| 56 |
+
- **Rollback-aware revision behavior** — improves response patterns around failed edits, validator negation, worktree-sensitive repair, and safe correction boundaries.
|
| 57 |
+
- **Uncertainty-calibrated routing signals** — supports more reliable decisions around answer uncertainty, evidence gaps, execution necessity, specification mismatch, and risk pressure.
|
| 58 |
+
- **Repository-scale workflow orientation** — intended for code understanding, patch drafting, debugging, refactoring assistance, and agentic software engineering tasks.
|
| 59 |
+
- **Open standard artifacts** — released with `safetensors` model shards and PEFT-compatible adapter checkpoints for public loading and reproducible experimentation.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 60 |
|
| 61 |
+
VeriLoop Coder-E1 should be viewed as a **coding model foundation for harness-centric systems**. The full VeriLoop product experience may involve additional runtime components such as tool orchestration, sandbox validation, evidence handling, memory, observability, and API-side routing.
|
| 62 |
|
| 63 |
---
|
| 64 |
|
|
|
|
| 68 |
|---|---|
|
| 69 |
| Model family | VeriLoop Coder-E1 |
|
| 70 |
| Backbone | Qwen3.6-27B-compatible backbone |
|
| 71 |
+
| Public release type | Open-weight backbone + four PEFT adapters |
|
| 72 |
| Primary domain | Coding, software engineering, coding-agent workflows |
|
| 73 |
+
| Languages | English, Chinese |
|
| 74 |
| Weight format | `safetensors` |
|
| 75 |
| Adapter format | PEFT / LoRA-style adapter checkpoints |
|
| 76 |
| Runtime target | Harness-driven coding systems, tool-mediated agents, repository workflows |
|
| 77 |
+
| Public benchmark status | Formal benchmark results pending |
|
| 78 |
|
| 79 |
+
The public release separates standard model assets from private production runtime infrastructure. Users can load the base model directly, or mount one of the four public PEFT adapters for targeted experimentation.
|
| 80 |
+
|
| 81 |
+
---
|
| 82 |
+
|
| 83 |
+
## Public Release Contents
|
| 84 |
+
|
| 85 |
+
### Included
|
| 86 |
+
|
| 87 |
+
- Qwen3.6-27B-compatible backbone files in the repository root.
|
| 88 |
+
- Standard sharded `safetensors` model weights.
|
| 89 |
+
- Tokenizer, generation, and configuration files.
|
| 90 |
+
- Four public PEFT adapters:
|
| 91 |
+
- `toolspec_adapter/adapter`
|
| 92 |
+
- `uncertainty_adapter/adapter`
|
| 93 |
+
- `rollback_adapter/adapter`
|
| 94 |
+
- `evidence_adapter/adapter`
|
| 95 |
+
- Public adapter README files, metric summaries, and adapter manifests.
|
| 96 |
+
|
| 97 |
+
### Not Included
|
| 98 |
+
|
| 99 |
+
- Private runtime heads.
|
| 100 |
+
- Internal Harness orchestration code.
|
| 101 |
+
- Training JSONL files and evaluation JSONL files.
|
| 102 |
+
- Internal logs, checkpoints, optimizer states, and scheduler states.
|
| 103 |
+
- Private routing, sandbox, memory, evidence-gate, or production-serving logic.
|
| 104 |
+
|
| 105 |
+
This separation is intentional. The public repository provides standard open model assets, while production-grade coding-agent behavior may require a full runtime system around the model.
|
| 106 |
|
| 107 |
---
|
| 108 |
|
|
|
|
| 110 |
|
| 111 |
| Adapter | Folder | Public files | Role |
|
| 112 |
|---|---|---|---|
|
| 113 |
+
| ToolSpec | `toolspec_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Tool-call discipline, schema obedience, precondition/postcondition sensitivity |
|
| 114 |
+
| Uncertainty | `uncertainty_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Runtime uncertainty calibration across answer, evidence, execution, specification, and risk signals |
|
| 115 |
| Rollback | `rollback_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Validator-aware repair behavior, rollback discipline, bounded revision control |
|
| 116 |
| Evidence Binding | `evidence_adapter/adapter` | `adapter_config.json`, `adapter_model.safetensors` | Stronger alignment between claims, evidence, provenance, and validation context |
|
| 117 |
|
| 118 |
+
Each adapter is published independently. Users can load one adapter at a time for focused evaluation, or build their own adapter-selection policy around the public artifacts.
|
| 119 |
|
| 120 |
---
|
| 121 |
|
|
|
|
| 127 |
pip install -U transformers peft accelerate safetensors
|
| 128 |
```
|
| 129 |
|
| 130 |
+
### Load the Backbone
|
|
|
|
|
|
|
| 131 |
|
| 132 |
```python
|
| 133 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
|
| 150 |
model.eval()
|
| 151 |
```
|
| 152 |
|
| 153 |
+
### Load One Public PEFT Adapter
|
| 154 |
|
| 155 |
```python
|
| 156 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 157 |
from peft import PeftModel
|
| 158 |
+
import torch
|
| 159 |
|
| 160 |
repo_id = "veriloop-lab/veriloop-coder-e1"
|
| 161 |
+
adapter_subfolder = "evidence_adapter/adapter" # choose one adapter
|
| 162 |
|
| 163 |
+
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
|
| 164 |
+
base_model = AutoModelForCausalLM.from_pretrained(
|
| 165 |
repo_id,
|
| 166 |
+
torch_dtype=torch.bfloat16,
|
| 167 |
+
device_map="auto",
|
| 168 |
+
trust_remote_code=True,
|
| 169 |
)
|
| 170 |
|
| 171 |
+
model = PeftModel.from_pretrained(
|
| 172 |
+
base_model,
|
| 173 |
+
repo_id,
|
| 174 |
+
subfolder=adapter_subfolder,
|
| 175 |
+
)
|
| 176 |
model.eval()
|
| 177 |
```
|
| 178 |
|
|
|
|
| 185 |
evidence_adapter/adapter
|
| 186 |
```
|
| 187 |
|
| 188 |
+
### Minimal Generation Example
|
| 189 |
|
| 190 |
```python
|
| 191 |
+
prompt = "Write a Python function that validates whether a patch should be accepted after unit tests."
|
| 192 |
|
| 193 |
messages = [
|
| 194 |
+
{"role": "user", "content": prompt},
|
| 195 |
]
|
| 196 |
|
| 197 |
+
text = tokenizer.apply_chat_template(
|
| 198 |
messages,
|
| 199 |
+
tokenize=False,
|
| 200 |
add_generation_prompt=True,
|
| 201 |
+
)
|
|
|
|
| 202 |
|
| 203 |
+
inputs = tokenizer(text, return_tensors="pt").to(model.device)
|
| 204 |
outputs = model.generate(
|
| 205 |
+
**inputs,
|
| 206 |
+
max_new_tokens=1024,
|
| 207 |
+
temperature=0.6,
|
| 208 |
+
top_p=0.95,
|
| 209 |
+
do_sample=True,
|
| 210 |
)
|
| 211 |
|
| 212 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
|
|
| 214 |
|
| 215 |
---
|
| 216 |
|
| 217 |
+
## Serving Notes
|
| 218 |
|
| 219 |
+
The repository root contains the backbone model files and can be served with standard inference engines that support the underlying architecture. PEFT adapters may require framework-specific LoRA loading support.
|
| 220 |
|
| 221 |
+
### vLLM Backbone Serving
|
| 222 |
|
| 223 |
+
```bash
|
| 224 |
+
vllm serve veriloop-lab/veriloop-coder-e1 \
|
| 225 |
+
--trust-remote-code \
|
| 226 |
+
--tensor-parallel-size 2 \
|
| 227 |
+
--max-model-len 131072
|
| 228 |
+
```
|
| 229 |
|
| 230 |
+
For public PEFT adapters, use the serving engine's LoRA/adapter loading mechanism if supported by your deployment configuration. The full VeriLoop production setup may use additional private runtime components that are not part of this public release.
|
|
|
|
|
|
|
|
|
|
| 231 |
|
| 232 |
---
|
| 233 |
|
| 234 |
## Recommended Use Cases
|
| 235 |
|
| 236 |
+
VeriLoop Coder-E1 is intended for research and development in:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 237 |
|
| 238 |
+
- Coding-agent model evaluation.
|
| 239 |
+
- Tool-mediated code generation.
|
| 240 |
+
- Repository understanding and patch drafting.
|
| 241 |
+
- Validator-aware repair experiments.
|
| 242 |
+
- Evidence-aware coding workflows.
|
| 243 |
+
- Uncertainty-aware software engineering agents.
|
| 244 |
+
- Harness and runtime policy research.
|
| 245 |
|
| 246 |
---
|
| 247 |
|
| 248 |
## Limitations
|
| 249 |
|
| 250 |
+
- Public benchmark numbers are not yet included in this release and will be added after formal evaluation.
|
| 251 |
+
- The public repository does not include private runtime heads or internal Harness orchestration.
|
| 252 |
+
- Public adapter loading does not reproduce the complete VeriLoop production API behavior.
|
| 253 |
+
- Long-context and high-throughput serving require appropriate GPU memory, KV-cache planning, and inference-engine configuration.
|
| 254 |
+
- Users should validate generated code with tests, static analysis, sandboxing, and security review before deployment.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 255 |
|
| 256 |
---
|
| 257 |
|
| 258 |
## Safety and Responsible Use
|
| 259 |
|
| 260 |
+
VeriLoop Coder-E1 is a coding-focused model and may produce incorrect, insecure, incomplete, or environment-specific code. Users are responsible for validating outputs before use.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 261 |
|
| 262 |
+
Recommended safeguards include:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 263 |
|
| 264 |
+
- Run generated code in isolated environments.
|
| 265 |
+
- Review dependencies and shell commands before execution.
|
| 266 |
+
- Use automated tests and linters.
|
| 267 |
+
- Treat security-sensitive code paths as high risk.
|
| 268 |
+
- Avoid using generated code for destructive actions without human review.
|
| 269 |
|
| 270 |
---
|
| 271 |
|
|
|
|
| 275 |
README.md
|
| 276 |
config.json
|
| 277 |
configuration.json
|
|
|
|
| 278 |
model.safetensors.index.json
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 279 |
veriloop-coder-e1-model-00001-of-00010.safetensors
|
| 280 |
...
|
| 281 |
veriloop-coder-e1-model-00010-of-00010.safetensors
|
| 282 |
+
tokenizer.json
|
| 283 |
+
tokenizer_config.json
|
| 284 |
+
generation_config.json
|
| 285 |
+
special_tokens_map.json
|
| 286 |
|
| 287 |
+
toolspec_adapter/
|
| 288 |
+
README.md
|
| 289 |
+
metrics_summary.json
|
| 290 |
+
veriloop_adapter_manifest.json
|
| 291 |
+
adapter/
|
| 292 |
+
README.md
|
| 293 |
+
adapter_config.json
|
| 294 |
+
adapter_model.safetensors
|
| 295 |
+
|
| 296 |
+
uncertainty_adapter/
|
| 297 |
+
README.md
|
| 298 |
+
metrics_summary.json
|
| 299 |
+
veriloop_adapter_manifest.json
|
| 300 |
+
adapter/
|
| 301 |
+
README.md
|
| 302 |
+
adapter_config.json
|
| 303 |
+
adapter_model.safetensors
|
| 304 |
+
|
| 305 |
+
rollback_adapter/
|
| 306 |
+
README.md
|
| 307 |
+
metrics_summary.json
|
| 308 |
+
veriloop_adapter_manifest.json
|
| 309 |
+
adapter/
|
| 310 |
+
README.md
|
| 311 |
+
adapter_config.json
|
| 312 |
+
adapter_model.safetensors
|
| 313 |
+
|
| 314 |
+
evidence_adapter/
|
| 315 |
+
README.md
|
| 316 |
+
metrics_summary.json
|
| 317 |
+
veriloop_adapter_manifest.json
|
| 318 |
+
adapter/
|
| 319 |
+
README.md
|
| 320 |
+
adapter_config.json
|
| 321 |
+
adapter_model.safetensors
|
| 322 |
```
|
| 323 |
|
| 324 |
---
|
| 325 |
|
| 326 |
+
## Evaluation Status
|
| 327 |
+
|
| 328 |
+
Formal benchmark results are planned. Future updates may include coding-agent benchmarks, repository-level tasks, tool-use evaluations, validation/rollback tests, and long-context software-engineering workflows.
|
| 329 |
+
|
| 330 |
+
Until benchmark numbers are published, this model card should be interpreted as a release description and loading guide, not as a performance leaderboard claim.
|
| 331 |
+
|
| 332 |
+
---
|
| 333 |
+
|
| 334 |
## Citation
|
| 335 |
|
| 336 |
+
If you use VeriLoop Coder-E1 in research, prototypes, or agent systems, please cite:
|
| 337 |
|
| 338 |
```bibtex
|
| 339 |
@misc{veriloop_coder_e1_2026,
|
| 340 |
+
title = {VeriLoop Coder-E1: Harness-Ready Open-Weight Coding Model Release},
|
| 341 |
author = {VeriLoop Lab},
|
| 342 |
year = {2026},
|
| 343 |
howpublished = {Hugging Face model repository},
|
|
|
|
| 349 |
|
| 350 |
## Acknowledgements
|
| 351 |
|
| 352 |
+
VeriLoop Coder-E1 is built on top of the Qwen3.6-27B open-weight backbone. We thank the open-source model and tooling communities for enabling reproducible model development, adapter-based experimentation, and open deployment workflows.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|