Verifiable Labs

company

AI & ML interests

AI agents, agent evaluation, promotion gates, synthetic evidence, formal methods, contamination-resistant evaluation, model evaluation

Recent Activity

stelioszach03 updated a Space 16 days ago

verifiablelabs/README

stelioszach03 published a Space 16 days ago

verifiablelabs/README

stelioszach03 updated a dataset 17 days ago

verifiablelabs/vlabs-clean-gate-evidence

View all activity

Organization Card

Community About org cards

Verifiable Labs

Verifiable Labs builds clean feedback and promotion gates for increasingly general AI agents.

We develop infrastructure for evaluating agent improvements, checking whether those improvements transfer to unseen/OOD/adversarial situations, and producing synthetic/redacted evidence artifacts for promotion decisions.

Public resources

GitHub: https://github.com/verifiablelabs
PyPI: https://pypi.org/project/vlabs-sdk/0.0.2/
Install: pip install "vlabs-sdk==0.0.2"
Dataset: https://huggingface.co/datasets/verifiablelabs/vlabs-clean-gate-evidence
W&B: https://wandb.ai/verifiable-labs/clean-generalization-gate

Evidence policy

Public evidence is synthetic/redacted and is not a training dataset.

It does not include customer data, hidden evals, gold answers, raw traces, private traps, private engine internals, secrets, or provider keys.

Formal scope

Selected mathematical properties behind the contamination-resistant promotion gate are machine-verified in Lean 4. The implementation is property-tested against the formal specification.