AI & ML interests

AI agents, agent evaluation, promotion gates, synthetic evidence, formal methods, contamination-resistant evaluation, model evaluation

Organization Card

Verifiable Labs

Verifiable Labs builds clean feedback and promotion gates for increasingly general AI agents.

We develop infrastructure for evaluating agent improvements, checking whether those improvements transfer to unseen, out-of-distribution (OOD), and adversarial situations, and producing synthetic or redacted evidence artifacts for promotion decisions.

Public resources

Evidence policy

Public evidence is synthetic/redacted and is not a training dataset.

It does not include customer data, hidden evals, gold answers, raw traces, private traps, private engine internals, secrets, or provider keys.

Formal scope

Selected mathematical properties behind the contamination-resistant promotion gate are machine-verified in Lean 4. The implementation is property-tested against the formal specification.
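
To illustrate what property-testing an implementation against a specification can look like, here is a minimal sketch. It is not Verifiable Labs' gate or test suite. The `promote` function, its `margin` parameter, and the monotonicity property are all hypothetical and chosen only for illustration. The idea is to state a property the gate should satisfy and check it on many random inputs.

```python
import random


def promote(baseline_scores, candidate_scores, margin=0.0):
    """Hypothetical gate (an assumption, not the real one): promote only if
    the candidate beats the baseline by more than `margin` on every split."""
    return all(c - b > margin for b, c in zip(baseline_scores, candidate_scores))


def check_monotonicity(trials=1000, seed=0):
    """Illustrative property: if a candidate is promoted, any candidate that
    scores at least as well on every split must also be promoted."""
    rng = random.Random(seed)
    for _ in range(trials):
        n = rng.randint(1, 5)
        baseline = [rng.random() for _ in range(n)]
        candidate = [rng.random() for _ in range(n)]
        # A second candidate that is never worse than the first on any split.
        improved = [c + rng.random() * 0.1 for c in candidate]
        margin = rng.random() * 0.2
        if promote(baseline, candidate, margin) and not promote(baseline, improved, margin):
            return False  # counterexample found: the property is violated
    return True


if __name__ == "__main__":
    print("monotonicity holds on sampled inputs:", check_monotonicity())
```

Random testing of this kind only samples the input space. A machine-checked proof, such as one written in Lean 4, establishes the property for all inputs of the formal model, which is why the two approaches are complementary.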
