Parse judgments with structured output prompting, one response model, one judge model at a time.
eb4ec23
justinxzhao
commited on