Benchmarks of reasoning levels?

#6
by coder543 - opened

Are there any benchmarks of the reasoning levels that are mentioned in the Step-3.7-Flash-GGUF README? It would be nice to see how token usage varies across the reasoning levels for a set of standardized benchmarks, and how each level affects the scores on those benchmarks.

coder543 changed discussion title from Benchmarks of reasoning levels to Benchmarks of reasoning levels?
