Showcase: Conversations with, and Musings of Gemma 4 31B it compressed

#1
by SINAPSA-IC - opened

Thank you for creating this so-easily-accessible version of Gemma 4, and for allowing the users access to it, along with Gemma 4 26B it compressed.

We've found this LLM (Gemma 4 31B it compressed) to be especially useful in reasoning (as far as we can corroborate with known information).

Tested it on knowledge about:

  • ancient history
  • humanities
  • movies
  • natural sciences: paleontology, astronomy

We have carried out several conversations with this LLM, and asked it for own musings on various subjects.

  • Software for interacting with LLMs:
    LM Studio (https://lmstudio.ai) - free to use (https://lmstudio.ai/blog/free-for-work)

  • LLM working parameters:
    Temperature: 0 (recommended value)
    Repeat Penalty: 1.15 (recommended: 1.5)
    Top K: 2 (recommended: 1)
    Top P: 0.95 (recommended: 1.0)
    MIn P: 0.9

  • Context Length set to: 60000

  • Evaluation Batch Size set to: 16384

  • backend: CUDA llama.cpp (Windows) 2.14.0

  • No System Prompt

  • the PDF files containing the conversations and musings, at URL (as of 2026.05.18):
    https://sinapsaro.ro/llmusings/sinapsa_ai_llmusings.htm

  • first four such items:

#01: A story about the most fantastic things and events this LLM can imagine
#02: Solipsism and Self-Awareness
#03: Theology
#04: Paradoxes of time travel

NONE of the links in this post should be viewed as advertising a product or company; they are provided as DIRECT links to repositories of FREE information and products.

Thanks for checking out this quant!

It can also handle complex coding tasks, etc!

It can also handle complex coding tasks, etc!

It's also pretty good for Roleplaying. Still just getting started on the role AI-RP-thing, sounds interesting for writing, but most models struggle with lots of stuff. This model allowed me to test a "heavier" model with larger context, so thank you for that. (Still think AI is not too great at that RP stuff BTW, or at least convinced that nothing I can run on my hardware does a really good job)

I'd definitely like to see more of these compressed models. In my head it just makes sense to do some smart-assery to imit noise to places where it doesn't cause as much trouble... But maybe I'm the weird one.

Sign up or log in to comment