Compliment Forest MiniCPM5-1B

This is the merged text fine-tune for The Compliment Forest. It turns a name and situation into schema-valid, situation-grounded forest JSON for the app's author pass. The same model also runs a bounded critic pass that prunes generic or redundant clearings.

Training

  • Base: openbmb/MiniCPM5-1B (Llama architecture, about 1.08B parameters)
  • Data: build-small-hackathon/compliment-forest-sft
  • Method: 4-bit NF4 QLoRA on Modal
  • LoRA: rank 16, alpha 32, dropout 0.05
  • Targets: attention and MLP projections
  • Sequence length: 2,048
  • Epochs: 2
  • Learning rate: 2e-4 with cosine decay
  • Runtime thinking mode: disabled for deterministic JSON generation

The dataset was filtered for JSON validity, concrete situation grounding, non-toxic positivity, and short first-person spells. This model is for whimsical encouragement; it is not a therapist or a substitute for professional support.

Inference

Use the base model's chat template with enable_thinking=False. The app enforces the output with Pydantic and retries malformed generations at most twice.

The repository also includes a Q4_K_M GGUF build for local llama.cpp inference.

License

Apache-2.0, following the base model and project code. Dataset source licenses are documented on the dataset card.

Downloads last month
151
Safetensors
Model size
1B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for build-small-hackathon/compliment-forest-minicpm5-1b

Quantized
(36)
this model

Dataset used to train build-small-hackathon/compliment-forest-minicpm5-1b

Spaces using build-small-hackathon/compliment-forest-minicpm5-1b 2