Lisper Gemma 4 E2B Audio LoRA

This is the Lisper Gemma 4 E2B LoRA adapter for raw-audio lisp coaching.

Lisper is a hackathon prototype for low-pressure /s/ practice. It classifies a short speech clip as clear, frontal, lateral, dental, or palatal, then returns one concise reason, one corrective cue, and one encouragement line.

Model Lineage

Base model: google/gemma-4-E2B-it
Fine-tuning: Unsloth supervised fine-tuning with QLoRA / LoRA
Trainable parameters: about 29.86M of 5.15B
Training rows: 16,000
Validation rows: 2,000
Held-out test rows: 2,000
Training steps: 4,000
Selected checkpoint: checkpoint-2500

This is not a dense full-parameter fine-tune. The base model was frozen and the learned update is stored as LoRA adapter weights.

Evaluation

The release-quality evaluation is the v18 hybrid acoustic+Gemma path:

Held-out rows: 2,000
Hard errors: 0
Verdict: pass
Class match: 0.976
Clear/non-clear match: 0.989
Exact four-line format: 1.0
Reason/cue/encouragement present: 1.0

The evaluated pipeline uses acoustic features for the lisp-class hint and Gemma for structured coaching text and tone. Do not interpret these metrics as a pure direct-Gemma raw-audio classification result.

See eval_summary.json and publish_verdict.json for the public summary.

Companion Artifacts

Merged full checkpoint: thomasjvu/lisper-gemma4-e2b-audio-full
Browser q4f16 ONNX/WebGPU package: thomasjvu/lisper-gemma4-e2b-audio-onnx-q4f16
Server-side demo Space: thomasjvu/lisper-zerogpu

Limitations

The lisp dataset is synthetically generated from speaker-disjoint source speech.
This is a practice assistant, not a medical diagnosis tool or a replacement for a speech-language pathologist.
The browser q4f16 package is large for consumer devices, so a ZeroGPU fallback is provided.

Downloads last month: 14

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for thomasjvu/lisper-gemma4-e2b-audio-lora

Base model

google/gemma-4-E2B

Finetuned

google/gemma-4-E2B-it

Adapter

(79)

this model