micahr234/mouse-example-model

This repository contains a MOUSE model checkpoint.

Architecture

  • Backbone: qwen3
  • Hidden dimension: 1024
  • Heads: action_value
  • Action head: action_value

Encoder

StepEmbedder reads flat step-record dicts and projects each declared modality into the shared 1024-dimensional token space before the backbone.

Field Type Required Tensor shape Dtype Notes
action discrete yes [B, S] torch.long integer ids in [0, 3]
observation discrete yes [B, S] torch.long integer ids in [0, 63]
reward rff yes [B, S] torch.float32 scalar value
done discrete yes [B, S] torch.long integer ids in [0, 4]
- learnable no not read from step_stream n/a learned tokens; no input field

Install MouseCore

pip install mouse-core

Load The Model

import torch
from mouse_core import load_model

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = load_model("micahr234/mouse-example-model", map_location="cpu").eval().to(device)

Run Inference

The model accepts a list[list[dict]] batch of shape [B][S] — B sequences, each containing S step-record dicts with flat keys matching the encoder's declared modalities above.

# Batch shape: [B=1][S=1] — one sequence of one step.
batch = [[
    {
    "action": 0,
    "observation": 0,
    "reward": 0.0,
    "done": 0,
    }
]]
predictions, objective_data, cache = model(batch)

with torch.no_grad():
    predictions, _, cache = model(batch)
    action = model.get_action(predictions, temperature=0.0)

model() returns (predictions, objective_data, cache). objective_data is a TensorDict[B, S] of the modality tensors extracted by the encoder — pass it to objectives during training. For cached one-step rollout, keep cache and pass it back on the next call with use_cache=True.

Downloads last month
215
Video Preview
loading