Bio-posttrain Gemma-4-E2B RNA SFT

RNA supervised fine-tuning (SFT) checkpoint from How Post-Training Shapes Biological Reasoning Models.

Model details

  • Base model: google/gemma-4-E2B-it
  • RNA embeddings: Precomputed TranscriptFormer (2048-d), projected via rna_projection.pt
  • LoRA: rank 32, alpha 64
  • Validation loss: 0.9227

Files

File Description
model.safetensors Merged LLM weights
rna_projection.pt Linear map RNA embed (2048) → text hidden
rna_model_config.json Architecture metadata

Loading

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model = AutoModelForCausalLM.from_pretrained("mims-harvard/bio-posttrain-gemma4-e2b-rna-sft", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("mims-harvard/bio-posttrain-gemma4-e2b-rna-sft")
proj = torch.load("rna_projection.pt", map_location="cpu")

RNA sequence embeddings are supplied offline at inference time. See the rna_models code in the BioReason repository.

Collection

Part of the Bio-posttrain collection.

Downloads last month
12
Safetensors
Model size
6B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mims-harvard/bio-posttrain-gemma4-e2b-rna-sft

Finetuned
(242)
this model

Collection including mims-harvard/bio-posttrain-gemma4-e2b-rna-sft