You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

[This model is part of a manuscript in review. Public access to the model will be granted upon manuscript acceptance. If you wish to use this model earlier than that, please submit you request to one of HF's Insilico Medicine team members on this website]

Longevity-LLM (L-LLM)

A domain-adapted Qwen3.5-9B for aging and longevity biology. L-LLM is the result of continued pretraining + supervised fine-tuning + a reasoning-augmented continuation pass on a multi-domain corpus spanning clinical aging, epigenomics, transcriptomics, proteomics, and genetics. The two trained LoRA adapters were concatenated into a single rank-64 LoRA and merged into the base weights to produce this standalone bf16 checkpoint.

Methods

L-LLM was built by LoRA fine-tuning Qwen3.5-9B, a 9B-parameter hybrid transformer that interleaves Gated DeltaNet linear-attention layers with standard self-attention in a 3:1 ratio. Training data was assembled across three main domains:

Domain Sources
Knowledge priors UniProt protein/gene annotations, Gene Ontology, protein–protein interactions, pathway membership; published aging-clock formulas and CpG-site coefficients (Biolearn)
Clinical & epidemiology NHANES (age, mortality)
Epigenomics GEO DNA-methylation cohorts, CpG methylation profiles, aging-clock proxy tasks
Transcriptomics GTEx (tissue age), TCGA (cancer survival), expression-profile generation
Proteomics Olink plasma-proteomics panels, proteomic clock proxy tasks
Genetics & longevity OpenGenes (expression directionality), SynergyAge (lifespan), CellAge (senescence), anti-aging target classification
Reasoning corpus Prediction tasks augmented with frontier-model chain-of-thought traces

Approximate scale across all domains: ≈10⁶ training prompts at the order of several billion tokens total. Exact composition, prompt counts, and token counts will be reported in the forthcoming preprint.

Training proceeded in three stages on Qwen3.5-9B:

  1. Continued pretraining — knowledge priors only, raw text packed into 4,096-token blocks. Rank-32 LoRA with rsLoRA scaling, LR 2 × 10⁻⁵, 3 epochs.
  2. Supervised fine-tuning — aging prediction tasks in conversation format. Rank-32 LoRA initialized from the phase-1 adapter, standard α/r scaling, LR 1 × 10⁻⁴, 3 epochs.
  3. Reasoning continuation — continued the SFT adapter on the reasoning corpus, LR 3 × 10⁻⁵, ≈1 epoch, context length 32,768 with example packing.

All adapters targeted the 12 linear projections including the GatedDeltaNet modules. After training, the CPT and continued-SFT adapters were concatenated into a single rank-64 LoRA and merged into the base weights to produce this checkpoint. DeepSpeed ZeRO-2 on 2× NVIDIA H100 NVL 94 GB, bf16, flash-attention 2 on self-attention layers. Full details in the forthcoming preprint.

Example usage

from transformers.models.qwen3_5 import Qwen3_5ForConditionalGeneration
from transformers import AutoProcessor

model = Qwen3_5ForConditionalGeneration.from_pretrained(
    "insilicomedicine/longevity-llm",
    torch_dtype="bfloat16",
    trust_remote_code=True,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(
    "insilicomedicine/longevity-llm", trust_remote_code=True
)
messages = [
    {"role": "system", "content":
     "You are a helpful assistant with expertise in aging biology."},
    {"role": "user", "content": "What are the hallmarks of aging?"},
]
inputs = processor.apply_chat_template(
    messages, return_tensors="pt", add_generation_prompt=True,
    enable_thinking=False,
).to(model.device)
out = model.generate(inputs, max_new_tokens=400, do_sample=False)
print(processor.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))

For vLLM serving (OpenAI-compatible) use --trust-remote-code --dtype bfloat16. Thinking mode is not a server flag — pass per-request chat_template_kwargs={"enable_thinking": true} to opt in.

License

The fine-tuning additions in this repository are released under CC-BY-ND-4.0. The underlying Qwen3.5-9B weights remain under their original Apache-2.0 license.

Citation

@misc{insilicomedicine_longevity_llm_2026,
  title  = {Longevity-LLM: a domain-adapted Qwen3.5-9B for aging biology},
  author = {Insilico Medicine},
  year   = {2026},
  howpublished = {\url{https://huggingface.co/insilicomedicine/longevity-llm}}
}
Downloads last month
50
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for insilicomedicine/longevity-llm

Finetuned
Qwen/Qwen3.5-9B
Finetuned
(370)
this model