Laguna-XS.2: Pre-compiled for AWS Neuron (trn2.3xlarge)

Pre-compiled and pre-sharded model artifacts for serving poolside/Laguna-XS.2 on AWS Trainium2 using NxD Inference.

Configuration

  • Instance: trn2.3xlarge (LNC=2, 4 logical cores)
  • TP degree: 4
  • Batch size: 4 for token generation (TKG), 1 for context encoding (CTE)
  • Max sequence length: 4096
  • Precision: BF16
  • SDK: Neuron SDK 2.29 (neuronx-cc 2.24, NxDI 0.9.17334)

Files

  • Compiled NEFFs (6 CTE + 6 TKG buckets): 4.3 GB
  • NxDI inference configuration: 12 KB
  • Sharded weights, TP rank 0: 16 GB
  • Sharded weights, TP rank 1: 16 GB
  • Sharded weights, TP rank 2: 16 GB
  • Sharded weights, TP rank 3: 16 GB
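As a back-of-envelope sanity check on the shard sizes above (assuming the weights are pure BF16 at 2 bytes per parameter and the four 16 GB shards together hold the full model), the implied parameter count is roughly 34B:

```python
# Rough parameter-count estimate from shard sizes.
# Assumptions: pure BF16 weights (2 bytes/param); the 4 TP shards
# combined contain the whole model with no significant duplication.
shard_gib = 16
tp_degree = 4
total_bytes = shard_gib * tp_degree * 1024**3
params_billion = total_bytes / 2 / 1e9
print(round(params_billion, 1))  # → 34.4
```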

Usage with vLLM
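A minimal serving sketch with vLLM's Neuron backend. The flags match the configuration above (TP=4, TKG batch size 4, 4096 max sequence length); the `NEURON_COMPILED_ARTIFACTS` variable and exact flag spellings are assumptions — check your vLLM and Neuron SDK documentation for the versions you have installed.

```shell
# Sketch only: point the Neuron vLLM integration at the pre-compiled
# artifacts so it skips recompilation (env var name is an assumption).
export NEURON_COMPILED_ARTIFACTS=/path/to/compiled-artifacts

vllm serve jburtoft/Laguna-XS2-neuron-compiled \
    --device neuron \
    --tensor-parallel-size 4 \
    --max-num-seqs 4 \
    --max-model-len 4096
```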

Performance

Metric                   Value
Throughput (BS=1)        ~50 tok/s (via vLLM)
Throughput (BS=4, raw)   223 tok/s
Throughput (BS=8, raw)   310 tok/s
TPOT (BS=1)              11 ms
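To relate the figures above (assuming the "raw" throughput numbers are aggregate across the batch), per-sequence decode latency can be derived from aggregate throughput and batch size:

```python
# Convert aggregate decode throughput to per-sequence time-per-output-token.
# Assumption: the "raw" tok/s figures sum tokens across all batched sequences.
def per_seq_tpot_ms(aggregate_tok_s: float, batch_size: int) -> float:
    per_seq_tok_s = aggregate_tok_s / batch_size
    return 1000.0 / per_seq_tok_s

print(round(per_seq_tpot_ms(223, 4), 1))  # BS=4 → 17.9
print(per_seq_tpot_ms(50, 1))             # BS=1 → 20.0
```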

Requirements
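A hedged install sketch matching the SDK versions listed in Configuration (Neuron SDK 2.29, neuronx-cc 2.24, NxDI 0.9.x). Package pins are assumptions inferred from those version numbers; verify exact versions against the Neuron SDK 2.29 release notes.

```shell
# Sketch only: package names/pins are assumptions drawn from the
# SDK versions stated above, not a verified requirements list.
python -m pip install \
    --extra-index-url https://pip.repos.neuron.amazonaws.com \
    "neuronx-cc==2.24.*" torch-neuronx neuronx-distributed-inference
```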


Model tree for jburtoft/Laguna-XS2-neuron-compiled
