noamwies/llama-test-gqa-with-better-transformer

Text Generation

text-generation-inference

Inference Endpoints


A miniature Llama model for testing the grouped-query attention (GQA) variant of Llama in the BetterTransformer framework.
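For readers unfamiliar with the GQA variant mentioned above, the following is a minimal, dependency-free sketch of the idea: several query heads share one key/value head. It is an illustration only, not code from this model or from BetterTransformer. The function names (`kv_head_index`, `attend`, `gqa`) and the head counts in the usage note are hypothetical and are not taken from this model's configuration.

```python
import math


def kv_head_index(q_head, num_q_heads, num_kv_heads):
    """Return the key/value head that a given query head reads from."""
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("num_q_heads must be a multiple of num_kv_heads")
    group_size = num_q_heads // num_kv_heads  # query heads per KV head
    return q_head // group_size


def attend(query, keys, values):
    """Scaled dot-product attention for one query vector over a token list."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    # Numerically stable softmax over the token scores.
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    weights = [w / total for w in weights]
    dim = len(values[0])
    return [sum(w * val[d] for w, val in zip(weights, values)) for d in range(dim)]


def gqa(queries, keys, values):
    """Grouped-query attention for a single query position.

    queries: one vector per query head.
    keys, values: one list of token vectors per KV head.
    """
    num_q, num_kv = len(queries), len(keys)
    outputs = []
    for head, query in enumerate(queries):
        kv = kv_head_index(head, num_q, num_kv)  # shared KV head for this group
        outputs.append(attend(query, keys[kv], values[kv]))
    return outputs
```

With 4 query heads and 2 key/value heads (illustrative numbers), query heads 0 and 1 read from KV head 0, and query heads 2 and 3 read from KV head 1. Setting the number of KV heads equal to the number of query heads recovers ordinary multi-head attention, and a single KV head gives multi-query attention.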

Downloads last month: 141

Safetensors

Model size: 875k params

Tensor type: F32 · FP16
