
TinyLlama TensorRT-LLM Edition

This repo contains the TensorRT-LLM version of the TinyLlama model. The model was converted to run at float16 precision on NVIDIA TensorRT.
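As a rough illustration of what float16 precision means in practice, the sketch below uses only the Python standard library to round a value to half precision and to estimate the size of the weights at 2 bytes per parameter. The helper names `to_float16` and `weight_bytes` are hypothetical, and the parameter count of about 1.1 billion is an assumption about the TinyLlama checkpoint, not something stated in this repo. This is not the conversion procedure used to build the engine.

```python
import struct


def to_float16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value.

    Hypothetical helper for illustration: packs the value with the
    struct module's 'e' (binary16) format and unpacks it again.
    """
    return struct.unpack("<e", struct.pack("<e", x))[0]


def weight_bytes(num_params: int, bytes_per_param: int = 2) -> int:
    """Estimate weight storage; float16 uses 2 bytes per parameter."""
    return num_params * bytes_per_param


if __name__ == "__main__":
    # 0.1 is not exactly representable in float16, so a small rounding
    # error appears after the round trip.
    print("0.1 as float16:", to_float16(0.1))

    # Assumption: roughly 1.1 billion parameters.
    assumed_params = 1_100_000_000
    print("approx. weight size in GB:", weight_bytes(assumed_params) / 1e9)
```

Under that assumption the float16 weights take roughly half the space of the same weights stored in float32, at the cost of a small rounding error per value.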
