Edit model card
README.md exists but content is empty. Use the Edit model card button to edit it.
Downloads last month
144
Safetensors
Model size
6.74B params
Tensor type
FP16
·
Inference API
Input a message to start chatting with mohitsha/Llama-2-7b-chat-hf-FP8-KV.
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Collection including mohitsha/Llama-2-7b-chat-hf-FP8-KV