Edit model card
README.md exists but content is empty. Use the Edit model card button to edit it.
Downloads last month
0
Safetensors
Model size
69B params
Tensor type
FP16
·
Inference API
Input a message to start chatting with mohitsha/Llama-2-70b-chat-hf-FP8-KV.
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Collection including mohitsha/Llama-2-70b-chat-hf-FP8-KV