Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nm-testing
/
Llama-2-70b-chat-hf-W8A8-Dynamic-Per-Token
like
0
Follow
NM Testing
31
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
8-bit precision
compressed-tensors
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
No model card
New: Create and edit this model card directly on the website!
Contribute a Model Card
Downloads last month
16
Safetensors
Model size
69B params
Tensor type
FP16
·
I8
·
Inference Examples
Text Generation
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to
Inference Endpoints (dedicated)
instead.