Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
WendyHoang
/
Llama3-70B-RAG
like
0
Text Generation
Transformers
PyTorch
GGUF
llama
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
No model card
New: Create and edit this model card directly on the website!
Contribute a Model Card
Downloads last month
285
GGUF
Model size
70.6B params
Architecture
llama
4-bit
Q4_K_M
6-bit
Q6_K_M
Inference Examples
Text Generation
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to
Inference Endpoints (dedicated)
instead.
Model tree for
WendyHoang/Llama3-70B-RAG
Quantizations
1 model