apepkuss79's picture
Update README.md
f45d1e6 verified
|
raw
history blame
860 Bytes
metadata
license: llama3.1
model_name: Llama-3.1-Nemotron-70B-Instruct-HF
base_model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
inference: false
pipeline_tag: text-generation
library_name: transformers
model_creator: nvidia
quantized_by: Second State Inc.
tags:
  - nvidia
  - llama3.1

Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Original Model

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Run with Gaianet

Prompt template:

prompt template: llama-3-chat

Context size:

chat_ctx_size: 128000

Run with GaiaNet:

Quantized with llama.cpp b3932