apepkuss79's picture
Update README.md
5b937f1 verified
|
raw
history blame
828 Bytes
metadata
license: llama3.1
model_name: Llama-3.1-Nemotron-70B-Instruct-HF-GGUF
base_model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF
inference: false
model_creator: nvidia
quantized_by: Second State Inc.
tags:
  - nvidia
  - llama3.1
  - reward model

Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Original Model

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Run with Gaianet

Prompt template:

prompt template: llama-3-chat

Context size:

chat_ctx_size: 128000

Run with GaiaNet:

Quantized with llama.cpp b3932