TFMC/imatrix-dataset-for-japanese-llm を使用してimatrixデータを生成。

詳細はここのissueを追ってください。

GGUF

Model size

341B params

Architecture

nemotron4

2-bit

3-bit

4-bit

8-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for yayoimizuha/Nemotron-4-340B-Instruct-imatrix-GGUF

Base model

Quantized

(1)

this model

yayoimizuha
/

Nemotron-4-340B-Instruct-imatrix-GGUF