ssmits
/

Llama-3.1-Nemotron-92B-Instruct-HF-late

+---
+base_model:
+- nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# output-Llama-3.1-Nemotron-93B-Instruct-HF-late
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the passthrough merge method.
+### Models Merged
+The following models were included in the merge:
+* [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+dtype: bfloat16
+merge_method: passthrough
+slices:
+- sources:
+  - layer_range:
+    - 0
+    - 55
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+- sources:
+  - layer_range:
+    - 50
+    - 60
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+- sources:
+  - layer_range:
+    - 55
+    - 65
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+- sources:
+  - layer_range:
+    - 60
+    - 70
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+- sources:
+  - layer_range:
+    - 65
+    - 75
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+- sources:
+  - layer_range:
+    - 70
+    - 80
+    model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
+```