rombodawg committed on
Commit
5960e84
1 Parent(s): 9ef25ca

Update README.md

Files changed (1)
  1. README.md +7 -44
README.md CHANGED
@@ -1,54 +1,17 @@
  ---
  base_model: []
  library_name: transformers
- tags:
- - mergekit
- - merge
-
  ---
- # Llama-3-BIG-Instruct
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the passthrough merge method.
-
- ### Models Merged
-
- The following models were included in the merge:
- * /media/kquant/SSD/Model-2
- * /media/kquant/SSD/Model-1
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- dtype: float16
- merge_method: passthrough
- slices:
- - sources:
-   - layer_range: [0, 8]
-     model: /media/kquant/SSD/Model-1
- - sources:
-   - layer_range: [4, 12]
-     model: /media/kquant/SSD/Model-2
- - sources:
-   - layer_range: [8, 16]
-     model: /media/kquant/SSD/Model-1
- - sources:
-   - layer_range: [12, 20]
-     model: /media/kquant/SSD/Model-2
- - sources:
-   - layer_range: [16, 24]
-     model: /media/kquant/SSD/Model-1
- - sources:
-   - layer_range: [20, 28]
-     model: /media/kquant/SSD/Model-2
- - sources:
-   - layer_range: [24, 32]
-     model: /media/kquant/SSD/Model-1
-
- ```
+ Llama-3-13B-Instruct
+
+ Thank you to Meta for the weights of Meta-Llama-3-8B-Instruct
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png)
+
+ This is an upscaling of the Meta-Llama-3-8B-Instruct AI model, using techniques created for Mistral-Evolved-11b-v0.1. The model has been upscaled from 8B parameters to 13B parameters without any continuous pretraining or fine-tuning.
+
+ From testing, the model seems to function perfectly at fp16, but has some issues at 4-bit quantization.
+
+ The model used to create this one is linked below:
+
+ https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
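
The mergekit passthrough configuration removed in this commit interleaves overlapping layer ranges from two copies of the base model. A minimal sketch of the resulting layer layout, assuming (as the slice ranges suggest) that Model-1 and Model-2 are both the same 32-layer Llama-3-8B-Instruct checkpoint:

```python
# Layer interleaving implied by the removed passthrough config
# (assumption: both source models are 32-layer Llama-3-8B-Instruct).
slices = [(0, 8), (4, 12), (8, 16), (12, 20), (16, 24), (20, 28), (24, 32)]

# Passthrough copies each slice's layers verbatim, so overlapping
# ranges duplicate layers rather than averaging them.
layers = [i for start, end in slices for i in range(start, end)]

print(len(layers))       # 56 layers in the upscaled model, up from 32
print(len(set(layers)))  # still only 32 distinct source layers
```

At roughly 200M+ parameters per Llama-3-8B transformer layer plus the shared embeddings, 56 layers lands in the ~13B range, consistent with the model's name.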