mlabonne committed
Commit 2ed062e
1 Parent(s): 1294f9e

Update README.md

Files changed (1)
  1. README.md +28 -2
README.md CHANGED
@@ -20,9 +20,35 @@ base_model:
 
 Meta-Llama-3-120B-Instruct is a self-merge with [meta-llama/Meta-Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct).
 
- It was inspired by large merges like [alpindale/goliath-120b](https://huggingface.co/alpindale/goliath-120b), [nsfwthrowitaway69/Venus-120b-v1.0](https://huggingface.co/nsfwthrowitaway69/Venus-120b-v1.0), [cognitivecomputations/MegaDolphin-120b](https://huggingface.co/cognitivecomputations/MegaDolphin-120b), and [wolfram/miquliz-120b-v2.0](https://huggingface.co/wolfram/miquliz-120b-v2.0).
+ It was inspired by large merges like:
 
- No eval yet, but it is approved by Eric Hartford: https://twitter.com/erhartford/status/1787050962114207886
+ - [alpindale/goliath-120b](https://huggingface.co/alpindale/goliath-120b)
+ - [nsfwthrowitaway69/Venus-120b-v1.0](https://huggingface.co/nsfwthrowitaway69/Venus-120b-v1.0)
+ - [cognitivecomputations/MegaDolphin-120b](https://huggingface.co/cognitivecomputations/MegaDolphin-120b)
+ - [wolfram/miquliz-120b-v2.0](https://huggingface.co/wolfram/miquliz-120b-v2.0)
+
+ ## 🔍 Applications
+
+ I recommend using this model for creative writing. It uses the Llama 3 chat template with a default context window of 8K (can be extended with rope theta).
+
+ Check the examples in the evaluation section to get an idea of its performance.
+
+ ## ⚡ Quantized models
+
+ Thanks to [Eric Hartford](https://huggingface.co/ehartford), [elinas](https://huggingface.co/elinas), and the [mlx-community](https://huggingface.co/mlx-community) for providing these models.
+
+ * **GGUF**: https://huggingface.co/cognitivecomputations/Meta-Llama-3-120B-Instruct-gguf
+ * **EXL2**: https://huggingface.co/elinas/Meta-Llama-3-120B-Instruct-4.0bpw-exl2
+ * **mlx**: https://huggingface.co/mlx-community/Meta-Llama-3-120B-Instruct-4bit
+
+ ## 🏆 Evaluation
+
+ The model looks excellent for creative writing tasks, outperforming GPT-4. Thanks again to [Eric Hartford](https://huggingface.co/ehartford) for noticing this.
+
+ * **X thread by Eric Hartford (creative writing)**: https://twitter.com/erhartford/status/1787050962114207886
+ * **X thread by Daniel Kaiser (creative writing)**: https://twitter.com/spectate_or/status/1787257261309518101
+ * **X thread by Simon (reasoning)**: https://twitter.com/NewDigitalEdu/status/1787403266894020893
+ * **r/LocalLLaMA**: https://www.reddit.com/r/LocalLLaMA/comments/1cl525q/goliath_lovers_where_is_the_feedback_about/
 
 ## 🧩 Configuration
 
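As a rough illustration of the Applications notes above (Llama 3 chat template, 8K default context, rope theta extension), here is a minimal sketch using `transformers`. It assumes the merged weights are published as `mlabonne/Meta-Llama-3-120B-Instruct` and picks an arbitrary `rope_theta` value; neither detail comes from the card, and the full-precision ~120B model is far too large for a single GPU.

```python
# Sketch only: prompt the merge through the Llama 3 chat template with transformers.
# The repo id and rope_theta value below are assumptions, not taken from the card.
import torch
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_id = "mlabonne/Meta-Llama-3-120B-Instruct"  # assumed repo id

# Optional: raise rope_theta to stretch the default 8K context window.
config = AutoConfig.from_pretrained(model_id)
config.rope_theta = 1_000_000  # illustrative value, tune against your target length

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    config=config,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # ~120B params: expect to shard across several GPUs
)

messages = [
    {"role": "user", "content": "Write the opening scene of a gothic short story."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Llama 3 instruct models end assistant turns with <|eot_id|>.
terminators = [tokenizer.eos_token_id, tokenizer.convert_tokens_to_ids("<|eot_id|>")]

output = model.generate(
    input_ids,
    max_new_tokens=512,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.8,
)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

For local use on a single machine, the GGUF, EXL2, and mlx conversions listed under Quantized models are the more practical starting point.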