MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF

Update README.md

by MaziyarPanahi - opened Apr 11

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

+74

-2

Files changed (1) hide show

README.md +74 -2

README.md CHANGED Viewed

@@ -31,6 +31,78 @@ quantized_by: MaziyarPanahi
 <img src="https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1/resolve/main/logo.png" alt="Zephyr 141B Logo" width="400" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
-# Model Card for zephyr-orpo-141b-A35b-v0.1-GGUF
-in progress ...

 <img src="https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1/resolve/main/logo.png" alt="Zephyr 141B Logo" width="400" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
+# zephyr-orpo-141b-A35b-v0.1-GGUF
+On April 11th, [@HuggingFaceH4](https://huggingface.co/HuggingFaceH4) released a fine-tuned model called [HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1](https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1) based on `Mixtral-8x22B-v0.1` model.
+- **Model type:** A Mixture of Experts (MoE) model with 141B total parameters and 35B active parameters. Fine-tuned on a mix of publicly available, synthetic datasets.
+- **Language(s) (NLP):** Primarily English.
+- **License:** Apache 2.0
+- **Finetuned from model:** [mistral-community/Mixtral-8x22B-v0.1](https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1)
+## Load sharded model
+`llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.
+```sh
+llama.cpp/main -m zephyr-orpo-141b-A35b-v0.1.Q2_K-00001-of-00005.gguf --file prompt.txt -n 1024 -e
+```
+The output from `Q2_K` quantized model:
+```
+ <|system|>
+You are a helpful assistant.
+<|user|>
+Hello, how are you?
+<|assistant|>
+I'm doing great. How can I help you today?
+<|user|>
+Show me how to build a website in 10 simple steps
+<|assistant|>
+Of course, I'd be happy to guide you through the process of building a website in 10 simple steps.
+1. Determine the purpose of your website: Decide what your website will be used for, whether it's for business, personal use, or a blog.
+2. Choose a domain name: Your domain name is the address that people will use to find your website. It should be easy to remember and relevant to your website's purpose.
+3. Select a web hosting provider: A web hosting provider will store your website files and make them accessible to users around the world. Research different providers and choose one that fits your needs.
+4. Choose a platform: There are many platforms available for building a website, including WordPress, Wix, and Squarespace. Consider your technical skills and the features you need   when   choosing   a platform.
+5. Plan your website structure: Decide on the pages you want to include on your website and how they will be organized. This will help guide the design process.
+6. Design your website: Use a template or create your own design for your website. Make sure it's visually appealing and easy to navigate.
+7. Add content to your website: Write the text and create images or videos for your website. Make sure the content is relevant to your website's purpose and engaging for users.
+8. Optimize your website for search engines: Make sure your website is easy to find by using keywords in your content and metadata. This will help your website rank higher in search engine results.
+9. Test your website: Test your website to make sure everything is working properly and that there are no broken links or errors.
+10. Launch your website: Once everything is working properly, it's time to launch your website. Share the link with your audience and start promoting your website.
+I hope this guide helps you build a successful website. Let me know if you have any questions. [end of text]
+llama_print_timings:        load time =   11670.53 ms
+llama_print_timings:      sample time =      16.30 ms /   422 runs   (    0.04 ms per token, 25894.34 tokens per second)
+llama_print_timings: prompt eval time =    5084.73 ms /    78 tokens (   65.19 ms per token,    15.34 tokens per second)
+llama_print_timings:        eval time =  279055.53 ms /   421 runs   (  662.84 ms per token,     1.51 tokens per second)
+llama_print_timings:       total time =  284314.00 ms /   499 tokens
+Log end
+```
+What's inside the `prompt.txt`:
+```
+<|system|>
+You are a helpful assistant.</s>
+<|user|>
+Hello, how are you?</s>
+<|assistant|>
+I'm doing great. How can I help you today?</s>
+<|user|>
+Show me how to build a website in 10 simple steps</s>
+<|assistant|>
+```