https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-405B

#227

by leafspark - opened Aug 15, 2024

Discussion

leafspark

Aug 15, 2024

•

edited Aug 15, 2024

NousResearch/Hermes-3-Llama-3.1-405B

Another huge model by NousResearch, interestingly it's a full parameter fine tune on the base.

They also released a 8B and 70B finetune:
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-70B
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

nicoboss

Aug 15, 2024

•

edited Aug 15, 2024

What a high-quality model. This is a full parameter finetune of Llama-3.1 405B on par with Llama-3.1 405B Instruct. It uses the ChatML template, supports function calling and has a system prompt to generate structured JSON output.

mradermacher

Owner Aug 16, 2024

Queued, but for the 405B, patience is required.

mradermacher changed discussion status to closed Aug 16, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment