brittlewis12's picture
Update README.md
4298608 verified
metadata
base_model: argilla/distilabeled-Marcoro14-7B-slerp-full
inference: false
license: apache-2.0
language:
  - en
datasets:
  - argilla/distilabel-intel-orca-dpo-pairs
tags:
  - distilabel
  - dpo
  - rlaif
  - rlhf
  - merge
  - mergekit
model_creator: argilla
model_name: distilabeled-Marcoro14-7B-slerp-full
model_type: mistral
pipeline_tag: text-generation
quantized_by: brittlewis12

distilabeled-Marcoro14-7B-slerp-full GGUF

Original model: distilabeled-Marcoro14-7B-slerp-full Model creator: Argilla

This repo contains GGUF format model files for Argilla’s distilabeled-Marcoro14-7B-slerp-full.

As described on the original model card:

This model is a new DPO fine-tune of our new open dataset argilla/distilabel-intel-orca-dpo-pairs, on the mlabonne/Marcoro14-7B-slerp model. You can find more information of the "distilabeled" dataset used at this repo argilla/distilabeled-Hermes-2.5-Mistral-7B, and visit distilabel.

The difference between this model and argilla/distilabeled-Marcoro14-7B-slerp is that this model has been fine-tuned for a whole epoch instead instead of 200 steps, so it has seen the whole dataset.

What is GGUF?

GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. Converted using llama.cpp build 1879 (revision 3e5ca79)

Prompt template: Unknown

{{prompt}}


Download & run with cnvrs on iPhone, iPad, and Mac!

cnvrs.ai

cnvrs is the best app for private, local AI on your device:

  • create & save Characters with custom system prompts & temperature settings
  • download and experiment with any GGUF model you can find on HuggingFace!
  • make it your own with custom Theme colors
  • powered by Metal ⚡️ & Llama.cpp, with haptics during response streaming!
  • try it out yourself today, on Testflight!
  • follow cnvrs on twitter to stay up to date

Original Model Evaluations:

Model AGIEval GPT4ALL TruthfulQA Bigbench Average
argilla/distilabeled-Marcoro14-7B-slerp-full 45.17 76.59 64.68 48.15 58.65
argilla/distilabeled-Marcoro14-7B-slerp 45.4 76.47 65.46 47.19 58.63
Marcoro14-7B-slerp 44.66 76.24 64.15 45.64 57.67
argilla/distilabeled-Hermes-2.5-Mistral-7B 44.64 73.35 55.96 42.21 54.04