Text Generation
Transformers
Safetensors
English
mistral
text-generation-inference
unsloth
trl
dpo
uncensored
roleplay
fine-tune
conversational
Eval Results
Inference Endpoints
Edit model card
image

💫 Pulsar_7B

⚠️ This is an experimental model!

A more compliant, RP-oriented version of MTSAIR/multi_verse_model, fine-tuned on carefully selected datasets. It's smart, adept at following the desired markdown format and adhering to the provided character card. The first message of the character card significantly influences its writing style. Pulsar_7B pairs well with guidance from CFG Scale and works effectively with PLists + Ali:Chat character cards. Pulsar_7B was fine-tuned on the following datasets:

  • grimulkan/theory-of-mind
  • grimulkan/physical-reasoning
  • ResplendentAI/Luna_Alpaca
  • unalignment/toxic-dpo-v0.2
  • kira/math-dpo
  • athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW-v1-SHUFFLED

Quantizations

Thanks to mradermacher, static GGUF quants are available here.

Formatting/Preset

Pulsar_7B works well with Alpaca, it's not a picky model when it comes to formatting/preset. Mistral should be compatible too. The custom chat template from MTSAIR/multi_verse_model also performs well:

{% for message in messages %}{% if message['role'] == 'user' %}{{ '### Instruction:\n' + message['content'] + '\n### Response:\n' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token}}{% elif message['role'] == 'system' %}{{ '### System:\n' + message['content'] + '\n' }}{% endif %}{% endfor %}

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.23
AI2 Reasoning Challenge (25-Shot) 69.71
HellaSwag (10-Shot) 86.99
MMLU (5-Shot) 63.72
TruthfulQA (0-shot) 69.28
Winogrande (5-shot) 84.06
GSM8k (5-shot) 71.65

Downloads last month
1,004
Safetensors
Model size
7.24B params
Tensor type
FP16
·

Finetuned from

Datasets used to train rmdhirr/Pulsar_7B

Collection including rmdhirr/Pulsar_7B

Evaluation results