💫 Pulsar_7B

⚠️ This is an experimental model!

A more compliant, RP-oriented version of MTSAIR/multi_verse_model, fine-tuned on carefully selected datasets. It's smart, adept at following the desired markdown format and adhering to the provided character card. The first message of the character card significantly influences its writing style. Pulsar_7B pairs well with guidance from CFG Scale and works effectively with PLists + Ali:Chat character cards. Pulsar_7B was fine-tuned on the following datasets:

grimulkan/theory-of-mind
grimulkan/physical-reasoning
ResplendentAI/Luna_Alpaca
unalignment/toxic-dpo-v0.2
kira/math-dpo
athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW-v1-SHUFFLED

Quantizations

Thanks to mradermacher, static GGUF quants are available here.

Formatting/Preset

Pulsar_7B works well with Alpaca, it's not a picky model when it comes to formatting/preset. Mistral should be compatible too. The custom chat template from MTSAIR/multi_verse_model also performs well:

{% for message in messages %}{% if message['role'] == 'user' %}{{ '### Instruction:\n' + message['content'] + '\n### Response:\n' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token}}{% elif message['role'] == 'system' %}{{ '### System:\n' + message['content'] + '\n' }}{% endif %}{% endfor %}

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	74.23
AI2 Reasoning Challenge (25-Shot)	69.71
HellaSwag (10-Shot)	86.99
MMLU (5-Shot)	63.72
TruthfulQA (0-shot)	69.28
Winogrande (5-shot)	84.06
GSM8k (5-shot)	71.65

rmdhirr
/

Pulsar_7B

💫 Pulsar_7B

Quantizations

Formatting/Preset

Open LLM Leaderboard Evaluation Results

Model tree for rmdhirr/Pulsar_7B

Datasets used to train rmdhirr/Pulsar_7B

Collection including rmdhirr/Pulsar_7B

7B Mistral Fine-tunes

Evaluation results