Llama 3 70B Instruct no refusal

This is a model that uses the orthogonal feature ablation as featured in this paper.

Calibration data:

256 prompts from jondurbin/airoboros-2.2
256 prompts from AdvBench
The direction is extracted between layer 40 and 41

I haven't tested the model but like the 8B model, may still refuse some instructions. Use this model responsibly, I decline any liability resulting of the use of this model.

I will post the code later.

Downloads last month: 45

Safetensors

Model size

70.6B params

Tensor type

BF16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for theo77186/Llama-3-70B-Instruct-norefusal

Quantizations

2 models