failspy
/

Phi-3-mini-4k-geminified

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Phi-3-mini-4k-geminified / README.md

failspy's picture

Upload folder using huggingface_hub

204edd5 verified 5 months ago

|

1.17 kB

	---
	license: mit
	license_link: https://huggingface.co/microsoft/Phi-3-medium-4k-instruct/resolve/main/LICENSE

	language:
	- multilingual
	pipeline_tag: text-generation
	tags:
	- nlp
	- code
	inference:
	parameters:
	temperature: 0.7
	widget:
	- messages:
	- role: user
	content: What's the difference between a banana and a strawberry?
	---

	# Phi-3-mini-128k-instruct~~abliterated-v3~~ geminified

	[My Jupyter "cookbook" to replicate the methodology can be found here, refined library coming soon](https://huggingface.co/failspy/llama-3-70B-Instruct-abliterated/blob/main/ortho_cookbook.ipynb)

	## Summary

	This is [microsoft/Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) with orthogonalized bfloat16 safetensor weights, generated with a refined methodology based on that which was described in the preview paper/blog post: '[Refusal in LLMs is mediated by a single direction](https://www.alignmentforum.org/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction)' which I encourage you to read to understand more.

	This model has been orthogonalized to act more like certain rhymes-with-Shmemini models.