ChavyvAkvar
/

habib-DPO-v3

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

habib-DPO-v3 / README.md

ChavyvAkvar's picture

Update README.md

5811984 verified 6 months ago

|

1.15 kB

	---
	language:
	- en
	license: apache-2.0
	library_name: transformers
	tags:
	- unsloth
	- trl
	- sft
	base_model: SanjiWatsuki/Kunoichi-DPO-v2-7B
	pipeline_tag: text-generation
	---

	# Model Card for habib-DPO-v3

	<!-- Provide a quick summary of what the model is/does. -->

	This model is derivative version of [Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B) on internal datasets.

	### Model Description

	<!-- Provide a longer summary of what this model is. -->

	This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

	- Developed by: [Habibullah Akbar](https://chavyv.vercel.app/)
	- Funded by [optional]: [Creasoft ID](https://creasoft.id)
	- Shared by [optional]: [Habibullah Akbar](https://chavyv.vercel.app/)
	- Model type: Auto-regressive
	- Language(s) (NLP): Mostly English
	- License: Apache-2.0
	- Finetuned from model [optional]: [Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B)


	## Evaluation

	<!-- This section describes the evaluation protocols and provides the results. -->

	(coming soon)