---
language:
- en
license: apache-2.0
library_name: transformers
tags:
- unsloth
- trl
- sft
base_model: SanjiWatsuki/Kunoichi-DPO-v2-7B
pipeline_tag: text-generation
---

# Model Card for habib-DPO-v3

This model is a derivative of [Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B), fine-tuned on internal datasets.

### Model Description

This is the model card of a 🤗 Transformers model that has been pushed to the Hub. This model card was automatically generated.

- **Developed by:** [Habibullah Akbar](https://chavyv.vercel.app/)
- **Funded by:** [Creasoft ID](https://creasoft.id)
- **Shared by:** [Habibullah Akbar](https://chavyv.vercel.app/)
- **Model type:** Auto-regressive (causal) language model
- **Language(s) (NLP):** Mostly English
- **License:** Apache-2.0
- **Finetuned from model:** [Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B)
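
Since the metadata lists `transformers` as the library and `text-generation` as the pipeline tag, the model can presumably be loaded through the standard text-generation pipeline. The following is a minimal sketch under stated assumptions, not an official usage recipe: the repository ID `your-namespace/habib-DPO-v3` is a hypothetical placeholder (this card does not state the Hub path), the `generate` helper is illustrative, and the plain-text prompt is an assumption because no prompt template is documented here.

```python
# Hypothetical Hub repository ID: this card does not state where the model is
# hosted, so replace the placeholder with the real "<namespace>/<repo>" path.
MODEL_ID = "your-namespace/habib-DPO-v3"


def generate(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 128) -> str:
    """Load the model via the text-generation pipeline and complete a prompt."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    # Greedy decoding (do_sample=False) keeps the output deterministic.
    outputs = generator(prompt, max_new_tokens=max_new_tokens, do_sample=False)
    # The pipeline returns a list of dicts; by default "generated_text"
    # contains the prompt followed by the completion.
    return outputs[0]["generated_text"]


# Example call (downloads the weights on first use):
# print(generate("Write a haiku about autumn."))
```

Calling `generate` downloads the weights on first use, so the example call is left commented out. If the base model expects a specific prompt template, wrap the prompt accordingly before passing it in.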

## Evaluation

(coming soon)