Model Card for helehan/topic-overwrite-llava-7b-full

Model Details

This model was trained from LLaVA using the RLHF/RLAIF method proposed in the TPO paper (see Citation). It shows enhanced trustworthiness and reduced hallucinations.

Model Description

Trained from model: llava-v1.5-7B
Trained on data: TPO-Dataset

Usage

See the project's GitHub repository for details about usage.
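As a starting point, the checkpoint files can be fetched from the Hub before following the GitHub instructions. The snippet below is a minimal sketch, assuming the `huggingface_hub` package is installed; the helper name `download_checkpoint` is hypothetical and not part of the project. It only downloads the files. Loading them for inference depends on the LLaVA-style code described in the GitHub repository, which is not shown here.

```python
# Minimal sketch: download the checkpoint files from the Hugging Face Hub.
# Assumption: `pip install huggingface_hub` has been run.
# `download_checkpoint` is a hypothetical helper name used for illustration.

REPO_ID = "helehan/topic-overwrite-llava-7b-full"


def download_checkpoint(local_dir=None):
    """Download all files of the model repo and return the local path.

    If `local_dir` is None, files are stored in the default Hub cache.
    """
    # Imported lazily so the module can be imported without the dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


# Example call (downloads the full set of weights, so it is left commented out):
# path = download_checkpoint("./topic-overwrite-llava-7b-full")
# print(path)
```

The returned path can then be passed to whatever loading routine the GitHub repository documents.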

Citation

@article{he2024topic,
  title={A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs},
  author={He, Lehan and Chen, Zeren and Shi, Zhelun and Yu, Tianyu and Shao, Jing and Sheng, Lu},
  journal={arXiv preprint arXiv:2411.17265},
  year={2024}
}

Weights format: Safetensors
Model size: 7.06B params
Tensor type: BF16
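The parameter count and tensor type listed above give a rough lower bound on the memory needed just to hold the weights. The following is a back-of-the-envelope sketch, not a measurement: it assumes 2 bytes per BF16 parameter and ignores activations, the KV cache, and framework overhead. The function names are illustrative.

```python
# Rough estimate of weight memory from parameter count and dtype.
# Assumption: weights only; activations and runtime overhead are excluded.

NUM_PARAMS = 7_060_000_000  # 7.06B params, as listed for this model

BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4}


def weight_bytes(num_params, dtype="bf16"):
    """Return the number of bytes needed to store the weights."""
    return num_params * BYTES_PER_PARAM[dtype]


def to_gib(num_bytes):
    """Convert bytes to gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


bf16_bytes = weight_bytes(NUM_PARAMS, "bf16")
print(f"{bf16_bytes / 1e9:.2f} GB ({to_gib(bf16_bytes):.2f} GiB)")
```

Under these assumptions the BF16 weights occupy about 14.12 GB, so actual inference needs somewhat more than that.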
