SECap: Speech Emotion Captioning with Large Language Model

This repository contains the implementation of the paper "SECap: Speech Emotion Captioning with Large Language Model". It includes the model code, training and testing scripts, and a test dataset. The test dataset consists of 600 WAV audio files and their corresponding emotion descriptions.

For more details, see the GitHub repository: https://github.com/xuyaoxun/SECaps

Checkpoint

Download the model checkpoint from this repository and place it in the main folder of SECaps.

You also need to download the weights folder and place it in the main folder of SECaps.
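
As a quick sanity check after downloading, the layout described above can be verified with a few lines of Python. This is only a minimal sketch: the helper `missing_items`, the checkpoint file name `model.ckpt`, and the folder name `weights` are assumptions made for illustration, so substitute the actual names of the files you downloaded.

```python
from pathlib import Path


def missing_items(repo_root, checkpoint_name, weights_dir="weights"):
    """Return the expected paths that are absent under repo_root.

    checkpoint_name and weights_dir are placeholders (assumptions);
    pass the real names of the downloaded checkpoint file and folder.
    """
    root = Path(repo_root)
    missing = []
    # The checkpoint is expected to be a regular file in the main folder.
    if not (root / checkpoint_name).is_file():
        missing.append(str(root / checkpoint_name))
    # The weights are expected to be a directory in the main folder.
    if not (root / weights_dir).is_dir():
        missing.append(str(root / weights_dir))
    return missing


if __name__ == "__main__":
    # "SECaps" and "model.ckpt" are hypothetical values for illustration.
    for path in missing_items("SECaps", "model.ckpt"):
        print(f"missing: {path}")
```

If the script prints nothing, both the checkpoint and the weights folder were found where the instructions above say they should be.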

Citation

If you use this repository in your research, please cite our paper:

@article{SECap,
  title={SECap: Speech Emotion Captioning with Large Language Model},
}
