GTFO_Character_Voice_models

中文版 (Chinese version)

Table of Contents
  1. About The Project
  2. Getting Started
  3. License
  4. Contact



About The Project

This project provides trained models for audio-to-audio simulation using so-vits-svc 4.1. Training a model is resource-intensive, but inferring audio from a trained model is not. Every model included has been trained for at least 20K steps.

This repo includes:

  • All data sets used to train the models
  • Default model
  • Diffusion model
  • Fusion model
  • Sample of model

Dataset Source:

Trained with so-vits-svc

Getting Started

To use the models, follow the instructions for so-vits-svc, or for so-vits-svc-fork, which offers a better GUI and easier inference. No training is required.

Dragging the folder into the so-vits-svc folder should work right away. Otherwise, move the models to the designated folders as described in the layout that follows.

so-vits-svc-4.1
    │
    ├───configs
    │      ├───config.json - config file for default training
    │      └───diffusion.yaml - config file for diffusion training
    │
    └───logs
          └───44k
                ├───G_(name of character).pth - Default model
                ├───(name of character)Kmeans.pt - fusion model
                └───diffusion
                        └───(name of character).pt - diffusion model for character

data_set - dataset used for training, audio cut to slices.
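
The folder layout above can be sketched as a small Python helper that copies one character's files into place. This is only an illustration under stated assumptions: the function names `plan_install` and `install`, the idea that the downloaded files sit flat in a single `download_dir`, and the character name used in the usage note are all hypothetical and not part of this repo or of so-vits-svc. Only the target paths come from the tree above.

```python
import shutil
from pathlib import Path


def plan_install(character: str, svc_root: Path) -> dict[str, Path]:
    """Return the target path of each model file, following the tree above."""
    logs = svc_root / "logs" / "44k"
    return {
        "default": logs / f"G_{character}.pth",
        "fusion": logs / f"{character}Kmeans.pt",
        "diffusion": logs / "diffusion" / f"{character}.pt",
    }


def install(character: str, download_dir: Path, svc_root: Path) -> list[Path]:
    """Copy whichever model files exist in download_dir to their target folders."""
    copied = []
    for target in plan_install(character, svc_root).values():
        # Assumption: the downloaded file has the same name as the target
        # and sits directly inside download_dir.
        source = download_dir / target.name
        if source.is_file():
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.copy2(source, target)
            copied.append(target)
    return copied
```

For example, `install("Hacket", Path("downloads"), Path("so-vits-svc-4.1"))` would copy any of the three files it finds and return the list of paths it wrote. Files that are missing are skipped rather than treated as errors, since not every setup uses the diffusion or fusion model.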

Usage

Select the default model, diffusion model, fusion model, and their respective config files. Note: update the speaker name in the config file to avoid key errors. The speaker names are "Hacket_data_set", "Dauda_data_set", "Bishop_data_set", and "Woods_data_set". If that does not work, try preprocessing a folder with one of these names, then run the pre-config step so that all configs use the same voice name.

  • Fusion model = cluster model
  • You might not see the option for diffusion, since it is a new feature and is only available in some versions of the so-vits-svc forks
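
The note about key errors can be checked before running inference. The following is a minimal sketch, assuming `config.json` is a JSON file with a top-level `spk` object that maps speaker names to integer ids (verify this against your own config file). The function names `list_speakers` and `rename_speaker` are hypothetical helpers, not part of so-vits-svc.

```python
import json
from pathlib import Path


def list_speakers(config_path: Path) -> list[str]:
    """Return the speaker names found under the assumed "spk" key."""
    config = json.loads(config_path.read_text(encoding="utf-8"))
    return list(config.get("spk", {}))


def rename_speaker(config_path: Path, old: str, new: str) -> None:
    """Rename one speaker in place, keeping its id and position."""
    config = json.loads(config_path.read_text(encoding="utf-8"))
    speakers = config.get("spk", {})
    if old not in speakers:
        raise KeyError(f"{old!r} not in config; found {sorted(speakers)}")
    # Rebuild the mapping so only the name changes, not the id or the order.
    config["spk"] = {
        (new if name == old else name): idx for name, idx in speakers.items()
    }
    config_path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False), encoding="utf-8"
    )
```

Calling `list_speakers(Path("configs/config.json"))` first shows which name the config currently expects. If it differs from the name the model was trained with, `rename_speaker` can align the two. Keep a backup of the original config before rewriting it.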

License

Distributed under the MIT License. See LICENSE.txt for more information. If used, please attach a link to the repo.

Contact

NAinfini - na.infini@gmail.com

NA infini#6457 - Discord

Project Link: GTFO_Character_Voice_models

