VAGOsolutions/SauerkrautLM-7b-HerO · Training dataset + Hyperparamters

Hugging Face

Training dataset + Hyperparamters

by Viewegger - opened Nov 26, 2023

Discussion

Viewegger

Nov 26, 2023

Hello,

Thank you for making this public, it looks that there is recent rise in non-English models.

Do you plan to make the dataset public? If not, would it be possible to make public at least small portion of it, to see how similar dataset could be modeled in different languages?
Could you provide some details on training procedure? Hyper-parameters and you HW setup + total time it took you to finish training?

Danke!

CyberTimon

Nov 26, 2023

I would also love to see the data public - would like to reproduce it with different models. Thanks

DavidGF

VAGO solutions org Nov 30, 2023

Hey @ all.
We are already planning to publish a partial data set that we used for the training. This is data that has been completely augmented from an existing English top dataset.
I think the dataset should make our approach clearer for the open source community.

Best Regards,
David

DaryoushV changed discussion status to closed Dec 1, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment