Instruct-finetuning dataset

#43

by Andriy - opened Mar 31

Discussion

Andriy

Mar 31

Hi! What instruct-finetuning dataset was used to train the chat model?

CatUkraine

Apr 10

The dataset is probably closed-source, but, in theory, it is possible to generate an "artificial" dataset for instruction following. It can be done by program two instances of LLM to chat with each other and log their generated data into some file.

markding

Apr 12

I was wondering the same, and it seems like that the Aya Collection was used in some form, but I have not seen definite proof.

skevja

Apr 17

Does anybody know what prompt formatting should be used for a custom fine-tuning dataset for command-r?

ewre324

Apr 23

@skevja I am also looking for this information...
Any updates?

skevja

21 days ago

•

edited 21 days ago

@ewre324 Unfortunately no, didn't find any information on this.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment