Finetuning details?

#1
by brucethemoose - opened

What was trained on top of Tess?

I used my custom dataset - it's not on HF.

Have you tried the model?
I only tried it for my custom use case, so I'm not sure how it performs on other use cases people might have.

Nope, just downloading it lol.

Is it general purpose? Or a task-specific finetune? Specifically, I was thinking about merging it into a general-use 200K model.

It might not be very general; I really don't know, I haven't tried.

I mostly uploaded it on HF because I wanted to submit it to HF's leaderboard out of curiosity.

Also, please be aware that I only tried it in combination with in-context learning, and it just hit me that it might output total nonsense when used without it.

I'm working on a new version and I'll test this one with cases outside the task I finetuned it for.

I just tried it outside my custom use case and without in-context learning, and it's total rubbish :).

Interesting. The scores on the HF leaderboard were OK (which is how I found it) - maybe because the leaderboard doesn't use the model's prompting syntax?

AFAIK the HF leaderboard benchmarks use few-shot prompting, and I think that's why it has OK scores.
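For reference, here's a minimal sketch of how that kind of few-shot evaluation can be reproduced locally with the Python API of EleutherAI's lm-evaluation-harness (recent 0.4.x versions). The task and shot count are illustrative, not necessarily the exact leaderboard configuration, and I'm using the Pallas-0.4 repo linked below as the model id:

```python
# Sketch: few-shot evaluation with lm-evaluation-harness (lm_eval 0.4.x).
# Task and shot count are illustrative, not the exact leaderboard setup.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                  # Hugging Face causal LM backend
    model_args="pretrained=Mihaiii/Pallas-0.4",  # model repo on the Hub
    tasks=["arc_challenge"],
    num_fewshot=25,                              # ARC ran 25-shot on the leaderboard
)
print(results["results"])
```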

With zero-shot it outputs rubbish; with a few shots it outputs something useful.
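To make that concrete, here's a minimal sketch of zero-shot vs. few-shot prompting with transformers. The demonstration Q/A pairs below are made up for illustration; they're not from my dataset:

```python
# Sketch: comparing zero-shot and few-shot prompting with transformers.
# The demonstration Q/A pairs are invented examples, not from the training data.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="Mihaiii/Pallas-0.4",  # the repo linked below in this thread
    device_map="auto",
)

# Zero-shot: just the question, with no solved examples in the prompt.
zero_shot = "Q: Is every square a rectangle?\nA:"

# Few-shot: prepend a couple of solved examples so the model can
# infer the task format from context.
few_shot = (
    "Q: Is every even number divisible by 2?\nA: Yes.\n\n"
    "Q: Is every prime number odd?\nA: No, 2 is prime and even.\n\n"
    "Q: Is every square a rectangle?\nA:"
)

for prompt in (zero_shot, few_shot):
    out = generator(prompt, max_new_tokens=32, do_sample=False)
    print(out[0]["generated_text"], "\n---")
```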

I uploaded a new model and submitted it to the HF leaderboard (I'm waiting for the results).

It still isn't generic, in the sense that it shouldn't be used for storytelling, for example, but only for reasoning and text comprehension.

https://huggingface.co/Mihaiii/Pallas-0.4

I'm closing this discussion. Feel free to (re)open if needed.

Mihaiii changed discussion status to closed
