ShuttleAI

company
Verified


Recent Activity

xtristan new activity 7 days ago: shuttleai/shuttle-jaguar:Lora
xtristan updated a collection 9 days ago: Shuttle Jaguar
xtristan updated a collection 9 days ago: Shuttle 3.1 Aesthetic

shuttleai's activity

Lora

1
#2 opened 7 days ago by Renderless
xtristan 
in shuttleai/shuttle-3-AWQ-Int4 about 1 month ago

What model is this?

5
#1 opened about 1 month ago by ryg81
Fizzarolli posted an update 8 months ago
hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while the benchmark results are within the margin of error of each other, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting; i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1
Fizzarolli posted an update 9 months ago
Is anyone looking into some sort of decentralized/federated dataset generation or classification by humans instead of synthetically?

From my experience trying models, a *lot* of modern finetunes are trained on what amounts to GPT-4-generated slop, which makes everything sound like a GPT-4 rip-off (see e.g. the Dolphin finetunes). I have a feeling this is a large part of why these finetunes haven't been quite as successful as Meta's instruct tunes of Llama 3.