BF16 in MPS from Apple

#9
by DAgir - opened

Hi guys. This is great news, thank you for your hard work. I will definitely try the 3B model.
But it would be great if models with the BF16 tensor type ran without problems on Apple's MPS backend.
PyTorch 2.3 solved some of the problems, but there are still bottlenecks where operations fall back to the CPU, and that fallback is why tokens/s drops.
Of course MLX is a great tool, but it would be nice to support seamless development (the cloud is CUDA).
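For context, here is a minimal sketch of what I mean, assuming PyTorch >= 2.3 on an Apple Silicon Mac: a bf16 matmul should stay on the MPS device, and any missing kernel either errors out or, with the fallback env var set, silently runs on the CPU.

```python
import torch

# Minimal sketch: check whether bf16 ops stay on MPS or fall back to the CPU.
# Assumes PyTorch >= 2.3 on an Apple Silicon Mac (macOS recent enough for bf16 on MPS).
assert torch.backends.mps.is_available(), "MPS backend not available"

x = torch.randn(1024, 1024, device="mps", dtype=torch.bfloat16)
y = torch.randn(1024, 1024, device="mps", dtype=torch.bfloat16)

# If a kernel is missing, PyTorch raises NotImplementedError unless
# PYTORCH_ENABLE_MPS_FALLBACK=1 is set, in which case the op silently
# runs on the CPU -- the kind of bottleneck that drags tokens/s down.
z = x @ y
print(z.dtype, z.device)  # expected: torch.bfloat16 mps:0
```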

wrong forum for this
