Post
1517
PREM-1B-CHAT QUANTIZED INTO Q4
THEN SERVED IN WEBGPU DEMO
OG model premai-io/prem-1B-chat
Q4 model ucalyptus/prem-1B-chat-onnx-q4
WEBGPU demo ucalyptus/prem-1B-chat-webgpu
THEN SERVED IN WEBGPU DEMO
OG model premai-io/prem-1B-chat
Q4 model ucalyptus/prem-1B-chat-onnx-q4
WEBGPU demo ucalyptus/prem-1B-chat-webgpu