Runtime error 15 π¦π¦π¦ Llama-3.1-405B-Instruct Service unavailable, 405B recently taken off hub inference:/
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18 β’ 222
ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 Text Generation β’ Updated Sep 17 β’ 2.93k β’ 44