How to speed up inferring?
7
#21 opened 12 months ago
by
merlinarer
May I ask if there are plans to provide 8-bit or 4-bit quantized versions?
10
#19 opened 12 months ago
by
intelligencegear
Not able to run hello world example, bigcode/starcoder is not a valid model identifier
14
#11 opened 12 months ago
by
rameshn