Slow inference speed
#25 by DarrenChen - opened
Is there any method that can achieve a several-fold improvement in single inference efficiency?