Tensor parallel version of this for efficient inference?

#5
by mayank31398 - opened

Does a TP version exist?

mayank31398 changed discussion status to closed

Sign up or log in to comment