[Cache Request] facebook/seamless-m4t-v2-large

by aitransync - opened

Please add the following model to the Neuron cache.

AWS Inferentia and Trainium org

The inference cache is only available for causal LM models for now. cc @Jingya

AWS Inferentia and Trainium org
edited Mar 13

We do not have seamless-m4t-v2 support yet, not even in optimum main, so we will first need to add support for its export and inference. Besides, given the size of this model, we might need tensor parallelism (TP) support for it as well...
