Add bfloat16 support for lighter (maybe faster too?) inference. I used to add this argument on pipeline, see for example https://gist.github.com/younesbelkada/dba25f75d3749b4e2d2d4821f0d6f385#file-benchmark-py-L42 /

osanseviero changed pull request status to merged

Sign up or log in to comment