Add TF weights

#1
by reach-vb HF staff - opened

Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against their PyTorch counterparts.

Maximum crossload output difference=3.433e-05; Maximum crossload hidden layer difference=1.016e-02;
Maximum conversion output difference=3.433e-05; Maximum conversion hidden layer difference=1.016e-02;

CAUTION: The maximum admissible error was manually increased to 1.0!
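
For reviewers who want to sanity-check figures like the ones above, here is a minimal sketch of a "maximum difference under a tolerance" check. It is not the pt_to_tf implementation: the function names `max_abs_difference` and `check_against_tolerance` are hypothetical, and it assumes the PyTorch and TensorFlow outputs have already been flattened into plain lists of floats.

```python
def max_abs_difference(reference, converted):
    """Return the largest element-wise absolute difference between two flat sequences."""
    reference = list(reference)
    converted = list(converted)
    if len(reference) != len(converted):
        raise ValueError("sequences must have the same length")
    return max(abs(r - c) for r, c in zip(reference, converted))


def check_against_tolerance(reference_outputs, converted_outputs, max_error):
    """Compare paired outputs and report the worst difference and whether it passes.

    reference_outputs / converted_outputs: lists of flat float sequences
    (hypothetical stand-ins for per-layer PyTorch and TensorFlow values).
    max_error: the maximum admissible difference.
    """
    worst = max(
        max_abs_difference(ref, conv)
        for ref, conv in zip(reference_outputs, converted_outputs)
    )
    return worst, worst <= max_error


if __name__ == "__main__":
    # Illustrative values only, not taken from the converted model.
    reference = [[0.0, 1.0], [2.0, 3.0]]
    converted = [[0.0, 1.5], [2.0, 3.25]]
    print(check_against_tolerance(reference, converted, max_error=1.0))
```

Under this kind of check, the reported hidden layer difference of 1.016e-02 passes with a maximum admissible error of 1.0, but it would fail a tighter threshold such as 1e-3, which is why the raised tolerance is worth noting before merging.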

Ready to merge
This branch is ready to get merged automatically.
