Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against its PyTorch counterpart.

Maximum crossload output difference=1.788e-06; Maximum crossload hidden layer difference=4.053e-06;
Maximum conversion output difference=1.788e-06; Maximum conversion hidden layer difference=4.053e-06;

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment