Model converted by the transformers' pt_to_tf CLI. All converted model outputs and hidden layers were validated against its PyTorch counterpart.

Maximum crossload output difference=2.289e-05; Maximum crossload hidden layer difference=6.390e-05;
Maximum conversion output difference=2.289e-05; Maximum conversion hidden layer difference=6.390e-05;

CAUTION: The maximum admissible error was manually increased to 6.5e-05!

sjrhuschlee changed pull request status to closed

Sign up or log in to comment