Update tolerance of provider tests to fix flaky tests #19792

tianleiwu · 2024-03-06T07:44:16Z

Description

Check that float/double/float16/bfloat16 tensors are close, using the same formula as numpy.isclose (https://numpy.org/doc/stable/reference/generated/numpy.isclose.html):

absolute(a - b) <= (atol + rtol * absolute(b))

The default tolerance thresholds:

float: atol=1e-5 and rtol=1e-4
float16: atol=0.0025 and rtol=0.001
bfloat16: atol=0.02 and rtol=0.01
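
As a rough illustration of the check described above, the following Python sketch applies the numpy.isclose-style formula with the float, float16, and bfloat16 defaults listed here. The names `DEFAULT_TOLERANCE` and `is_close` are hypothetical and not taken from checkers.cc, and treating the expected value as `b` in the formula is an assumption.

```python
# Hypothetical sketch of the tolerance check; not the code in checkers.cc.

# (atol, rtol) defaults quoted in this description, keyed by element type.
DEFAULT_TOLERANCE = {
    "float": (1e-5, 1e-4),
    "float16": (0.0025, 0.001),
    "bfloat16": (0.02, 0.01),
}


def is_close(actual, expected, atol, rtol):
    """Return True when |actual - expected| <= atol + rtol * |expected|.

    Assumption: the expected value plays the role of `b` in the formula.
    """
    return abs(actual - expected) <= atol + rtol * abs(expected)


if __name__ == "__main__":
    atol, rtol = DEFAULT_TOLERANCE["float"]
    # A value near zero passes because atol dominates the bound.
    print(is_close(0.0, -1.3113021850585938e-06, atol, rtol))
    # A large mismatch is still rejected.
    print(is_close(1.0, 1.1, atol, rtol))
```

Because the bound is the sum of the two terms, `atol` governs comparisons near zero while `rtol` governs comparisons of large magnitudes.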

Motivation and Context

The current pipeline fails frequently because only a relative tolerance is used in #19608:

[ RUN ] MatMulIntegerToFloat.NoZeroPoint_NoBias_test_U8S8
1: C:\a\_work\1\s\onnxruntime\test\providers\checkers.cc(272): error: The difference between cur_expected[i] and cur_actual[i] is 1.3113021850585938e-06, which exceeds *(params.relative_error) * std::abs(cur_expected[i]), where
1: cur_expected[i] evaluates to -1.3113021850585938e-06,
1: cur_actual[i] evaluates to 0, and
1: *(params.relative_error) * std::abs(cur_expected[i]) evaluates to 2.6226043559063328e-08.

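A relative tolerance alone is not reasonable for values very close to 0: here the expected value is about -1.3e-06, so the allowed difference shrinks to about 2.6e-08 and an actual value of 0 fails. Combining the relative tolerance with a positive absolute tolerance avoids this issue.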
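
To make the failure concrete, this small Python sketch replays the numbers from the log above. It compares the relative-only bound reported by the failing assertion with the combined bound, assuming the float defaults atol=1e-5 and rtol=1e-4 from this description. The variable names are illustrative only.

```python
# Values copied from the failing test log.
expected = -1.3113021850585938e-06       # cur_expected[i]
actual = 0.0                             # cur_actual[i]
relative_bound = 2.6226043559063328e-08  # relative_error * |cur_expected[i]|

diff = abs(expected - actual)

# Relative-only check, as in the failing assertion.
passes_relative_only = diff <= relative_bound

# Combined check: |a - b| <= atol + rtol * |b|, with the float defaults.
atol, rtol = 1e-5, 1e-4
passes_combined = diff <= atol + rtol * abs(expected)

print(passes_relative_only, passes_combined)  # prints: False True
```

The difference of about 1.3e-06 is roughly 50 times the relative-only bound but well under the absolute tolerance of 1e-5, so the same pair of values passes once the absolute term is added.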

tianleiwu added 5 commits March 6, 2024 07:24

update tolerance formula

679cce1

float type

7bcab55

change default

6f63322

Merge branch 'main' into tlwu/test_rel_error

d4e3551

fix build warning

976faf4

tianleiwu force-pushed the tlwu/test_rel_error branch from 34aebcd to 976faf4 Compare March 6, 2024 17:31

update threshold

d92f250

tianleiwu force-pushed the tlwu/test_rel_error branch from 541de1e to d92f250 Compare March 6, 2024 21:27

yufenglee approved these changes Mar 7, 2024

tianleiwu merged commit bff4f8b into main Mar 7, 2024
95 checks passed

tianleiwu deleted the tlwu/test_rel_error branch March 7, 2024 01:47

jywu-msft mentioned this pull request Mar 7, 2024

[VitisAI]set-data_loaction-as-default-when-load-external-data #19712

Merged
