[RFC] feat: more precise qbinary handling in tract core with unaligned entry #1333

JulienBalianSonos · 2024-02-09T15:19:12Z

Change how binary ops are computed on quantized tensors, by allowing unaligned qparams in the input and output tensors.

This change increases quantization precision and reduces casting/alignment cost, but the qparams of the output tensor must now be passed as a function input.

We need to handle every op covered by the bin_to_super_type macro that already supports QU8, since these may be given tensors quantized with unaligned qparams:

elementwise mul
div
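As a rough illustration of why the output qparams become an input, here is a minimal sketch of an element-wise multiplication on u8 values whose two inputs and output each carry their own zero point and scale. The `QParams` struct and `qmul_u8` function are hypothetical names invented for this example, not tract's API, and the arithmetic is the usual affine-quantization formula rather than a copy of the code in this PR.

```rust
/// Hypothetical affine quantization parameters: real = scale * (q - zero_point).
/// This is an illustrative type, not tract's own representation.
#[derive(Clone, Copy, Debug, PartialEq)]
struct QParams {
    zero_point: i32,
    scale: f32,
}

/// Multiply two u8 values quantized with independent (unaligned) qparams and
/// store the result using the output qparams `qc`.
///
/// real_a * real_b = sa * sb * (a - za) * (b - zb)
/// q_c             = round(real_a * real_b / sc) + zc
fn qmul_u8(a: u8, qa: QParams, b: u8, qb: QParams, qc: QParams) -> u8 {
    // Remove each input's own zero point; no prior alignment is needed.
    let a_centered = a as i32 - qa.zero_point;
    let b_centered = b as i32 - qb.zero_point;
    // A single combined multiplier folds the three scales together.
    let multiplier = qa.scale * qb.scale / qc.scale;
    let scaled = (a_centered * b_centered) as f32 * multiplier;
    // Re-center on the output zero point and saturate to the u8 range.
    let q = scaled.round() as i32 + qc.zero_point;
    q.clamp(0, 255) as u8
}
```

With inputs `(130, zp=128, scale=0.5)` and `(14, zp=10, scale=0.25)`, both real values are 1.0, so an output with `zp=3, scale=0.125` stores 8 + 3 = 11. Division would follow the same pattern with the multiplier `sa / (sb * sc)`, plus a guard for a zero denominator.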

For the following ops, we directly dequantize to f32 and requantize to the target qtype for now:

add
sub
min
max (with a small optimization that makes the ReLU case slightly faster)
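The dequantize/requantize route described for these four ops can be sketched as below. All names (`QParams`, `dequant`, `requant`, `qbinary_via_f32`, `qrelu_same_qparams`) are hypothetical and chosen for this example only. The ReLU shortcut shown is one plausible form of such an optimization, assuming the input and output share the same qparams; the optimization actually implemented in the PR may differ.

```rust
/// Hypothetical affine quantization parameters: real = scale * (q - zero_point).
#[derive(Clone, Copy, Debug, PartialEq)]
struct QParams {
    zero_point: i32,
    scale: f32,
}

/// Convert a quantized u8 value to its real (f32) value.
fn dequant(q: u8, p: QParams) -> f32 {
    (q as i32 - p.zero_point) as f32 * p.scale
}

/// Convert a real value to u8 using the target qparams, saturating to 0..=255.
fn requant(x: f32, p: QParams) -> u8 {
    ((x / p.scale).round() as i32 + p.zero_point).clamp(0, 255) as u8
}

/// Generic path: dequantize both inputs with their own qparams, apply the
/// f32 operation, then requantize with the output qparams.
fn qbinary_via_f32(
    a: u8,
    qa: QParams,
    b: u8,
    qb: QParams,
    qc: QParams,
    op: impl Fn(f32, f32) -> f32,
) -> u8 {
    requant(op(dequant(a, qa), dequant(b, qb)), qc)
}

/// ReLU shortcut (assumption: input and output use the same qparams).
/// max(x, 0.0) in real space is then max(q, zero_point) in quantized space,
/// so no f32 round trip is needed.
fn qrelu_same_qparams(a: u8, p: QParams) -> u8 {
    let zp = p.zero_point.clamp(0, 255) as u8;
    a.max(zp)
}
```

For example, `qbinary_via_f32(a, qa, b, qb, qc, |x, y| x + y)` gives an addition, and passing `f32::min` or `f32::max` gives the min and max variants. The round trip costs two conversions per element, which is why a shortcut for the common ReLU pattern is attractive.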

We propose to leave out of this PR: rem, and all "bool" or "bit" operations.


core/src/ops/binary.rs

core/src/ops/math/mod.rs

nnef/src/registry.rs

test-rt/suite-unit/src/q_binary.rs


JulienBalianSonos added 29 commits February 21, 2024 16:09

feat: more precise qmul handling in tract core (with unaligned entry …

4ff4d8b

…& output qparams)

fix: rm useless comments

9b67809

feat: add unit test not aligned scale and offset

93a2891

fix: rm old comment with regard to alignment

7207200

fix: missing mul unittest set of output tensor type + clearer example

e466fc2

fix: better comment

9886bdc

fix: in registry avoid unwrap on binary out tensor as some are None

fbc8da0

fix: need to maintain forced cast even for mixed quantized/ non-quant…

589063f

…ized inputs such as relu max(a_qu8, 0.0)

feat: add a unit test to avoid forgetting about mixed dt with q8/fp a…

6b752f5

…nd relu operator

feat: support unaligned input tensor for addition

b29f0a8

feat: add test to qadd

659c846

fix: missing replacement

315daaa

fix: more generic attempt

5056be2

fix: enforce more limited type

f50defc

fix: more tests on min/max with q tensors

96667b9

fix: support pow in qu8

c0cc6f8

fix: allows nnef register to map core output dt for min,max,sub,div o…

d018e6b

…perations (usefull for quantized tensors)

fix: graph_name in an example

43deee5

fix: use $crate instead of crate to use crate of macro definition

c2e35ea

fix: rm outdated comment

816e42b

fix: add more generic handling of binary qtensors

3105f7b

fix: more generic optim for max with quantized elements

709acd4

feat: wip of test-rt suite-unit

d9650b0

fix: WIP unit test migration to suite-unit

1c0d54d

feat: updated working test suite for qbinary

0bcbda6

fix: rm relu quant as this is not part of prop test in test-rt

b3d1b73

fix: no more filter in nnef registry

7a6ca0c

fix: rm useless comment

e75cb85

fix: maybe default is needed

2929ae7

JulienBalianSonos force-pushed the fix/elmwise-mul-quant-algined-zp-scale branch from e02b862 to 2929ae7 Compare February 21, 2024 15:10

JulienBalianSonos and others added 6 commits February 21, 2024 16:11

fix: add _suite

2d9fb57

fix: rm warn unused var

7cd9d3b

fix: for now discard q_binary for tflite test suite

1330bb6

fix: more clear operators tested in q_binary

180fb60

fix: useless in f32

b5714f7

ignore extra broken test

ab01f22

JulienBalianSonos changed the title ~~[WIP] feat: more precise qmul handling in tract core with unaligned entry~~ [RFC] feat: more precise qmul handling in tract core with unaligned entry Feb 29, 2024

JulienBalianSonos changed the title ~~[RFC] feat: more precise qmul handling in tract core with unaligned entry~~ [RFC] feat: more precise qbinary handling in tract core with unaligned entry Feb 29, 2024

kali reviewed Mar 1, 2024


JulienBalianSonos added 5 commits March 1, 2024 12:39

fix: align with PR comments

45a1309

Merge remote-tracking branch 'upstream/main' into fix/elmwise-mul-qua…

739a01f

…nt-algined-zp-scale

fix: apply force cast only if at least 1 input is not quantized

b8466cd

fix: allow correct TDim less handling

a6892bb

fix: acceptable error ratio

0b9923a

kali merged commit 99ebf61 into sonos:main Mar 5, 2024
44 of 45 checks passed

JulienBalianSonos deleted the fix/elmwise-mul-quant-algined-zp-scale branch March 5, 2024 10:18
