I think there is a bug in how aggregate columns are computed with universe samples when the query's group-by includes the column the universe sample was built on.
For example, suppose a universe sample on column X of table A is used, and the query groups by columns X and Y of table A. In this case, every group that appears in the sample contains all of its rows, so its aggregate needs no further adjustment by the sampling probability. However, the current algorithm always adjusts aggregated values by the sampling probability, which gives the estimated aggregates a very high error in this particular case.
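To make the case concrete, here is a minimal sketch in Python. It is not the project's actual code: `in_universe`, `estimate_count`, the 50% ratio, and the toy table are all assumptions made for illustration. It builds a hash-based universe sample on X, groups by (X, Y), and compares the raw per-group counts with counts scaled by the sampling probability.

```python
import hashlib
from collections import Counter

RATIO = 0.5  # assumed sampling probability of the universe sample


def in_universe(x, ratio):
    """Hypothetical universe-sampling rule: keep a value of X when its
    hash falls in the first `ratio` fraction of the hash space, so all
    rows sharing that X value are kept or dropped together."""
    digest = hashlib.sha256(str(x).encode()).digest()
    return int.from_bytes(digest[:8], "big") / 2**64 < ratio


def estimate_count(sample_count, ratio, group_by_has_sample_column):
    """Hypothetical fix: skip the 1/ratio scaling when the group-by
    includes the universe-sample column."""
    if group_by_has_sample_column:
        return sample_count
    return sample_count / ratio


# Toy table A: each X value has three rows with Y = 0 and two with Y = 1.
rows = [(x, y) for x in range(100) for y in (0, 0, 0, 1, 1)]
sample = [row for row in rows if in_universe(row[0], RATIO)]

true_counts = Counter(rows)      # exact COUNT(*) ... GROUP BY X, Y
sample_counts = Counter(sample)  # same query run on the universe sample

for group, count in sample_counts.items():
    scaled = estimate_count(count, RATIO, group_by_has_sample_column=False)
    unscaled = estimate_count(count, RATIO, group_by_has_sample_column=True)
    print(group, "true:", true_counts[group],
          "scaled:", scaled, "unscaled:", unscaled)
```

In this sketch, every group present in the sample already has its exact count, so dividing by the sampling probability inflates each group's estimate by a factor of 1/ratio. Groups whose X value was not sampled are simply absent from the result, and scaling the surviving groups does not compensate for that.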