Add a fast integer divide that rounds to zero by abadams · Pull Request #6455 · halide/Halide

abadams · 2021-11-30T21:20:35Z

While working on legacy code I discovered a need for this. The performance test shows a good speed-up over native division for vector code:

Signed division rounding to zero:
type           const-divisor speed-up   runtime-divisor speed-up
Int(32,  1)     2.416                    1.153
Int(16,  1)     2.552                    1.457
Int( 8,  1)     1.782                    0.667
Int(32,  8)     8.592                    5.908
Int(16, 16)    53.008                   38.505
Int( 8, 32)    19.480                    8.197

dsharletg · 2021-11-30T21:26:33Z

+        Expr xsign = select(numerator > 0, cast(t, 0), cast(t, -1));
+
+        // Multiply-keep-high-half
+        result = (cast(wide, mul) * numerator);


I think this should use widening_mul intrinsics, because uses of this are after find_intrinsics. Maybe this whole sequence should be mul_shift_right.

Actually this code is only called directly by users, so it's before find_intrinsics. The compiler doesn't ever call this.

Maybe add this as a comment for future readers.

I actually think we should change it to intrinsics anyways. But since the code is just moved and pre-existing, maybe it should be a separate PR.
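For readers unfamiliar with the "multiply-keep-high-half" idea in the hunk above, here is a minimal C++ sketch of division that rounds toward zero using a multiply and a shift. It is not the code from this PR and does not use Halide: the names `div_round_to_zero_sketch` and `matches_native_division` are hypothetical, the divisor is assumed to be positive, and the multiplier `ceil(2^32 / den)` is one simple textbook choice rather than the constants Halide actually selects.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not Halide's implementation): divide a 16-bit signed
// numerator by a positive 16-bit divisor, rounding toward zero.
int16_t div_round_to_zero_sketch(int16_t num, int16_t den) {
    assert(den > 0);  // assumption of this sketch: positive divisors only

    // Multiplier ceil(2^32 / den). In real use this would be precomputed once
    // per divisor, so the divide here is not on the hot path.
    const uint64_t mul = ((uint64_t(1) << 32) + uint64_t(den) - 1) / uint64_t(den);

    // Work on the magnitude so that the final result rounds toward zero.
    const uint64_t mag = uint64_t(num < 0 ? -int32_t(num) : int32_t(num));

    // Widening multiply, then keep only the bits above bit 32.
    const int32_t q = int32_t((mag * mul) >> 32);

    // Restore the sign of the numerator.
    return int16_t(num < 0 ? -q : q);
}

// Exhaustively compare against native C++ division (which truncates toward
// zero) for every 16-bit numerator.
bool matches_native_division(int16_t den) {
    for (int32_t num = INT16_MIN; num <= INT16_MAX; num++) {
        if (div_round_to_zero_sketch(int16_t(num), den) != int16_t(num / den)) {
            return false;
        }
    }
    return true;
}
```

The exhaustive check is practical because the numerator is only 16 bits wide; wider types would need sampling or a proof instead.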

steven-johnson · 2021-11-30T21:46:38Z


-        // Reference good version
-        g(x, y) = input(x, y) / cast<T>(y + min_val);
+            // Reference good version


This looks identical to the case just above. Are they supposed to be identical?

Yes, they have different schedules which turn the denominator into a constant in one case but not the other.

(I'll add a comment)

steven-johnson · 2021-11-30T21:47:24Z

+bool srz_method_0(int den, int sh_post, int bits) {
+    int64_t min = -(1L << (bits - 1)), max = (1L << (bits - 1)) - 1;
+    for (int64_t num = min; num <= max; num++) {
+        // for (int iter = 0; iter < 1000000L; iter++) {


Why is this commented out? If it's being left in for (eg) debugging purposes, please say so.

Fixed (deleted)

abadams · 2021-11-30T22:00:11Z

See also related issue #6456

abadams · 2021-12-02T14:07:25Z

review ping

steven-johnson

LGTM

steven-johnson · 2021-12-02T17:41:51Z

+        Expr xsign = select(numerator > 0, cast(t, 0), cast(t, -1));
+
+        // Multiply-keep-high-half
+        result = (cast(wide, mul) * numerator);


Maybe add this as a comment for future readers.

lordnn · 2022-09-10T21:50:20Z

buf(x) = fast_integer_divide_round_to_zero(select(x % 2 == 0, 5, -5), 2);
result is:
2, -3, 2, -3, 2, -3
Not rounded to zero.

abadams · 2022-09-10T23:05:44Z

Looks like there's a bug in the handling of constant denominators (an early-out path that assumes we're rounding to -infinity). Will fix.
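The difference between the two rounding modes in the report above can be reproduced in plain C++, independent of Halide. The sketch below uses hypothetical helper names and assumes a positive denominator; it shows that -5 / 2 is -2 when rounding toward zero and -3 when rounding toward negative infinity, which matches the values reported.

```cpp
#include <cassert>

// C++ integer division truncates, i.e. rounds toward zero.
int div_toward_zero(int num, int den) {
    return num / den;
}

// Rounds toward negative infinity (floor). Assumes den > 0 for this sketch.
int div_toward_neg_inf(int num, int den) {
    assert(den > 0);
    int q = num / den;
    // A negative remainder means truncation rounded up, so step down by one.
    if (num % den < 0) {
        q -= 1;
    }
    return q;
}
```

The two functions agree for non-negative numerators and differ by one only when the numerator is negative and not a multiple of the denominator.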

abadams · 2022-09-11T00:15:38Z

See #7008

abadams added 3 commits November 30, 2021 13:17

Add a version of fast_integer_divide that rounds towards zero

aa10a41

clang-format

67f0170

Fix test condition

0c12734

abadams requested a review from dsharletg November 30, 2021 21:20

dsharletg reviewed Nov 30, 2021


steven-johnson reviewed Nov 30, 2021


abadams added 2 commits November 30, 2021 13:55

Clean up debugging code

914cbdd

Add explanatory comment to performance test

61aabe3

Pacify clang tidy

f215365

steven-johnson approved these changes Dec 2, 2021


dsharletg approved these changes Dec 2, 2021


abadams merged commit 7992369 into master Dec 7, 2021
