add trailing_zeros and leading_zeros to non zero types #79114

andjo403 · 2020-11-16T22:06:18Z

as a way towards being able to use the optimized intrinsics ctlz_nonzero and cttz_nonzero from stable.

have not crated any tracking issue if this is not a solution that is wanted

rust-highfive · 2020-11-16T22:06:22Z

(rust_highfive has picked a reviewer for you, use r? to override)

scottmcm · 2020-11-16T22:25:30Z

I'm a fan 👍 This is important for avoiding dead ASM for the default x64 target, but can even improve the generated code on newer target-cpus too because of the tighter value range. An example: #70835 (comment)

m-ou-se

Looks like a good addition!

Maybe it's good to explain the existence of these functions in the doc comments? Maybe something like On many architectures, this function can perform better than `leading_zeros()` on the underlying integer type, as special handling of zero can be avoided., or something in that direction.

library/core/src/num/nonzero.rs

library/core/tests/nonzero.rs

andjo403 · 2020-11-17T18:26:38Z

thanks for the comments have addressed them and pushed up the changes now

m-ou-se

Thanks! One more small comment:

library/core/src/num/nonzero.rs

andjo403 · 2020-11-17T18:56:04Z

thanks again and fixed

m-ou-se · 2020-11-17T19:04:28Z

Thanks!

@bors r+ rollup=always

bors · 2020-11-17T19:04:30Z

📌 Commit 9bbc4c1 has been approved by m-ou-se

…eros, r=m-ou-se add trailing_zeros and leading_zeros to non zero types as a way towards being able to use the optimized intrinsics ctlz_nonzero and cttz_nonzero from stable. have not crated any tracking issue if this is not a solution that is wanted

Rollup of 11 pull requests Successful merges: - rust-lang#78361 (Updated the list of white-listed target features for x86) - rust-lang#78785 (linux: try to use libc getrandom to allow interposition) - rust-lang#78999 (stability: More precise location for deprecation lint on macros) - rust-lang#79039 (Tighten the bounds on atomic Ordering in std::sys::unix::weak::Weak) - rust-lang#79079 (Turn top-level comments into module docs in MIR visitor) - rust-lang#79114 (add trailing_zeros and leading_zeros to non zero types) - rust-lang#79131 (Enable AVX512 *epi64 variants by updating stdarch) - rust-lang#79133 (bootstrap: use the same version number for rustc and cargo) - rust-lang#79145 (Fix handling of panic calls) - rust-lang#79151 (Fix typo in `std::io::Write` docs) - rust-lang#79158 (type is too big -> values of the type are too big) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup

leonardo-m · 2020-11-19T13:21:19Z

I've tried this in my codebase, looking at the generated asm, and I've seen that in 100% of the cases, thanks sometimes to inlining, LLVM was able to infer that the input isn't zero, so no performance change has happened.

m-ou-se · 2020-11-19T13:25:23Z

In cases where the non-zero value are is created somewhat close to the place where leading_zeros/trailing_zeros is applied, it will definitely optimize this nicely. But just using NonZeroI32 by itself without the context that creates the value, doesn't result in this optimization (yet?): https://godbolt.org/z/bEoPE9

scottmcm · 2020-11-19T13:42:07Z

@leonardo-m As another example, note that LLVM is currently not capable of optimizing this even when there's an explicit check for zero in the code before doing the uN::leading_zeros: https://rust.godbolt.org/z/4EhnK4

leonardo-m · 2020-11-21T21:02:38Z

But just using NonZeroI32 by itself without the context that creates the value, doesn't result in this optimization (yet?): https://godbolt.org/z/bEoPE9

Using "-C opt-level=2 -C target-cpu=native" it seems to optimize it well.

leonardo-m · 2020-11-21T21:05:10Z

As another example, note that LLVM is currently not capable of optimizing this even when there's an explicit check for zero in the code before doing the uN::leading_zeros: https://rust.godbolt.org/z/4EhnK4

With "-C opt-level=3 -C target-cpu=native -Z mir-opt-level=3" it seems to optimize well.

m-ou-se · 2020-11-21T21:06:23Z

@leonardo-m With that option it makes use of an instruction that does not need a special case for zero. It still doesn't make use of the fact that NonZero* cannot be zero.

leonardo-m · 2020-11-21T21:15:24Z

Oh, fun, thank you. It's still the same LLVM limit, I guess:
#54868

scottmcm · 2020-11-22T03:40:51Z

With "-C opt-level=3 -C target-cpu=native -Z mir-opt-level=3" it seems to optimize well.

native there isn't doing anything different from haswell in my example -- the choice was those two was just to pick up instruction sets with and without BMI1.

Very interesting that mir-opts affect this, though...

rust-highfive assigned m-ou-se Nov 16, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Nov 16, 2020

m-ou-se reviewed Nov 17, 2020

View reviewed changes

library/core/src/num/nonzero.rs Outdated Show resolved Hide resolved

library/core/src/num/nonzero.rs Outdated Show resolved Hide resolved

library/core/tests/nonzero.rs Outdated Show resolved Hide resolved

m-ou-se added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Nov 17, 2020

andjo403 mentioned this pull request Nov 17, 2020

Tracking Issue for feature(nonzero_leading_trailing_zeros) #79143

Closed

andjo403 force-pushed the nonzero_leading_trailing_zeros branch from 7cda148 to 02a6aad Compare November 17, 2020 18:24

andjo403 force-pushed the nonzero_leading_trailing_zeros branch from 02a6aad to e8864a0 Compare November 17, 2020 18:29

m-ou-se reviewed Nov 17, 2020

View reviewed changes

library/core/src/num/nonzero.rs Outdated Show resolved Hide resolved

add trailing_zeros and leading_zeros to non zero types

9bbc4c1

andjo403 force-pushed the nonzero_leading_trailing_zeros branch from e8864a0 to 9bbc4c1 Compare November 17, 2020 18:55

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 17, 2020

m-ou-se mentioned this pull request Nov 18, 2020

Rollup of 11 pull requests #79165

Closed

m-ou-se mentioned this pull request Nov 18, 2020

Rollup of 11 pull requests #79167

Merged

bors merged commit 126d88b into rust-lang:master Nov 18, 2020

rustbot added this to the 1.50.0 milestone Nov 18, 2020

andjo403 deleted the nonzero_leading_trailing_zeros branch November 18, 2020 20:39

Soveu mentioned this pull request Feb 17, 2021

Variables with rustc_layout_scalar_valid_range are not assumed to be in their range #82224

Closed

andjo403 mentioned this pull request Apr 11, 2021

Tracking Issue for const_nonzero_leading_trailing_zeros #84089

Closed

1 task

add trailing_zeros and leading_zeros to non zero types #79114

add trailing_zeros and leading_zeros to non zero types #79114

Uh oh!

Conversation

andjo403 commented Nov 16, 2020

Uh oh!

rust-highfive commented Nov 16, 2020

Uh oh!

scottmcm commented Nov 16, 2020

Uh oh!

m-ou-se left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andjo403 commented Nov 17, 2020

Uh oh!

m-ou-se left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andjo403 commented Nov 17, 2020

Uh oh!

m-ou-se commented Nov 17, 2020

Uh oh!

bors commented Nov 17, 2020

Uh oh!

leonardo-m commented Nov 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

m-ou-se commented Nov 19, 2020

Uh oh!

scottmcm commented Nov 19, 2020

Uh oh!

leonardo-m commented Nov 21, 2020

Uh oh!

leonardo-m commented Nov 21, 2020

Uh oh!

m-ou-se commented Nov 21, 2020

Uh oh!

leonardo-m commented Nov 21, 2020

Uh oh!

scottmcm commented Nov 22, 2020

Uh oh!

Uh oh!

leonardo-m commented Nov 19, 2020 •

edited

Loading