llvm-project.git/libcxx/include/__bit_reference, branch main

[libc++] Replace __libcpp_{ctz, clz} with __builtin_{ctzg, clzg} (#133920)

2025-04-19T00:57:05+00:00

`__libcpp_{ctz, clz}` were previously used as fallbacks for `__builtin_{ctzg, clzg}` to ensure compatibility with older compilers (Clang 18 and earlier), as `__builtin_{ctzg, clzg}` became available in Clang 19. Now that support for Clang 18 has been officially dropped in #130142, we can safely replace all instances of `__libcpp_{ctz, clz}` with `__count{r,l}_zero` (which internally call `__builtin_{ctzg, clzg}`) and eliminate the fallback logic.

Closes #131179.
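
As an illustration of the kind of fallback logic being removed, here is a minimal sketch, not the libc++ implementation. The names `SKETCH_HAS_CTZG` and `ctz_generic_sketch` are hypothetical. It assumes a compiler that either provides the type-generic `__builtin_ctzg` (which accepts an optional second argument returned for a zero input) or falls back to a plain loop.

```cpp
#include <cassert>
#include <climits>

// Detect the type-generic builtin; SKETCH_HAS_CTZG is a hypothetical macro
// introduced only for this example.
#if defined(__has_builtin)
#  if __has_builtin(__builtin_ctzg)
#    define SKETCH_HAS_CTZG 1
#  endif
#endif
#ifndef SKETCH_HAS_CTZG
#  define SKETCH_HAS_CTZG 0
#endif

// Counts trailing zero bits of an unsigned value; returns the bit width of T
// when x == 0.
template <class T>
int ctz_generic_sketch(T x) {
  const int digits = static_cast<int>(sizeof(T) * CHAR_BIT);
#if SKETCH_HAS_CTZG
  // One builtin handles every unsigned width, so no per-type overloads
  // are needed.
  return __builtin_ctzg(x, digits);
#else
  // Fallback path for compilers without the generic builtin.
  if (x == 0)
    return digits;
  int n = 0;
  while ((x & 1) == 0) {
    x >>= 1;
    ++n;
  }
  return n;
#endif
}
```

Once every supported compiler provides the builtin, the `#else` branch in a sketch like this becomes dead code, which is the situation this commit describes.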

[libc++] Remove unnecessary division and modulo operations in bitset (#121312)

2025-03-26T16:02:03+00:00

This PR removes the unnecessary division and modulo operations in the
one-word specialization `__bitset<1, _Size>`. For this specialization,
`__pos < __bits_per_word` always holds (as `__bitset<1, _Size>` is an
implementation detail only used by the public `bitset`), so
`__pos / __bits_per_word == 0` and `__pos % __bits_per_word == __pos`.
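
The identity can be checked with a small stand-alone sketch. `OneWordBits` is a hypothetical stand-in written for this example, not libc++'s `__bitset<1, _Size>`; it only assumes that every valid position is smaller than the number of bits in the single storage word.

```cpp
#include <cassert>
#include <climits>
#include <cstddef>

// Hypothetical single-word bit container used only for illustration.
struct OneWordBits {
  using word_type = unsigned long long;
  static constexpr std::size_t bits_per_word = sizeof(word_type) * CHAR_BIT;
  word_type word = 0;

  // What general multi-word indexing would compute.
  static std::size_t word_index(std::size_t pos) { return pos / bits_per_word; }
  static std::size_t bit_index(std::size_t pos) { return pos % bits_per_word; }

  // With pos < bits_per_word, word_index(pos) is 0 and bit_index(pos) is pos,
  // so the single word can be addressed directly.
  void set(std::size_t pos) {
    assert(pos < bits_per_word);
    word |= word_type(1) << pos;
  }
  bool test(std::size_t pos) const {
    assert(pos < bits_per_word);
    return ((word >> pos) & 1) != 0;
  }
};
```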

[libc++] Fix {std, ranges}::equal for vector<bool> with small storage types (#130394)

2025-03-19T15:51:21+00:00

The current implementation of `{std, ranges}::equal` fails to correctly
compare `vector<bool>`s when the underlying storage type is smaller than
`int` (e.g., `unsigned char`, `unsigned short`, `uint8_t` and
`uint16_t`); see the [demo](https://godbolt.org/z/j4s87s6b3). The problem
arises from integral promotions in the intermediate bitwise
operations, which lead to incorrect equality comparison results. This
patch fixes the issue by ensuring that `{std, ranges}::equal` operate
properly for both aligned and unaligned bits.
 
Fixes #126369.
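
To illustrate how integral promotion can corrupt such a comparison, here is a minimal sketch. The functions `naive_match` and `fixed_match` are hypothetical and are not the code from the patch; they assume an 8-bit storage word and a comparison of masked word contents against shifted bits.

```cpp
#include <cassert>
#include <cstdint>

using word_t = std::uint8_t;  // assumed small storage type for the example

// Naive: `bits << shift` is evaluated as int after promotion, so bits shifted
// past bit 7 are kept and the comparison can fail spuriously.
inline bool naive_match(word_t word, word_t mask, word_t bits, unsigned shift) {
  return (word & mask) == (bits << shift);
}

// Fixed: both sides are converted back to the storage type before comparing,
// which discards the bits that do not fit in the word.
inline bool fixed_match(word_t word, word_t mask, word_t bits, unsigned shift) {
  return static_cast<word_t>(word & mask) == static_cast<word_t>(bits << shift);
}
```

With `word = 0xF0`, `mask = 0xF0`, `bits = 0xFF` and `shift = 4`, the naive version compares `0xF0` against `0xFF0` and reports a mismatch, while the fixed version compares `0xF0` against `0xF0`.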

[libc++] Fix ambiguous call in {ranges, std}::find (#122641)

2025-03-13T18:15:03+00:00

This PR fixes an ambiguous call encountered when using the `std::ranges::find` or `std::find`
algorithms with `vector<bool>` with small `allocator_traits::size_type`s, an issue reported
in #122528. The ambiguity arises from integral promotions during the internal bitwise
arithmetic of the `find` algorithms when applied to `vector<bool>` with small integral
`size_type`s. This leads to multiple viable candidates for small integral types:
`__libcpp_ctz(unsigned)`, `__libcpp_ctz(unsigned long)`, and `__libcpp_ctz(unsigned long long)`,
none of which is the single best viable match, resulting in an ambiguous call error.

To resolve this, we propose invoking an internal function `__countr_zero` as a dispatcher
that directs the call to the appropriate overload of `__libcpp_ctz`. Necessary amendments
have also been made to `__countr_zero`.
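
The dispatcher idea can be sketched as follows. This is an illustration only: `ctz_overload` and `countr_zero_sketch` are hypothetical names, and the sketch assumes a GCC- or Clang-compatible compiler providing `__builtin_ctz`, `__builtin_ctzl` and `__builtin_ctzll`.

```cpp
#include <cassert>
#include <climits>
#include <type_traits>

// Overload set with the same shape as the one described above.
inline int ctz_overload(unsigned x) { return __builtin_ctz(x); }
inline int ctz_overload(unsigned long x) { return __builtin_ctzl(x); }
inline int ctz_overload(unsigned long long x) { return __builtin_ctzll(x); }

// Calling ctz_overload with an unsigned char (or with the int produced by
// arithmetic on it) is ambiguous: every overload requires an integral
// conversion and none of them ranks better than the others.

// Dispatcher: selects the overload from the size of T and converts
// explicitly, so overload resolution never sees a small or promoted type.
template <class T>
int countr_zero_sketch(T x) {
  static_assert(std::is_unsigned<T>::value, "expects an unsigned integer type");
  if (x == 0)  // the builtins above are undefined for a zero argument
    return static_cast<int>(sizeof(T) * CHAR_BIT);
  if (sizeof(T) <= sizeof(unsigned))
    return ctz_overload(static_cast<unsigned>(x));
  if (sizeof(T) <= sizeof(unsigned long))
    return ctz_overload(static_cast<unsigned long>(x));
  return ctz_overload(static_cast<unsigned long long>(x));
}
```

Because the cast target is chosen inside the template, callers can pass any unsigned type without having to know which overloads exist.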

[libc++] Optimize ranges::rotate for vector<bool>::iterator (#121168)

2025-03-13T18:07:23+00:00

This PR optimizes the performance of `std::ranges::rotate` for
`vector<bool>::iterator`. The optimization yields a performance
improvement of up to 2096x.

Closes #64038.

[libc++] Optimize ranges::swap_ranges for vector<bool>::iterator (#121150)

2025-03-04T22:15:36+00:00

This PR optimizes the performance of `std::ranges::swap_ranges` for
`vector<bool>::iterator`, addressing a subtask outlined in issue #64038.
The optimizations yield performance improvements of up to **611x** for
aligned range swaps and **78x** for unaligned range swaps.
Additionally, comprehensive tests covering up to 4 storage words (256
bits) with odd and even bit sizes are provided, which validate the
proposed optimizations in this patch.
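
For readers unfamiliar with word-wise bit swapping, the aligned case can be sketched as below. This is a simplified illustration and not the code from the patch: `swap_masked_bits` and `aligned_swap_bits` are hypothetical names, the storage word is assumed to be 64 bits, and both ranges are assumed to start at bit 0 of their first word.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

using word_t = std::uint64_t;                // assumed storage word
constexpr std::size_t kBitsPerWord = 64;

// Swaps only the bits selected by `mask` between two words.
inline void swap_masked_bits(word_t& a, word_t& b, word_t mask) {
  const word_t diff = (a ^ b) & mask;  // positions inside the mask that differ
  a ^= diff;
  b ^= diff;
}

// Swaps the first `nbits` bits of two word-aligned bit sequences: whole words
// are swapped directly, and a trailing partial word is swapped through a mask.
inline void aligned_swap_bits(std::vector<word_t>& a, std::vector<word_t>& b,
                              std::size_t nbits) {
  const std::size_t full = nbits / kBitsPerWord;
  const std::size_t rem = nbits % kBitsPerWord;
  const std::size_t needed = full + (rem != 0 ? 1 : 0);
  assert(a.size() >= needed && b.size() >= needed);
  for (std::size_t i = 0; i < full; ++i)
    std::swap(a[i], b[i]);
  if (rem != 0)
    swap_masked_bits(a[full], b[full], (word_t(1) << rem) - 1);
}
```

Processing one word per iteration instead of one bit is what makes speedups of this magnitude plausible for the aligned case.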

[libc++] Optimize ranges::equal for vector<bool>::iterator (#121084)

2025-02-26T17:18:25+00:00

This PR optimizes the performance of `std::ranges::equal` for
`vector<bool>::iterator`, addressing a subtask outlined in issue #64038.
The optimizations yield performance improvements of up to 188x for
aligned equality comparison and 82x for unaligned equality
comparison. Moreover, comprehensive tests covering up to 4 storage words
(256 bits) with odd and even bit sizes are provided, which validate the
proposed optimizations in this patch.
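
A minimal sketch of the aligned comparison follows. It is an illustration under stated assumptions rather than the patch itself: `aligned_bits_equal` is a hypothetical name, the storage word is assumed to be 64 bits, and both bit sequences are assumed to start at bit 0 of their first word.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using word_t = std::uint64_t;                // assumed storage word
constexpr std::size_t kBitsPerWord = 64;

// Compares the first `nbits` bits of two word-aligned bit sequences.
// Full words are compared directly; a trailing partial word is compared
// through a mask so that bits past the end of the range are ignored.
inline bool aligned_bits_equal(const std::vector<word_t>& a,
                               const std::vector<word_t>& b,
                               std::size_t nbits) {
  const std::size_t full = nbits / kBitsPerWord;
  const std::size_t rem = nbits % kBitsPerWord;
  const std::size_t needed = full + (rem != 0 ? 1 : 0);
  assert(a.size() >= needed && b.size() >= needed);
  for (std::size_t i = 0; i < full; ++i)
    if (a[i] != b[i])
      return false;
  if (rem != 0) {
    const word_t mask = (word_t(1) << rem) - 1;  // low `rem` bits set
    if ((a[full] & mask) != (b[full] & mask))
      return false;
  }
  return true;
}
```

The unaligned case is harder because the two ranges start at different bit offsets, so each word of one range has to be compared against bits spread over two words of the other.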

[libc++] Optimize ranges::move{,_backward} for vector<bool>::iterator (#121109)

2025-02-19T16:36:45+00:00

As a follow-up to #121013 (which optimized `ranges::copy`) and #121026
(which optimized `ranges::copy_backward`), this PR enhances the
performance of `std::ranges::{move, move_backward}` for
`vector<bool>::iterator`, addressing a subtask outlined in issue #64038.

The optimizations bring performance improvements analogous to those
achieved for the `{copy, copy_backward}` algorithms: up to 2000x for
aligned moves and 60x for unaligned moves. Moreover, comprehensive
tests covering up to 4 storage words (256 bits) with odd and even bit
sizes are provided, which validate the proposed optimizations in this
patch.

[libc++] Fix UB in bitwise logic of {std, ranges}::{fill, fill_n} algorithms (#122410)

2025-02-05T16:39:49+00:00

This PR addresses undefined behavior that arises when using the
`std::fill` and `std::fill_n` algorithms, as well as their ranges
counterparts `ranges::fill` and `ranges::fill_n`, with a `vector<bool>`
whose custom allocator uses a small integral `size_type`.

[libc++] Optimize ranges::copy_backward for vector<bool>::iterator (#121026)

2025-01-30T19:55:05+00:00

As a follow-up to #121013 (which focused on `std::ranges::copy`), this
PR optimizes the performance of `std::ranges::copy_backward` for
`vector<bool>::iterator`, addressing a subtask outlined in issue #64038.
The optimizations yield performance improvements of up to 2000x for
aligned copies and 60x for unaligned copies.