llvm-project.git/llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp, branch main

Re-land [Transform][LoadStoreVectorizer] allow redundant in Chain (#168135)

2025-11-20T01:39:10+00:00

This is the fixed version of
https://github.com/llvm/llvm-project/pull/163019

Revert "[Transform][LoadStoreVectorizer] allow redundant in Chain (#1… (#168105)

2025-11-14T19:49:09+00:00

…63019)"

This reverts commit 92e5608ffa6ff39ac3707f29418cc9482471f5d9.

[Transform][LoadStoreVectorizer] allow redundant in Chain (#163019)

2025-11-13T20:19:29+00:00

This can absorb redundant loads when forming vector load. Can be used to
fix the situation created by VectorCombine. See:
https://discourse.llvm.org/t/what-is-the-purpose-of-vectorizeloadinsert-in-the-vectorcombine-pass/88532

[LoadStoreVectorizer] Batch alias analysis results to improve compile time (#147555)

2025-07-10T16:23:33+00:00

This should be generally good for a lot of LSV cases, but the attached
test demonstrates a specific compile time issue that appears in the
event where the `CaptureTracking` default max uses is raised.

Without using batching alias analysis, this test takes 6 seconds to
compile in a release build. With, less than a second. This is because
the mechanism that proves `NoAlias` in this case is very expensive
(`CaptureTracking.cpp`), and caching the result leads to 2 calls to that
mechanism instead of ~300,000 (run with -stats to see the difference)

This test only demonstrates the compile time issue if
`capture-tracking-max-uses-to-explore` is set to at least 1024, because
with the default value of 100, the `CaptureTracking` analysis is not
run, `NoAlias` is not proven, and the vectorizer gives up early.

[ValueTracking] Make Depth last default arg (NFC) (#142384)

2025-06-03T16:12:24+00:00

Having a finite Depth (or recursion limit) for computeKnownBits is very
limiting, but is currently a load-bearing necessity, as all KnownBits
are recomputed on each call and there is no caching. As a prerequisite
for an effort to remove the recursion limit altogether, either using a
clever caching technique, or writing a easily-invalidable KnownBits
analysis, make the Depth argument in APIs in ValueTracking uniformly the
last argument with a default value. This would aid in removing the
argument when the time comes, as many callers that currently pass 0
explicitly are now updated to omit the argument altogether.

[NFC] Cleanup dead code in `LoadStoreVectorizer.cpp` (#139211)

2025-05-09T07:28:10+00:00

Closes #138691

[Vectorize] Avoid repeated hash lookups (NFC) (#126345)

2025-02-08T08:48:51+00:00

[NFC][DebugInfo] Use iterator moveBefore at many call-sites (#123583)

2025-01-24T10:53:11+00:00

As part of the "RemoveDIs" project, BasicBlock::iterator now carries a
debug-info bit that's needed when getFirstNonPHI and similar feed into
instruction insertion positions. Call-sites where that's necessary were
updated a year ago; but to ensure some type safety however, we'd like to
have all calls to moveBefore use iterators.

This patch adds a (guaranteed dereferenceable) iterator-taking
moveBefore, and changes a bunch of call-sites where it's obviously safe
to change to use it by just calling getIterator() on an instruction
pointer. A follow-up patch will contain less-obviously-safe changes.

We'll eventually deprecate and remove the instruction-pointer
insertBefore, but not before adding concise documentation of what
considerations are needed (very few).

[LoadStoreVectorizer] Postprocess and merge equivalence classes (#121861)

2025-01-08T01:17:26+00:00

This patch introduces a new method:

void Vectorizer::mergeEquivalenceClasses(EquivalenceClassMap &EQClasses)
const;

The method is called at the end of
Vectorizer::collectEquivalenceClasses() and is needed to merge
equivalence classes that differ only by their underlying objects (UO1
and UO2), where UO1 is 1-level-indirection underlying base for UO2. This
situation arises due to the limited lookup depth used during the search
of underlying bases with llvm::getUnderlyingObject(ptr).

Using any fixed lookup depth can result into creation of multiple
equivalence classes that only differ by 1-level indirection bases.

The new approach merges equivalence classes if they have adjacent bases
(1-level indirection). If a series of equivalence classes form ladder
formed of 1-step/level indirections, they are all merged into a single
equivalence class. This provides more opportunities for the load-store
vectorizer to generate better vectors.

---------

Signed-off-by: Klochkov, Vyacheslav N

Revert "[LoadStoreVectorizer] Postprocess and merge equivalence classes" (#119657)

2024-12-12T04:36:23+00:00

Reverts llvm/llvm-project#114501, due to the following failure:
https://lab.llvm.org/buildbot/#/builders/55/builds/4171