llvm-project.git/llvm/lib/CodeGen/RegisterClassInfo.cpp, branch main

[CodeGen] Turn MCRegUnit into an enum class (NFC) (#167943)

2025-11-16T17:46:44+00:00

This changes the `MCRegUnit` type from `unsigned` to `enum class : unsigned`
and inserts the necessary casts.
The added `MCRegUnitToIndex` functor is used with `SparseSet`,
`SparseMultiSet` and `IndexedMap` in a few places.
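
To illustrate the idea, here is a minimal self-contained sketch of a scoped-enum register unit with an index functor. It does not use LLVM headers: `RegUnitToIndex` and `ToyIndexedMap` are hypothetical stand-ins for `MCRegUnitToIndex` and `IndexedMap`, and the real definitions may differ.

```cpp
#include <type_traits>
#include <vector>

// Stand-in for the real type: a scoped enum over unsigned.
enum class MCRegUnit : unsigned {};

// Hypothetical functor mapping a unit to a dense index, in the spirit of
// MCRegUnitToIndex. The cast lives in exactly one place.
struct RegUnitToIndex {
  unsigned operator()(MCRegUnit U) const { return static_cast<unsigned>(U); }
};

// Hypothetical toy container keyed by MCRegUnit (not llvm::IndexedMap).
template <typename T, typename ToIndex> class ToyIndexedMap {
  std::vector<T> Storage;
  ToIndex Index;

public:
  explicit ToyIndexedMap(unsigned Size) : Storage(Size) {}
  T &operator[](MCRegUnit U) { return Storage[Index(U)]; }
};

// A scoped enum no longer converts to unsigned implicitly, so indexing a
// plain array or bit vector with it requires an explicit cast.
static_assert(!std::is_convertible<MCRegUnit, unsigned>::value,
              "MCRegUnit must not convert implicitly");

// Store and read back a value through the typed key.
int demo() {
  ToyIndexedMap<int, RegUnitToIndex> Weights(8);
  MCRegUnit U = static_cast<MCRegUnit>(3);
  Weights[U] = 42;
  return Weights[U];
}
```

With a container that takes the functor, call sites pass `MCRegUnit` values directly and need no casts of their own.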

`MCRegUnit` is opaque to users, so it didn't seem worth making it a
full-fledged class like `Register`.

Static type checking has detected one issue in
`PrologueEpilogueInserter.cpp`, where `BitVector` created for
`MCRegister` is indexed by both `MCRegister` and `MCRegUnit`.

The number of casts could be reduced by using `IndexedMap` in more
places and/or adding a `BitVector` adaptor, but the number of casts *per
file* is still small and `IndexedMap` has limitations, so it didn't seem
worth the effort.

Pull Request: https://github.com/llvm/llvm-project/pull/167943

[X86][BreakFalseDeps] Using reverse order for undef register selection (#137569)

2025-06-11T14:08:20+00:00

BreakFalseDeps picks the best register for undef operands when an
instruction has a false dependency. The problem is that when the instruction
is close to the beginning of the function, ReachingDefAnalysis is overly
optimistic about unused registers, which results in collisions with
registers just defined in the caller.

This patch selects the undef register in reverse order, which reduces the
probability of register collisions between caller and callee. It improves
some of our internal benchmarks with negligible effect on other benchmarks.
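
The effect of iteration order on tie-breaking can be sketched as follows. This is an illustration only, not the pass's actual code: `pickUndefReg`, `Order`, and `Clearance` are hypothetical names, and the sketch assumes the candidate with the largest clearance wins. When every unused register looks equally good, a back-to-front walk picks the last register in the order instead of the first.

```cpp
#include <cassert>
#include <vector>

// Pick the candidate with the largest clearance. Walking the allocation
// order back to front means ties go to the register that appears last.
// Order holds register ids; Clearance is indexed by register id.
unsigned pickUndefReg(const std::vector<unsigned> &Order,
                      const std::vector<unsigned> &Clearance) {
  assert(!Order.empty() && "need at least one candidate");
  unsigned Best = Order.back();
  for (auto It = Order.rbegin(); It != Order.rend(); ++It)
    if (Clearance[*It] > Clearance[Best])
      Best = *It; // strictly better only, so earlier ties do not win
  return Best;
}
```

A register with strictly larger clearance is still chosen wherever it sits in the order; only the tie-breaking changes.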

[CodeGen] Remove some implicit conversions of MCRegister to unsigned by using(). NFC

2025-01-19T21:18:04+00:00

Many of these are indexing BitVectors or something similar where we can't
use MCRegister and need the register number.

[CodeGen] Use regunits instead of MCRegUnitIterator in RegisterClassInfo. NFC.

2024-01-31T16:27:54+00:00

[CodeGen] Simplify RegisterClassInfo BitVector comparisons. NFC.

2024-01-31T16:25:19+00:00

Revert "[CodeGen] Don't include aliases in RegisterClassInfo::IgnoreCSRForAllocOrder (#80015)"

2024-01-31T10:25:51+00:00

This reverts commit f8525030004f907cd108e7c18df255a6d3b23124.

It was supposed to speed things up but llvm-compile-time-tracker.com
showed a slight slowdown.

[CodeGen] Don't include aliases in RegisterClassInfo::IgnoreCSRForAllocOrder (#80015)

2024-01-31T08:16:06+00:00

Previously we called ignoreCSRForAllocationOrder on every alias of every
CSR, which was expensive on targets like AMDGPU that define a very large
number of overlapping register tuples.

On such targets it is simpler and faster to call
ignoreCSRForAllocationOrder once for every physical register.

Differential Revision: https://reviews.llvm.org/D146735

[CodeGen] Use RegUnits in RegisterClassInfo::getLastCalleeSavedAlias (#79996)

2024-01-30T14:06:45+00:00

Change the implementation of getLastCalleeSavedAlias to use RegUnits
instead of register aliases. This is much faster on targets like AMDGPU
which define a very large number of overlapping register tuples.

No functional change intended. If PhysReg overlaps multiple CSRs then
getLastCalleeSavedAlias(PhysReg) could conceivably return a different
arbitrary one, but currently it is only used for some debug printing
anyway.
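
A rough sketch of a regunit-based lookup, under stated assumptions: `ToyRegInfo`, `buildUnitToCSR`, and `lastCalleeSavedAlias` are hypothetical, and the toy tables stand in for the target's register description. The cost is proportional to the number of units a register covers rather than to the number of its aliases.

```cpp
#include <vector>

using PhysReg = unsigned; // 0 means "no register"
using RegUnit = unsigned;

// Hypothetical toy register description: the units each register covers.
struct ToyRegInfo {
  std::vector<std::vector<RegUnit>> UnitsOf; // indexed by PhysReg
  unsigned NumUnits;
};

// For each unit, remember the last CSR (in list order) that covers it.
std::vector<PhysReg> buildUnitToCSR(const ToyRegInfo &TRI,
                                    const std::vector<PhysReg> &CSRs) {
  std::vector<PhysReg> UnitToCSR(TRI.NumUnits, 0);
  for (PhysReg CSR : CSRs)
    for (RegUnit U : TRI.UnitsOf[CSR])
      UnitToCSR[U] = CSR;
  return UnitToCSR;
}

// Return some CSR overlapping Reg, or 0 if none. If Reg overlaps several
// CSRs, which one is returned depends on unit order.
PhysReg lastCalleeSavedAlias(const ToyRegInfo &TRI,
                             const std::vector<PhysReg> &UnitToCSR,
                             PhysReg Reg) {
  for (RegUnit U : TRI.UnitsOf[Reg])
    if (PhysReg CSR = UnitToCSR[U])
      return CSR;
  return 0;
}
```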

Differential Revision: https://reviews.llvm.org/D146734

[CodeGen] Use range-based for loops (NFC)

2023-12-25T06:45:50+00:00

Fix CSR update check

2022-08-25T01:09:49+00:00

D132080 introduced a bug leading to `RegisterClassInfo` caches not
getting invalidated when exactly one more CSR was added.
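
For illustration, one way to compare a zero-terminated CSR list against a cached copy so that a single extra trailing entry counts as a change. This is a sketch only and reproduces neither the buggy nor the fixed code; `csrChanged` is a hypothetical helper and the `MCPhysReg` alias here is an assumption.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

using MCPhysReg = uint16_t; // assumption: small integer register ids

// Return true if the zero-terminated list CSR differs from the cached copy.
bool csrChanged(const MCPhysReg *CSR, const std::vector<MCPhysReg> &Cached) {
  size_t I = 0;
  for (; CSR[I] != 0; ++I)
    if (I >= Cached.size() || CSR[I] != Cached[I])
      return true; // longer than the cache, or an entry differs
  return I != Cached.size(); // true if shorter than the cache
}
```

The length check on both sides is what catches a list that matches the cache on every shared entry but has one more register at the end.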

Differential Revision: https://reviews.llvm.org/D132606