llvm-project.git/llvm/lib/Target/AMDGPU/AMDGPUResourceUsageAnalysis.cpp, branch main

[AMDGPU] Remove calling conv check on entry function (#162080)

2025-11-10T21:27:55+00:00

It is undefined behavior to call a function with a mismatched calling
convention. Rather than crashing on such input, the compiler should
still compile it.

This LLVM defect was identified via the AMD Fuzzing project.

[AMDGPU] upstream barrier count reporting part1 (#154409)

2025-08-19T23:42:31+00:00

[AMDGPU] Ignore inactive VGPRs in .vgpr_count (#149052)

2025-08-13T08:47:00+00:00

When using the `amdgcn.init.whole.wave` intrinsic, we add dummy VGPR
arguments with the purpose of preserving their inactive lanes. The
pattern may look something like this:

```
entry:
  call amdgcn.init.whole.wave
  branch to shader or tail

shader:
  $vInactive = IMPLICIT_DEF ; Tells regalloc it's safe to use the active lanes
  actual code...

tail:
  call amdgcn.cs.chain [...], implicit $vInactive
```

We should not report these VGPRs in the `.vgpr_count` metadata. This
patch achieves that goal by ignoring meta instructions and calls. This should
be safe since if those registers are actually used in any other context,
they will be counted there. The same reasoning applies in the general
case, so we don't explicitly check for the existence of `init.whole.wave`.
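
The counting rule described above can be illustrated with a minimal,
self-contained sketch. It is not the code from this patch: `ToyInstr`
and `countReportedVGPRs` are hypothetical stand-ins for the real
MachineInstr-based logic, and registers are modelled as plain indices.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical, simplified stand-in for a machine instruction.
struct ToyInstr {
  bool IsMeta;                 // e.g. IMPLICIT_DEF
  bool IsCall;                 // e.g. the chain call in the tail block
  std::vector<unsigned> VGPRs; // VGPR indices referenced by the operands
};

// Returns one past the highest VGPR index referenced by any instruction
// that is neither a meta instruction nor a call (0 if there is none).
unsigned countReportedVGPRs(const std::vector<ToyInstr> &Instrs) {
  unsigned Count = 0;
  for (const ToyInstr &MI : Instrs) {
    if (MI.IsMeta || MI.IsCall)
      continue; // Operands of these instructions are not counted.
    for (unsigned Reg : MI.VGPRs)
      Count = std::max(Count, Reg + 1);
  }
  return Count;
}
```

In this model, a function whose only references to v8 are an
IMPLICIT_DEF and a call operand, and whose real code touches v0 and v1,
reports a count of 2.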

This is a reworked version of #133242, which was reverted in #144039
and split into smaller bits.

[AMDGPU] Use SIRegisterInfo to compute used registers. NFCI (#149051)

2025-07-18T07:22:37+00:00

Simplify the code in AMDGPUResourceUsageAnalysis to rely more
on the TargetRegisterInfo for computing the number of used SGPRs and
AGPRs. This is a preliminary refactoring split out from #144855.

(While we could technically use TRI to compute the used number of VGPRs
 at this point too, I'm leaving some of the original code in since for
 VGPRs we're going to introduce some special cases).

[AMDGPU][NewPM] Port "AMDGPUResourceUsageAnalysis" to NPM (#130959)

2025-07-10T08:05:43+00:00

Revert "[AMDGPU] Skip register uses in AMDGPUResourceUsageAnalysis (#133242)" (#144039)

2025-06-13T10:48:24+00:00

This reverts commit 130080fab11cde5efcb338b77f5c3b31097df6e6 because it
causes issues in testcases similar to coalescer_remat.ll [1], i.e. when
we use a VGPR tuple but only write to its lower parts. The high VGPRs
would then not be included in the vgpr_count, and accessing them would
be an out of bounds violation.

[1]
https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/AMDGPU/coalescer_remat.ll
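
The failure mode described above can be sketched with a toy model. This
is an illustration under stated assumptions, not the reverted code:
`ToyOperand` and `countVGPRs` are hypothetical names, and a register
tuple is modelled as a contiguous range of VGPR indices.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical stand-in for a register operand covering the VGPRs
// [First, First + NumRegs).
struct ToyOperand {
  unsigned First;
  unsigned NumRegs;
  bool IsDef; // true for a def, false for a use
};

// Returns one past the highest VGPR index covered by the counted
// operands. With DefsOnly set, uses are skipped, as in #133242.
unsigned countVGPRs(const std::vector<ToyOperand> &Ops, bool DefsOnly) {
  unsigned Count = 0;
  for (const ToyOperand &Op : Ops) {
    if (DefsOnly && !Op.IsDef)
      continue;
    Count = std::max(Count, Op.First + Op.NumRegs);
  }
  return Count;
}
```

If only the low half of a four-register tuple is written (a def
covering v0-v1) and the whole tuple is then read (a use covering
v0-v3), counting defs only yields 2 while the use reaches v3, so the
reported count is too small by two registers.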

[AMDGPU] Skip register uses in AMDGPUResourceUsageAnalysis (#133242)

2025-06-03T09:20:48+00:00

Don't count register uses when determining the maximum number of
registers used by a function. Count only the defs. This is really an
underestimate of the true register usage, but in practice that's not
a problem because if a function uses a register, then it has either
defined it earlier, or some other function that executed before has
defined it.

In particular, the register counts are used:
1. When launching an entry function - in which case we're safe because
   the register counts of the entry function will include the register
   counts of all callees.
2. At function boundaries in dynamic VGPR mode. In this case it's safe
   because whenever we set the new VGPR allocation we take into account
   the outgoing_vgpr_count set by the middle-end.

The main advantage of doing this is that the artificial VGPR arguments
used only for preserving the inactive lanes when using the
llvm.amdgcn.init.whole.wave intrinsic are no longer counted. This
enables us to allocate only the registers we need in dynamic VGPR mode.

---------

Co-authored-by: Thomas Symalla <5754458+tsymalla@users.noreply.github.com>

[AMDGPU] Remove unused includes (NFC) (#116154)

2024-11-14T05:10:03+00:00

Identified with misc-include-cleaner.

[AMDGPU] Factor out getNumUsedPhysRegs(). NFC. (#112624)

2024-10-17T07:47:19+00:00

I will need it from one more place.

[AMDGPU] Convert AMDGPUResourceUsageAnalysis pass from Module to MF pass (#102913)

2024-09-30T10:43:34+00:00

Converts the AMDGPUResourceUsageAnalysis pass from a Module pass to a
MachineFunction pass. Moves function resource info propagation to the
MC layer (through helpers in AMDGPUMCResourceInfo) by generating
MCExprs for every function resource, which the emitters have been
prepared to handle.
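
As a rough illustration of deferring propagation until every function's
local usage is known, the sketch below resolves a per-function register
count as the maximum of its own usage and that of its callees. It is a
toy model and not the MCExpr machinery: `ToyResourceExpr` and
`evaluate` are hypothetical names, and the call graph is assumed to be
acyclic.

```cpp
#include <algorithm>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a deferred per-function resource expression:
// the function's own usage plus the names of the functions it calls.
struct ToyResourceExpr {
  unsigned Local;
  std::vector<std::string> Callees;
};

// Resolves the usage of Name as max(own usage, usage of every callee).
// Assumes every callee is present in Table and that there is no recursion.
unsigned evaluate(const std::string &Name,
                  const std::map<std::string, ToyResourceExpr> &Table) {
  const ToyResourceExpr &E = Table.at(Name);
  unsigned Result = E.Local;
  for (const std::string &Callee : E.Callees)
    Result = std::max(Result, evaluate(Callee, Table));
  return Result;
}
```

Each function only records its own numbers and its callees; the final
value is computed once the whole table is available, which is the
property that lets the analysis itself run per function.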

Fixes https://github.com/llvm/llvm-project/issues/64863