llvm-project.git/llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp, branch main

[AMDGPU] update LDS block size for gfx1250 (#167614)

2025-11-18T00:03:47+00:00

LDS block size should be 2048 bytes (512 dwords) based on current spec.

[AMDGPU] Fix layering violations in AMDGPUMCExpr.cpp. NFC (#168242)

2025-11-17T17:13:55+00:00

AMDGPUMCExpr lives in the MC layer it should not depend on Function.h or
GCNSubtarget.h

Move the function that needed GCNSubtarget to the one file that called
it.

Revert "[AMDGPU][gfx1250] Add `cu-store` subtarget feature (#150588)" (#157639)

2025-09-10T08:20:59+00:00

This reverts commit be17791f2624f22b3ed24a2539406164a379125d.

This is not necessary for gfx1250 anymore.

[AMDGPU] Fix hw stage metadata setting for unsigned values (#154502)

2025-09-02T08:42:11+00:00

[AMDGPU] Set GRANULATED_WAVEFRONT_SGPR_COUNT of compute_pgm_rsrc1 to 0 for gfx10+ (#154666)

2025-08-27T01:48:42+00:00

According to `llvm-project/llvm/docs/AMDGPUUsage.rst::L5212` the
`GRANULATED_WAVEFRONT_SGPR_COUNT`, which is `compute_pgm_rsrc1[6:9]` has
to be 0 for gfx10+ arch

---------

Co-authored-by: Matt Arsenault

[AMDGPU] Do not assert on non-zero COMPUTE_PGM_RSRC3 on gfx1250. NFCI (#155498)

2025-08-26T21:42:48+00:00

COMPUTE_PGM_RSRC3 does exist on gfx1250, we are just not using it yet.

[AMDGPU] report named barrier cnt part2 (#154588)

2025-08-20T19:00:45+00:00

[AMDGPU] Remove an unnecessary cast (NFC) (#154470)

2025-08-20T05:45:12+00:00

getAddressableLocalMemorySize() already returns unsigned.

[AMDGPU] upstream barrier count reporting part1 (#154409)

2025-08-19T23:42:31+00:00

AMDGPU gfx12: Add _dvgpr$ symbols for dynamic VGPRs (#148251)

2025-08-15T15:33:06+00:00

For each function with the AMDGPU_CS_Chain calling convention, with
dynamic VGPRs enabled, add a _dvgpr$ symbol, with the value of the
function symbol, plus an offset encoding one less than the number of
VGPR blocks used by the function (16 VGPRs per block, no more than 128)
in bits 5..3 of the symbol value. This is used by a front-end to have
functions that are chained rather than called, and a dispatcher that
dynamically resizes the VGPR count before dispatching to a function.