<feed xmlns='http://www.w3.org/2005/Atom'>
<title>llvm-project.git/llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp, branch main</title>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/'/>
<entry>
<title>[AMDGPU] update LDS block size for gfx1250 (#167614)</title>
<updated>2025-11-18T00:03:47+00:00</updated>
<author>
<name>Changpeng Fang</name>
<email>changpeng.fang@amd.com</email>
</author>
<published>2025-11-18T00:03:47+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=5f38ae4a77d42a0165070a8ad9b6ea2dbbfdfb51'/>
<id>5f38ae4a77d42a0165070a8ad9b6ea2dbbfdfb51</id>
<content type='text'>
LDS block size should be 2048 bytes (512 dwords) based on current spec.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
LDS block size should be 2048 bytes (512 dwords) based on current spec.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Fix layering violations in AMDGPUMCExpr.cpp. NFC (#168242)</title>
<updated>2025-11-17T17:13:55+00:00</updated>
<author>
<name>Craig Topper</name>
<email>craig.topper@sifive.com</email>
</author>
<published>2025-11-17T17:13:55+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=72b02c7b376f211a6fffd5524e5db4c006ec6704'/>
<id>72b02c7b376f211a6fffd5524e5db4c006ec6704</id>
<content type='text'>
AMDGPUMCExpr lives in the MC layer, so it should not depend on Function.h
or GCNSubtarget.h.

Move the function that needed GCNSubtarget to the one file that called
it.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
AMDGPUMCExpr lives in the MC layer, so it should not depend on Function.h
or GCNSubtarget.h.

Move the function that needed GCNSubtarget to the one file that called
it.</pre>
</div>
</content>
</entry>
<entry>
<title>Revert "[AMDGPU][gfx1250] Add `cu-store` subtarget feature (#150588)" (#157639)</title>
<updated>2025-09-10T08:20:59+00:00</updated>
<author>
<name>Pierre van Houtryve</name>
<email>pierre.vanhoutryve@amd.com</email>
</author>
<published>2025-09-10T08:20:59+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=dcaa29c8ed6cc3b62368cdd609f2d05a25541366'/>
<id>dcaa29c8ed6cc3b62368cdd609f2d05a25541366</id>
<content type='text'>
This reverts commit be17791f2624f22b3ed24a2539406164a379125d.

This is not necessary for gfx1250 anymore.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This reverts commit be17791f2624f22b3ed24a2539406164a379125d.

This is not necessary for gfx1250 anymore.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Fix hw stage metadata setting for unsigned values (#154502)</title>
<updated>2025-09-02T08:42:11+00:00</updated>
<author>
<name>Ana Mihajlovic</name>
<email>Ana.Mihajlovic@amd.com</email>
</author>
<published>2025-09-02T08:42:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=c4885849adf0addf8c154bfcaf143d959ffda961'/>
<id>c4885849adf0addf8c154bfcaf143d959ffda961</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Set GRANULATED_WAVEFRONT_SGPR_COUNT of compute_pgm_rsrc1 to 0 for gfx10+ (#154666)</title>
<updated>2025-08-27T01:48:42+00:00</updated>
<author>
<name>Shoreshen</name>
<email>372660931@qq.com</email>
</author>
<published>2025-08-27T01:48:42+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=7fff93db50bad40548fa08895844629f1d40e715'/>
<id>7fff93db50bad40548fa08895844629f1d40e715</id>
<content type='text'>
According to `llvm-project/llvm/docs/AMDGPUUsage.rst::L5212`,
`GRANULATED_WAVEFRONT_SGPR_COUNT`, which is `compute_pgm_rsrc1[6:9]`, has
to be 0 for gfx10+ architectures.

---------

Co-authored-by: Matt Arsenault &lt;Matthew.Arsenault@amd.com&gt;</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
According to `llvm-project/llvm/docs/AMDGPUUsage.rst::L5212`,
`GRANULATED_WAVEFRONT_SGPR_COUNT`, which is `compute_pgm_rsrc1[6:9]`, has
to be 0 for gfx10+ architectures.

---------

Co-authored-by: Matt Arsenault &lt;Matthew.Arsenault@amd.com&gt;</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Do not assert on non-zero COMPUTE_PGM_RSRC3 on gfx1250. NFCI (#155498)</title>
<updated>2025-08-26T21:42:48+00:00</updated>
<author>
<name>Stanislav Mekhanoshin</name>
<email>Stanislav.Mekhanoshin@amd.com</email>
</author>
<published>2025-08-26T21:42:48+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=6bccd967b117176192b821c178a55ea31192b45a'/>
<id>6bccd967b117176192b821c178a55ea31192b45a</id>
<content type='text'>
COMPUTE_PGM_RSRC3 does exist on gfx1250; we are just not using it yet.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
COMPUTE_PGM_RSRC3 does exist on gfx1250; we are just not using it yet.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] report named barrier cnt part2 (#154588)</title>
<updated>2025-08-20T19:00:45+00:00</updated>
<author>
<name>Gang Chen</name>
<email>gangc@amd.com</email>
</author>
<published>2025-08-20T19:00:45+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=60dbde69cd219e55bfab5581fca56e9106fe108f'/>
<id>60dbde69cd219e55bfab5581fca56e9106fe108f</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Remove an unnecessary cast (NFC) (#154470)</title>
<updated>2025-08-20T05:45:12+00:00</updated>
<author>
<name>Kazu Hirata</name>
<email>kazu@google.com</email>
</author>
<published>2025-08-20T05:45:12+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=fab0860685457ab8cada072f571dfed2f9bd05b1'/>
<id>fab0860685457ab8cada072f571dfed2f9bd05b1</id>
<content type='text'>
getAddressableLocalMemorySize() already returns unsigned.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
getAddressableLocalMemorySize() already returns unsigned.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] upstream barrier count reporting part1 (#154409)</title>
<updated>2025-08-19T23:42:31+00:00</updated>
<author>
<name>Gang Chen</name>
<email>gangc@amd.com</email>
</author>
<published>2025-08-19T23:42:31+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=ef68d1587df199b7503d243bdac703748044c99a'/>
<id>ef68d1587df199b7503d243bdac703748044c99a</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>AMDGPU gfx12: Add _dvgpr$ symbols for dynamic VGPRs (#148251)</title>
<updated>2025-08-15T15:33:06+00:00</updated>
<author>
<name>Tim Renouf</name>
<email>tim.renouf@amd.com</email>
</author>
<published>2025-08-15T15:33:06+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=f279c47cb3e7191a22703b837e006eb7dd591de7'/>
<id>f279c47cb3e7191a22703b837e006eb7dd591de7</id>
<content type='text'>
For each function with the AMDGPU_CS_Chain calling convention, with
dynamic VGPRs enabled, add a _dvgpr$ symbol, with the value of the
function symbol, plus an offset encoding one less than the number of
VGPR blocks used by the function (16 VGPRs per block, no more than 128)
in bits 5..3 of the symbol value. This is used by a front-end to have
functions that are chained rather than called, and a dispatcher that
dynamically resizes the VGPR count before dispatching to a function.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
For each function with the AMDGPU_CS_Chain calling convention, with
dynamic VGPRs enabled, add a _dvgpr$ symbol, with the value of the
function symbol, plus an offset encoding one less than the number of
VGPR blocks used by the function (16 VGPRs per block, no more than 128)
in bits 5..3 of the symbol value. This is used by a front-end to have
functions that are chained rather than called, and a dispatcher that
dynamically resizes the VGPR count before dispatching to a function.</pre>
</div>
</content>
</entry>
</feed>
