<feed xmlns='http://www.w3.org/2005/Atom'>
<title>llvm-project.git/llvm/lib/Target/AMDGPU/SIModeRegister.cpp, branch main</title>
<subtitle>Unnamed repository; edit this file 'description' to name the repository.
</subtitle>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/'/>
<entry>
<title>[AMDGPU][NPM] Port SIModeRegister to NPM (#129014)</title>
<updated>2025-03-04T05:21:03+00:00</updated>
<author>
<name>Akshat Oke</name>
<email>Akshat.Oke@amd.com</email>
</author>
<published>2025-03-04T05:21:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=6c87ec4f4d083a85ebcfbbda166ad4ba41d5da8d'/>
<id>6c87ec4f4d083a85ebcfbbda166ad4ba41d5da8d</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU][True16][CodeGen] true16 codegen pat for fptrunc_round (#124044)</title>
<updated>2025-01-30T23:31:52+00:00</updated>
<author>
<name>Brox Chen</name>
<email>guochen2@amd.com</email>
</author>
<published>2025-01-30T23:31:52+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=a51798e3d68b9e04660757ff6e645c8ba46baaa9'/>
<id>a51798e3d68b9e04660757ff6e645c8ba46baaa9</id>
<content type='text'>
true16 codegen pattern for fptrunc_round f32 to f16.

For mir test, split to preGFX11 and postGFX11. and add a true16 and a
fake16 test accordingly</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
true16 codegen pattern for fptrunc_round f32 to f16.

For mir test, split to preGFX11 and postGFX11. and add a true16 and a
fake16 test accordingly</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU][True16][MC] 16bit vsrc and vdst support in MC (#104510)</title>
<updated>2024-09-11T14:48:11+00:00</updated>
<author>
<name>Brox Chen</name>
<email>guochen2@amd.com</email>
</author>
<published>2024-09-11T14:48:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=35e27c0ee51f2822415c050c1cc4a73dfaa171d7'/>
<id>35e27c0ee51f2822415c050c1cc4a73dfaa171d7</id>
<content type='text'>
This is a large patch includes the MC level support for V_CVT_F16_F32,
V_CVT_F32_F16 and V_LDEXP_F16 in true16 format.

This patch includes the asm/disasm changes to encode/decode the 16bit
vsrc, vdst and src modifieres for vop and dpp format. This patch is a
dependency for many 16 bit instructions while only three instructions
are updated to make it easier to review.

There will be another patch to support these three instructions in the
codeGen level, this patch just replaces these two instructions with its
fake16 format.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This is a large patch includes the MC level support for V_CVT_F16_F32,
V_CVT_F32_F16 and V_LDEXP_F16 in true16 format.

This patch includes the asm/disasm changes to encode/decode the 16bit
vsrc, vdst and src modifieres for vop and dpp format. This patch is a
dependency for many 16 bit instructions while only three instructions
are updated to make it easier to review.

There will be another patch to support these three instructions in the
codeGen level, this patch just replaces these two instructions with its
fake16 format.</pre>
</div>
</content>
</entry>
<entry>
<title>AMDGPU: Add f64 to f32 support for llvm.fptrunc.round (#107481)</title>
<updated>2024-09-06T05:57:27+00:00</updated>
<author>
<name>Changpeng Fang</name>
<email>changpeng.fang@amd.com</email>
</author>
<published>2024-09-06T05:57:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=24267a7e14b35f41ab55e15ba12bb80c82881941'/>
<id>24267a7e14b35f41ab55e15ba12bb80c82881941</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>AMDGPU: Add tonearest and towardzero roundings for intrinsic llvm.fptrunc.round (#104486)</title>
<updated>2024-08-17T18:22:47+00:00</updated>
<author>
<name>Changpeng Fang</name>
<email>changpeng.fang@amd.com</email>
</author>
<published>2024-08-17T18:22:47+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=16929219b056fdbf9e0c93092904f2c5cac8a66f'/>
<id>16929219b056fdbf9e0c93092904f2c5cac8a66f</id>
<content type='text'>
This work simplifies and generalizes the instruction definition for
intrinsic llvm.fptrunc.round. We no longer name the instruction with the
rounding mode. Instead, we introduce an immediate operand for the
rounding mode for the pseudo instruction. This immediate will be used to
set up the hardware mode register at the time the real instruction is
generated. We name the pseudo instruction as FPTRUNC_ROUND_F16_F32 (for
f32 -&gt; f16), which is easy to generalize for other types.

"round.towardzero" and "round.tonearest" are added for f32 -&gt; f16
truncating, in addition to the existing "round.upward" and
"round.downward". Other rounding modes are not supported by hardware at
this moment.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This work simplifies and generalizes the instruction definition for
intrinsic llvm.fptrunc.round. We no longer name the instruction with the
rounding mode. Instead, we introduce an immediate operand for the
rounding mode for the pseudo instruction. This immediate will be used to
set up the hardware mode register at the time the real instruction is
generated. We name the pseudo instruction as FPTRUNC_ROUND_F16_F32 (for
f32 -&gt; f16), which is easy to generalize for other types.

"round.towardzero" and "round.tonearest" are added for f32 -&gt; f16
truncating, in addition to the existing "round.upward" and
"round.downward". Other rounding modes are not supported by hardware at
this moment.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Fix mode register pass for constrained FP operations (#90085)</title>
<updated>2024-05-03T17:47:15+00:00</updated>
<author>
<name>Abhinav Garg</name>
<email>39309352+abhigargrepo@users.noreply.github.com</email>
</author>
<published>2024-05-03T17:47:15+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=76508dce4380e0cea2ecb396200a161f7dbefd0b'/>
<id>76508dce4380e0cea2ecb396200a161f7dbefd0b</id>
<content type='text'>
This PR will fix the si-mode-register pass which is inserting an extra
setreg instruction in case of constrained FP operations. This pass will
be ignored for strictfp functions.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This PR will fix the si-mode-register pass which is inserting an extra
setreg instruction in case of constrained FP operations. This pass will
be ignored for strictfp functions.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU][NFC] Have helpers to deal with encoding fields. (#82772)</title>
<updated>2024-02-23T17:34:55+00:00</updated>
<author>
<name>Ivan Kosarev</name>
<email>ivan.kosarev@amd.com</email>
</author>
<published>2024-02-23T17:34:55+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=dfa1d9b027e677cf1379dffee0059261a34f3481'/>
<id>dfa1d9b027e677cf1379dffee0059261a34f3481</id>
<content type='text'>
These are hoped to provide more convenient and less error prone
facilities to encode and decode fields than manually defined constants
and functions.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
These are hoped to provide more convenient and less error prone
facilities to encode and decode fields than manually defined constants
and functions.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Reapply 'Sign extend simm16 in setreg intrinsic' (#78492)</title>
<updated>2024-01-18T01:23:46+00:00</updated>
<author>
<name>Stanislav Mekhanoshin</name>
<email>rampitec@users.noreply.github.com</email>
</author>
<published>2024-01-18T01:23:46+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=558ea411599a42d2a15dd6a878700cf62a8b36e7'/>
<id>558ea411599a42d2a15dd6a878700cf62a8b36e7</id>
<content type='text'>
We currently force users to use a negative contant in the intrinsic
call. Changing it zext would break existing programs, so just sign
extend an argument.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
We currently force users to use a negative contant in the intrinsic
call. Changing it zext would break existing programs, so just sign
extend an argument.</pre>
</div>
</content>
</entry>
<entry>
<title>[AMDGPU] Modernize Status and BlockData (NFC)</title>
<updated>2023-04-16T20:03:02+00:00</updated>
<author>
<name>Kazu Hirata</name>
<email>kazu@google.com</email>
</author>
<published>2023-04-16T20:03:02+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=bfaf0e78582ee4f1a2a1df0b3d52ce19a57b8c76'/>
<id>bfaf0e78582ee4f1a2a1df0b3d52ce19a57b8c76</id>
<content type='text'>
Identified with modernize-use-default-member-init.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Identified with modernize-use-default-member-init.
</pre>
</div>
</content>
</entry>
<entry>
<title>[Target] Use llvm::count{l,r}_{zero,one} (NFC)</title>
<updated>2023-01-28T17:23:07+00:00</updated>
<author>
<name>Kazu Hirata</name>
<email>kazu@google.com</email>
</author>
<published>2023-01-28T17:23:07+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=e0782018352fc4d7e104e82edf380d895d4abdd5'/>
<id>e0782018352fc4d7e104e82edf380d895d4abdd5</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
</feed>
