<feed xmlns='http://www.w3.org/2005/Atom'>
<title>llvm-project.git/mlir/lib/Dialect/AMDGPU/IR/AMDGPUDialect.cpp, branch main</title>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/'/>
<entry>
<title>[mlir][amdgpu] Add lowerings for ScaledExtPacked816 (#168123)</title>
<updated>2025-11-17T21:51:52+00:00</updated>
<author>
<name>Erick Ochoa Lopez</name>
<email>erick.ochoalopez@amd.com</email>
</author>
<published>2025-11-17T21:51:52+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=909c9aacead077b14e2bff123d09641d08939fe5'/>
<id>909c9aacead077b14e2bff123d09641d08939fe5</id>
<content type='text'>
* Adds lowerings for amdgpu.scaled_ext_packed816
* Updates verifiers</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
* Adds lowerings for amdgpu.scaled_ext_packed816
* Updates verifiers</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu] Fix documentation and verifiers (#167369)</title>
<updated>2025-11-17T13:34:21+00:00</updated>
<author>
<name>Erick Ochoa Lopez</name>
<email>erick.ochoalopez@amd.com</email>
</author>
<published>2025-11-17T13:34:21+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=e468ea3f40415c48281755c68548ba49480a2259'/>
<id>e468ea3f40415c48281755c68548ba49480a2259</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu][rocdl] Add gfx1250 wmma ops (#165064)</title>
<updated>2025-10-28T16:42:39+00:00</updated>
<author>
<name>Jakub Kuderski</name>
<email>jakub@nod-labs.com</email>
</author>
<published>2025-10-28T16:42:39+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=466c5267141a221b5919b701e26d8b50d776bc55'/>
<id>466c5267141a221b5919b701e26d8b50d776bc55</id>
<content type='text'>
Update `amdgpu.wmma` op definition and implement amdgpu to rocdl
conversion for new variants.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Update `amdgpu.wmma` op definition and implement amdgpu to rocdl
conversion for new variants.</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu] Update mfma assembly format with intrinsic shape (#165037)</title>
<updated>2025-10-25T09:58:43+00:00</updated>
<author>
<name>Jakub Kuderski</name>
<email>jakub@nod-labs.com</email>
</author>
<published>2025-10-25T09:58:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=f248010a5233e726f6ab1767c09cd582057a6413'/>
<id>f248010a5233e726f6ab1767c09cd582057a6413</id>
<content type='text'>
Use the same format as introduced for wmma by
https://github.com/llvm/llvm-project/pull/164920.

Also make `blocks` default to 1.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Use the same format as introduced for wmma by
https://github.com/llvm/llvm-project/pull/164920.

Also make `blocks` default to 1.</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu] Add explicit intrinsic shape to wmma (#164920)</title>
<updated>2025-10-24T16:21:33+00:00</updated>
<author>
<name>Jakub Kuderski</name>
<email>jakub@nod-labs.com</email>
</author>
<published>2025-10-24T16:21:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=dc5f2745604d4c5a003e909574b531662b372355'/>
<id>dc5f2745604d4c5a003e909574b531662b372355</id>
<content type='text'>
This is in preparation for adding support for gfx1250 wmma intrinsics
that include many more possible shapes.

Instead of guessing the wave32/wave64 mode based on element types and
vector sizes, require the intrinsic shapes to be set explicitly as
attributes.</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This is in preparation for adding support for gfx1250 wmma intrinsics
that include many more possible shapes.

Instead of guessing the wave32/wave64 mode based on element types and
vector sizes, require the intrinsic shapes to be set explicitly as
attributes.</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu] Add scaled_ext_packed{8,16} operations (#159830)</title>
<updated>2025-10-17T16:58:03+00:00</updated>
<author>
<name>Erick Ochoa Lopez</name>
<email>erick.ochoalopez@amd.com</email>
</author>
<published>2025-10-17T16:58:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=a76c71b205db9e28ffe33432caef5efbea7f088f'/>
<id>a76c71b205db9e28ffe33432caef5efbea7f088f</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][AMDGPU] Replace use of SmallVector with ArrayRef, NFC (#163770)</title>
<updated>2025-10-16T14:41:22+00:00</updated>
<author>
<name>Muzammil</name>
<email>55665739+Muzammiluddin-Syed-ECE@users.noreply.github.com</email>
</author>
<published>2025-10-16T14:41:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=5a6756d2a0450cca1efd6688ea4d7e1bad1a32c9'/>
<id>5a6756d2a0450cca1efd6688ea4d7e1bad1a32c9</id>
<content type='text'>
Use ArrayRef instead of SmallVector where appropriate
(https://llvm.org/docs/ProgrammersManual.html#llvm-adt-arrayref-h). Also infer template types when possible.
Follow-up to https://github.com/llvm/llvm-project/pull/155951.

---------

Signed-off-by: Muzammiluddin Syed &lt;muzasyed@amd.com&gt;</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Use ArrayRef instead of SmallVector where appropriate
(https://llvm.org/docs/ProgrammersManual.html#llvm-adt-arrayref-h). Also infer template types when possible.
Follow-up to https://github.com/llvm/llvm-project/pull/155951.

---------

Signed-off-by: Muzammiluddin Syed &lt;muzasyed@amd.com&gt;</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][amdgpu] Add Inliner interface (#162873)</title>
<updated>2025-10-10T18:34:00+00:00</updated>
<author>
<name>Ivan Butygin</name>
<email>ivan.butygin@gmail.com</email>
</author>
<published>2025-10-10T18:34:00+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=6ad662d32294006dc1d71f70cee8eabdf9b0c415'/>
<id>6ad662d32294006dc1d71f70cee8eabdf9b0c415</id>
<content type='text'>
All the `amdgpu` dialect ops can be inlined.

---------

Signed-off-by: Ivan Butygin &lt;ivan.butygin@gmail.com&gt;</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
All the `amdgpu` dialect ops can be inlined.

---------

Signed-off-by: Ivan Butygin &lt;ivan.butygin@gmail.com&gt;</pre>
</div>
</content>
</entry>
<entry>
<title>[mlir][AMDGPU] Add canonicalization pattern to pack scales for ScaledMFMAOp (#155951)</title>
<updated>2025-09-18T19:25:14+00:00</updated>
<author>
<name>Muzammil</name>
<email>55665739+Muzammiluddin-Syed-ECE@users.noreply.github.com</email>
</author>
<published>2025-09-18T19:25:14+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=9628061e055c9f695ff80f9a74e4f6e524b34993'/>
<id>9628061e055c9f695ff80f9a74e4f6e524b34993</id>
<content type='text'>
The ScaledMFMAOp accepts scales as a vector of 4 bytes
(`vector&lt;4xf8E8M0FNU&gt;`) that can be stored in a single register with a
particular scale accessed using the `OpSel` attribute. Currently, we
only use one byte in this 4-byte vector, resulting in 3 wasted
registers.

This is fixed by identifying single-byte extractions and rewriting them
into extractions of 4-byte vectors.

Example:
```
  %unit = vector.extract %ScaleSrc[offsets] : f8E8M0FNU from vector&lt;?x?x?xf8E8M0FNU&gt;
  %scale = vector.insert %unit, ... : f8E8M0FNU into vector&lt;4xf8E8M0FNU&gt;
  amdgpu.scaled_mfma(%scale[0] * ...
```
to
```
  %reshaped = vector.shape_cast %ScaleSrc : vector&lt;?x?x?xf8E8M0FNU&gt; to vector&lt;?x4xf8E8M0FNU&gt;
  %scale = vector.extract %reshaped[?] : vector&lt;4xf8E8M0FNU&gt; from vector&lt;?x4xf8E8M0FNU&gt;
  amdgpu.scaled_mfma(%scale[0-3] * ...
```

---------

Signed-off-by: Muzammiluddin Syed &lt;muzasyed@amd.com&gt;</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
The ScaledMFMAOp accepts scales as a vector of 4 bytes
(`vector&lt;4xf8E8M0FNU&gt;`) that can be stored in a single register with a
particular scale accessed using the `OpSel` attribute. Currently, we
only use one byte in this 4-byte vector, resulting in 3 wasted
registers.

This is fixed by identifying single-byte extractions and rewriting them
into extractions of 4-byte vectors.

Example:
```
  %unit = vector.extract %ScaleSrc[offsets] : f8E8M0FNU from vector&lt;?x?x?xf8E8M0FNU&gt;
  %scale = vector.insert %unit, ... : f8E8M0FNU into vector&lt;4xf8E8M0FNU&gt;
  amdgpu.scaled_mfma(%scale[0] * ...
```
to
```
  %reshaped = vector.shape_cast %ScaleSrc : vector&lt;?x?x?xf8E8M0FNU&gt; to vector&lt;?x4xf8E8M0FNU&gt;
  %scale = vector.extract %reshaped[?] : vector&lt;4xf8E8M0FNU&gt; from vector&lt;?x4xf8E8M0FNU&gt;
  amdgpu.scaled_mfma(%scale[0-3] * ...
```

---------

Signed-off-by: Muzammiluddin Syed &lt;muzasyed@amd.com&gt;</pre>
</div>
</content>
</entry>
<entry>
<title>[MLIR] Apply clang-tidy fixes for readability-identifier-naming in AMDGPUDialect.cpp (NFC)</title>
<updated>2025-09-18T17:28:46+00:00</updated>
<author>
<name>Mehdi Amini</name>
<email>joker.eph@gmail.com</email>
</author>
<published>2025-08-21T13:19:34+00:00</published>
<link rel='alternate' type='text/html' href='https://git.belthelziquor.com/llvm-project.git/commit/?id=6ec08132ee413512c937fb09adaa3344355c1ecb'/>
<id>6ec08132ee413512c937fb09adaa3344355c1ecb</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
</feed>
