summaryrefslogtreecommitdiff
path: root/mlir/lib/Conversion/AMDGPUToROCDL
AgeCommit message (Expand)Author
2025-11-18[mlir][amdgpu] Sink op creation in scaled conversion intrinsics (NFC) (#168542)Erick Ochoa Lopez
2025-11-17[mlir][amdgpu] Add lowerings for ScaledExtPacked816 (#168123)Erick Ochoa Lopez
2025-11-13[mlir][ROCDL] Refactor wmma intrinsics to use attributes not operands where p...Muzammiluddin Syed
2025-10-30[mlir] Simplify Default cases in type switches. NFC. (#165767)Jakub Kuderski
2025-10-29[mlir][amdgpu][rocdl] Allow for graceful wmma conversion failures (#165616)Jakub Kuderski
2025-10-28[mlir][amdgpu][rocdl] Add gfx1250 wmma ops (#165064)Jakub Kuderski
2025-10-24[mlir][amdgpu] Add explicit intrinsic shape to wmma (#164920)Jakub Kuderski
2025-10-22[mlir] Switch uses of deprecated .create methods to free function. NFC. (#164...Jakub Kuderski
2025-09-24[AMDGPU] Add the support for 45-bit buffer resource (#159702)Shilei Tian
2025-09-23[mlir][AMDGPU] Use LDS-only MMRA fences for lds_barrier (#157919)Krzysztof Drewniak
2025-09-12[mlir][AMDGPU] Updated `PermlaneSwapOp` to select correct val (#157586)Gaurav Verma
2025-08-24[mlir][amdgpu] Promote gpu.shuffle to amdgpu.permlane_swap (#154933)Tim Gymnich
2025-08-21[mlir][AMDGPU] Add PermlaneSwapOp (#154345)Tim Gymnich
2025-07-25[mlir][NFC] update `mlir/Dialect` create APIs (27/n) (#150638)Maksim Levental
2025-07-25[mlir][amd] fix LLVM::InsertValueOp::create failure to disambiguate (#150605)Maksim Levental
2025-07-23[mlir][NFC] update `Conversion` create APIs (4/n) (#149879)Maksim Levental
2025-07-22[mlir][amdgpu] Add `rocdl.s.waitcnt` wrapper (#149670)Ivan Butygin
2025-07-09[AMDGPU] [MLIR] Add 96 and 128 bit GatherToLDS for gfx950 (#147496)Daniel Hernandez-Juarez
2025-06-25[AMDGPU] Adding AMDGPU dialect wrapper for ROCDL transpose loads. (#145395)Alan Li
2025-06-19Allow bf16 operands on new MFMAs (#144925)Umang Yadav
2025-06-13[MLIR][AMDGPU] Fix bug in GatherToLDSOpLowering, get the correct MemRefType f...Daniel Hernandez-Juarez
2025-06-13[mlir][AMDGPU] Add scaled floating point conversion ops (#141554)Tim Gymnich
2025-05-22[MLIR][LLVM] Tail call support for inline asm op (#140826)Bruno Cardoso Lopes
2025-05-21[mlir][ROCDL] Add fp4 and fp6 conversion intrinsics, fix fp8 immargs (#140801)Krzysztof Drewniak
2025-05-20Emit inbounds and nuw attributes in memref. (#138984)Peiyong Lin
2025-05-19[AMDGPU] Add a new amdgcn.load.to.lds intrinsic (#137425)Krzysztof Drewniak
2025-05-02[mlir][amdgpu] Define an amdgpu.scaling_mfma wrapper (#137498)Muzammil
2025-04-18[mlir] AMDGPUToROCDL: lower `amdgpu.swizzle_bitmode` (#136223)Ivan Butygin
2025-04-08[MLIR][AMDGPU] Add a wrapper for global LDS load intrinsics in AMDGPU (#133498)Alan Li
2025-04-01[mlir][AMDGPU] Add gfx950 MFMAs to the amdgpu.mfma op (#133553)Krzysztof Drewniak
2025-03-21[AMD][ROCDL] Add packed conversions fp8/bf8->bf16 and fp8/bf8->fp32 in ROCDL ...Yi Qian
2025-03-03[MLIR][AMDGPU] Add OCP FP8 support for new hardware (#127728)Mirza Halilčević
2025-02-27[mlir][AMDGPU] Add int4 intrinsics, mixed-type fp8 to handle gfx12 (#128963)Krzysztof Drewniak
2025-02-26[mlir][AMDGPU] Plumb address space 7 through MLIR, add address_space attr. (#...Krzysztof Drewniak
2025-02-23[mlir] AMDGPUToROCDL: handle 1-element vectors (#128266)Ivan Butygin
2025-02-19[MLIR][NFC] Use base alias for constructor inheritance (#127756)lorenzo chelini
2025-02-19[AMDGPU][MLIR] Replace gfx940 and gfx941 with gfx942 in MLIR (#125836)Fabian Ritter
2025-02-17[MLIR][NFC] Retire `let constructor` for passes in Conversion directory (part...lorenzo chelini
2025-02-06[mlir][ROCDL][~NFC] Migrate to LLVM dialect default builders (#125609)Krzysztof Drewniak
2025-01-21[mlir][IR][NFC] Move free-standing functions to `MemRefType` (#123465)Matthias Springer
2025-01-20[mlir][IR] Remove `isF...()` type API for low-precision FP types (#123326)Matthias Springer
2025-01-13[mlir][AMDGPU] Fix raw buffer ptr ops lowering (#122293)Fabian Mora
2024-12-20[mlir] AMDGPUToROCDL: RawBufferOpLowering fixes (#120642)Ivan Butygin
2024-10-31[mlir][AMDGPU] Support vector<2xbf16> packed atomic fadd (#113929)Krzysztof Drewniak
2024-10-07[MLIR] AMDGPUToROCDL: Use a bitcast op to reintepret a vector of i8 as single...Benoit Jacob
2024-10-05[mlir][NFC] Mark type converter in `populate...` functions as `const` (#111250)Matthias Springer
2024-09-20[mlir][AMDGPU] New gfx12 barrier instructions and update lowering LDSBarrierO...Daniel Hernandez-Juarez
2024-09-12[mlir][AMDGPU] Remove an old bf16 workaround (#108409)Krzysztof Drewniak
2024-09-12[mlir][AMDGPU] Enable emulating vector buffer_atomic_fadd on gfx11 (#108312)Krzysztof Drewniak
2024-09-11[mlir][AMDGPU] Support vector<2xf16> inputs to buffer atomic fadd (#108286)Krzysztof Drewniak