summaryrefslogtreecommitdiff
path: root/mlir/test/Dialect/GPU
AgeCommit message (Expand)Author
2025-11-21Revert "[MLIR][GPU] subgroup_mma fp64 extension" (#169049)Fabian Mora
2025-11-21[MLIR][GPU] subgroup_mma fp64 extension (#165873)Giacomo Castiglioni
2025-10-31[MLIR][GPU] Ensure all lanes in cluster have final reduction value (#165764)James Newling
2025-09-30[MLIR] Fix gpu.launch attribution argument printing (#161408)Mehdi Amini
2025-09-29Create function declaration in the proper module (#161281)Renaud Kauffmann
2025-09-25[mlir][gpu] Add innermost-first policy when mapping loops to GPU IDs (#160634)Georgios Pinitas
2025-09-08[mlir][gpu] Revert gpu.subgroup_broadcast with any_lane (#157373)Jakub Kuderski
2025-08-30[mlir][gpu] Add `subgroup_broadcast` op (#152808)Ivan Butygin
2025-08-24[mlir][amdgpu] Promote gpu.shuffle to amdgpu.permlane_swap (#154933)Tim Gymnich
2025-08-13[mlir][gpu] Support outlining nested `gpu.launch` (#152696)Longsheng Mou
2025-08-08[mlir][gpu] Update attribute definitions in `gpu::LaunchOp` (#152106)Longsheng Mou
2025-07-29[mlir][gpu] Make offset and width in gpu.rotate as attributes (#150901)Hsiangkai Wang
2025-07-07[mlir][SCF][GPU] Add DeviceMaskingAttrInterface (#146943)Nicolas Vasilache
2025-07-07[mlir][gpu][transforms] Add support for mapping to lanes (#146912)Nicolas Vasilache
2025-07-01[mlir][gpu] Add gpu.rotate operation (#142796)Hsiangkai Wang
2025-05-28[mlir][GPU] Fixes subgroup reduce lowering (#141825)Muzammil
2025-05-28[mlir][gpu] Add GPU subgroup MMA extract and insert operations (#139048)Hsiangkai Wang
2025-05-13[mlir][gpu] Pattern to promote `gpu.shuffle` to specialized AMDGPU ops (#137109)Ivan Butygin
2025-04-30[MLIR][GPU] Use arith instead of index for subgroup_id (#137843)Alan Li
2025-04-29[MLIR][GPU] Add a pattern to rewrite gpu.subgroup_id (#137671)Alan Li
2025-04-23[mlir][AMDGPU] Implement gpu.subgroup_reduce with DPP intrinsics on AMD GPUs ...Muzammil
2025-04-18[mlir] GPUToROCDL: Add support for non-i32/f32 shuffle types (#136320)Ivan Butygin
2025-04-13[mlir] GPUToROCDL: Fix crashes with unsupported shuffle datatypes (#135504)Ivan Butygin
2025-02-17[MLIR][NVVM] Add `ptxas-cmd-options` to pass flags to the downstream compiler...Guray Ozen
2025-02-11[mlir][target][nvvm] Perf by stage and store into properties (#126178)Zichen Lu
2025-02-06[mlir][LLVM] Switch `undef` for `poison` for uninitialized values (#125629)Krzysztof Drewniak
2025-02-04[mlir][gpu]add AffineScope Trait to gpu.launch. (#121058)lonely eagle
2025-01-17[mlir][affine]introducing new symbol rules that the result of a `Pure` operat...lonely eagle
2025-01-09[mlir][GPU] Implement ValueBoundsOpInterface for GPU ID operations (#122190)Krzysztof Drewniak
2025-01-08[MLIR][GPU] Fix gpu.printf (#121940)Guray Ozen
2024-12-16[mlir][gpu] Adding ELF section option to the gpu-module-to-binary pass (#119440)Renaud Kauffmann
2024-12-06[mlir][gpu] Add optional attributes of kernelModule and kernelFunc for outlin...Zhen Wang
2024-12-05[MLIR][GPU] Fix memref.dim folding with out-of-bound index (#118890)Mehdi Amini
2024-11-24[mlir] fix memory effects in GPU barrier elimination (#117432)Oleksandr "Alex" Zinenko
2024-11-22[MLIR] Move warp_execute_on_lane_0 from vector to gpu (#116994)Petr Kurapov
2024-10-31[MLIR][DLTI] Pretty parsing and printing for DLTI attrs (#113365)Rolf Morel
2024-10-26[mlir] [memref] add more checks to the memref.reinterpret_cast (#112669)donald chen
2024-09-09[Func][GPU] Use SymbolUserOpInterface in func::ConstantOp (#107748)Artem Kroviakov
2024-09-05[mlir][gpu] Add 'cluster_stride' attribute to gpu.subgroup_reduce (#107142)Andrea Faulds
2024-08-27[mlir][gpu] Add metadata attributes for storing kernel metadata in GPU object...Fabian Mora
2024-08-20[mlir][gpu] Add extra value types for gpu::ShuffleOp (#104605)Finlay
2024-08-20[mlir][gpu] Add 'cluster_size' attribute to gpu.subgroup_reduce (#104851)Andrea Faulds
2024-08-15[mlir][gpu] Fix typo in test filename (#104053)Andrea Faulds
2024-06-17[mlir][GPU] Improve handling of GPU bounds (#95166)Krzysztof Drewniak
2024-06-10[mlir][Arith] Generalize and improve -int-range-optimizations (#94712)Krzysztof Drewniak
2024-05-30[mlir][llvm] Implement ConstantLike for ZeroOp, UndefOp, PoisonOp (#93690)Guy David
2024-05-24[mlir] Replace MLIR_ENABLE_CUDA_CONVERSIONS with LLVM_HAS_NVPTX_TARGET (#93008)tyb0807
2024-05-24[mlir][test] Fix filecheck annotation typos (#92897)klensy
2024-05-14[mlir][test] Extend `InferIntRangeInterface` test Ops to arbitrary ints (#91850)Felix Schneider
2024-04-29[MLIR] Improve KernelOutlining to avoid introducing an extra block (#90359)Mehdi Amini