path: root/llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Age  Commit message  Author
2025-11-13  [AMDGPU][SIInsertWaitCnts] Gfx12.5 - Refactor xcnt optimization (#164357)  Ryan Mitchell
2025-11-12  [AMDGPU] Fix missing S_WAIT_XCNT with multiple pending VMEMs (#166779)  Jay Foad
2025-11-01  [AMDGPU] Fix Xcnt handling between blocks (#165201)  Aaditya
2025-10-13  [AMDGPU][NFC] Use `getScoreUB` for XCNT insertion. (#162448)  Aaditya
2025-10-03  [AMDGPU] Account for implicit XCNT insertion (#160812)  Aaditya
2025-10-01  [AMDGPU][SIInsertWaitCnts] Remove redundant TII/TRI/MRI arguments (NFC) (#161...  Pierre van Houtryve
2025-10-01  [AMDGPU][SIInsertWaitCnts] De-duplicate code (NFC) (#161161)  Pierre van Houtryve
2025-10-01  [AMDGPU][InsertWaitCnts] Refactor some helper functions, NFC (#161160)  Pierre van Houtryve
2025-09-26  [AMDGPU] Skip debug uses in SIInsertWaitcnts::shouldFlushVmCnt (#160818)  Jay Foad
2025-09-18  [AMDGPU][SIInsertWaitcnts] Track SCC. Insert KM_CNT waits for SCC writes. (#1...  Petar Avramovic
2025-09-17  [AMDGPU][CodeGen][True16] Track waitcnt as vgpr32 instead of vgpr16 for D16 I...  Brox Chen
2025-09-12  [AMDGPU] Remove scope check in SIInsertWaitcnts::generateWaitcntInstBefore (#...  choikwa
2025-09-03  [AMDGPU] Define 1024 VGPRs on gfx1250 (#156765)  Stanislav Mekhanoshin
2025-08-23  AMDGPU/GFX12: Do not wait unnecessarily before barriers (#154970)  Nicolai Hähnle
2025-08-22  [AMDGPU][NFC] Only include CodeGenPassBuilder.h where needed. (#154769)  Ivan Kosarev
2025-08-12  [AMDGPU] Add s_barrier_init|join|leave instructions (#153296)  Stanislav Mekhanoshin
2025-07-30  [AMDGPU] introduce S_WAITCNT_LDS_DIRECT in the memory legalizer (#150887)  Sameer Sahasrabuddhe
2025-07-28  [AMDGPU][gfx1250] Use SCOPE_SE for stores that may hit scratch (#150586)  Pierre van Houtryve
2025-07-24  [AMDGPU] gfx1250 vmem prefetch target intrinsics and builtins (#150466)  Stanislav Mekhanoshin
2025-07-21  [AMDGPU] ISel & PEI for whole wave functions (#145858)  Diana Picus
2025-07-17  [AMDGPU] Move common fields out of WaitcntBrackets. NFC. (#148864)  Jay Foad
2025-07-17  [AMDGPU] Move class WaitcntBrackets after class SIInsertWaitcnts. NFC.  Jay Foad
2025-07-02  [AMDGPU] Remove HasSampler variable. NFC. (#146682)  Jay Foad
2025-07-02  [AMDGPU] Use isImage. NFC. (#146677)  Jay Foad
2025-07-01  [AMDGPU][SIInsertWaitCnts] skip meta instructions early (#145720)  Sameer Sahasrabuddhe
2025-06-26  [AMDGPU][NFC] rename some constants for readability (#145870)  Sameer Sahasrabuddhe
2025-06-25  [AMDGPU] Simplify S_WAIT_XCNT insertion. NFC. (#145682)  Jay Foad
2025-06-25  [AMDGPU][SIInsertWaitcnts] don't crash when printing messages at end of block...  Sameer Sahasrabuddhe
2025-06-25  [AMDGPU][GFX1250] Insert S_WAIT_XCNT for SMEM and VMEM load-stores (#145566)  Christudasan Devadasan
2025-06-24  [AMDGPU] Replace dynamic VGPR feature with attribute (#133444)  Diana Picus
2025-06-20  [AMDGPU] Omit image waits in function prologue on gfx1250 (#145097)  Stanislav Mekhanoshin
2025-06-19  [NFC][AMDGPU] print more info when debugging SIInsertWaitcnts pass (#144629)  Sameer Sahasrabuddhe
2025-05-28  [AMDGPU] Treat image_msaa_load as a sampler operation (#141726)  Carl Ritson
2025-05-22  [llvm] Use *Map::try_emplace (NFC) (#141190)  Kazu Hirata
2025-05-07  [AMDGPU] Classify FLAT instructions as VMEM (#137148)  Robert Imschweiler
2025-04-22  [AMDGPU][InsertWaitCnts] Track global_wb/inv/wbinv (#135340)  Pierre van Houtryve
2025-04-10  [AMDGPU] Add support for point sample accel out of order returns (#127991)  David Stuttard
2025-04-08  [AMDGPU] Fix excessive stack usage in SIInsertWaitcnts::run (#134835)  Jay Foad
2025-04-08  [AMDGPU] Make a few WaitcntBrackets methods const. NFC. (#134824)  Jay Foad
2025-03-24  [AMDGPU] Relax lds dma waitcnt with no aliasing pair (#131842)  Austin Kerbow
2025-03-24  [AMDGPU][NPM] Port SIInsertWaitcnts to NPM (#130061)  Akshat Oke
2025-03-23  [AMDGPU] Avoid repeated hash lookups (NFC) (#132657)  Kazu Hirata
2025-03-21  [AMDGPU] Insert before and after instructions that always use GDS (#131338)  Stephen Thomas
2025-03-19  [AMDGPU] Deallocate VGPRs before exiting in dynamic VGPR mode (#130037)  Diana Picus
2025-03-11  [AMDGPU][True16][CodeGen] update waitcnt for true16 (#128927)  Brox Chen
2025-03-06  [AMDGPU] Reduce size of SGPR arrays in SIInsertWaitcnts. NFC. (#130097)  Jay Foad
2025-03-06  [llvm] Fix an unused variable warning  Dmitri Gribenko
2025-03-06  [AMDGPU] Remove RegisterEncoding from SIInsertWaitcnts. NFC. (#130056)  Jay Foad
2025-03-04  [AMDGPU] Remove unused s_barrier_{init,join,leave} instructions (#129548)  Mariusz Sikora
2025-02-12  [TableGen] Emit OpName as an enum class instead of a namespace (#125313)  Rahul Joshi