path: root/lldb/test/API/python_api/debugger/TestDebuggerAPI.py
author David Green <david.green@arm.com> 2023-03-07 21:24:16 +0000
committer David Green <david.green@arm.com> 2023-03-07 21:24:16 +0000
commit 9aa39481d9eb718e872993791547053a3c1f16d5
tree 711a1d7e8fd64358db17510d5046aa5d13e5fda0 /lldb/test/API/python_api/debugger/TestDebuggerAPI.py
parent 912404db7882084bc2a5fa417746952bee04e26a
[AArch64] Prefer to fold dup into fmul/fma as opposed to ld1r
There is a fold that creates LD1DUPpost from dup(load) when the load can be
post-incremented. If the dup is used by a "by element" operation such as fmul
or fma, it can be slightly better to fold the dup into the fmul instead, which
produces slightly faster code.

  ld1r { v1.4s }, [x0], #4
  fmul v0.4s, v1.4s, v0.4s
vs
  ldr s1, [x0], #4
  fmul v0.4s, v0.4s, v1.s[0]

This could also be done for integer operations such as smull/umull, so long as
the load/dup gets correctly combined into the mul operation. Currently this
only operates on floating point types.

Differential Revision: https://reviews.llvm.org/D145184
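As a rough illustration of why the two sequences are interchangeable, the sketch below models each one in Python. It is not part of the patch: the function names, the list-based "memory" and the `f32` rounding helper are all assumptions made for the example, and it models only the arithmetic result and the post-incremented index, not instruction timing.

```python
import struct


def f32(x):
    """Round a Python float to IEEE-754 single precision (hypothetical helper)."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def ld1r_then_fmul(memory, index, v0):
    """Model of: ld1r { v1.4s }, [x0], #4 ; fmul v0.4s, v1.4s, v0.4s"""
    # ld1r: load one float, replicate it to all four lanes, post-increment.
    v1 = [memory[index]] * 4
    index += 1
    # fmul (vector): lane-wise multiply of two full vectors.
    return [f32(a * b) for a, b in zip(v1, v0)], index


def ldr_then_fmul_by_element(memory, index, v0):
    """Model of: ldr s1, [x0], #4 ; fmul v0.4s, v0.4s, v1.s[0]"""
    # ldr: load one float into lane 0 only, post-increment.
    s1 = memory[index]
    index += 1
    # fmul (by element): multiply every lane of v0 by lane 0 of v1.
    return [f32(a * s1) for a in v0], index


memory = [f32(1.5), f32(-2.0)]
v0 = [1.0, 2.0, 3.0, 4.0]
print(ld1r_then_fmul(memory, 0, v0))
print(ldr_then_fmul_by_element(memory, 0, v0))
```

Both functions return the same lanes and the same advanced index, so under this model the choice between them comes down to which instruction pair is cheaper, which is the preference the commit changes.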
Diffstat (limited to 'lldb/test/API/python_api/debugger/TestDebuggerAPI.py')
0 files changed, 0 insertions, 0 deletions