diff options
| author | David Green <david.green@arm.com> | 2023-03-07 21:24:16 +0000 |
|---|---|---|
| committer | David Green <david.green@arm.com> | 2023-03-07 21:24:16 +0000 |
| commit | 9aa39481d9eb718e872993791547053a3c1f16d5 (patch) | |
| tree | 711a1d7e8fd64358db17510d5046aa5d13e5fda0 /lldb/test/API/python_api/debugger/TestDebuggerAPI.py | |
| parent | 912404db7882084bc2a5fa417746952bee04e26a (diff) | |
[AArch64] Prefer to fold dup into fmul/fma as opposed to ld1r
There is a fold to create LD1DUPpost from dup(load) that can be postinc. If the
dup is used by a "by element" operation such as fmul or fma then it can be
slightly better to fold the dup into the fmul instead, which produces slightly
fast code.
ld1r { v1.4s }, [x0], #4
fmul v0.4s, v1.4s, v0.4s
vs
ldr s1, [x0], #4
fmul v0.4s, v0.4s, v1.s[0]
This could also be done with integer operations such as smull/umull too, so
long as the load/dup gets correctly combined into the mul operation. Currently
this just operates on foating point types.
Differential Revision: https://reviews.llvm.org/D145184
Diffstat (limited to 'lldb/test/API/python_api/debugger/TestDebuggerAPI.py')
0 files changed, 0 insertions, 0 deletions
