math: Use tanf from CORE-MATH

The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance to the generic tanf. The code was adapted to glibc style, to use the definition of math_config.h, to remove errno handling, and to use a generic 128 bit routine for ABIs that do not support it natively. Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (neoverse1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 82.3961 54.8052 33.49% x86_64v2 82.3415 54.8052 33.44% x86_64v3 69.3661 50.4864 27.22% i686 219.271 45.5396 79.23% aarch64 29.2127 19.1951 34.29% power10 19.5060 16.2760 16.56% reciprocal-throughput master patched improvement x86_64 28.3976 19.7334 30.51% x86_64v2 28.4568 19.7334 30.65% x86_64v3 21.1815 16.1811 23.61% i686 105.016 15.1426 85.58% aarch64 18.1573 10.7681 40.70% power10 8.7207 8.7097 0.13% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>
author: Adhemerval Zanella <adhemerval.zanella@linaro.org> 2024-11-08 13:24:28 -0300
committer: Adhemerval Zanella <adhemerval.zanella@linaro.org> 2024-11-22 10:52:27 -0300
commit: bccb0648ea29f89a7b1b64f3e5674d2338e3798e (patch)
tree: 31849761a6d493625977c973db58c50669935fc9 /SHARED-FILES
parent: d846f4c12d7636efd5b7cff173456e616a185e24 (diff)
1 files changed, 6 insertions, 0 deletions
diff --git a/SHARED-FILES b/SHARED-FILES
index 033ce7f092..9f4de81b66 100644
--- a/SHARED-FILES
+++ b/SHARED-FILES
@@ -288,3 +288,9 @@ sysdeps/ieee754/flt-32/e_lgammaf_r.c:
   - remove the errno stuff (this is done by the wrapper)
   - replace 0x1p127f * 0x1p127f by math_narrow_eval (x * 0x1p127f)
   - add libm_alias_finite (__ieee754_lgammaf_r, __lgammaf_r) at the end
+sysdeps/ieee754/flt-32/s_tanf.c:
+  (src/binary32/tan/tanf.c in CORE-MATH)
+  - The code was adapted to use glibc code style and internal
+    functions to handle errno, overflow, and underflow.  It was changed
+    to use an internal wrapper for 128 bit unsigned integer operations
+    for ABIs that do not support the type natively.
author	Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-08 13:24:28 -0300
committer	Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-22 10:52:27 -0300
commit	bccb0648ea29f89a7b1b64f3e5674d2338e3798e (patch)
tree	31849761a6d493625977c973db58c50669935fc9 /SHARED-FILES
parent	d846f4c12d7636efd5b7cff173456e616a185e24 (diff)