summaryrefslogtreecommitdiffstats
path: root/arch/mips/include
diff options
context:
space:
mode:
authorMaciej W. Rozycki <macro@orcam.me.uk>2021-04-20 04:50:48 +0200
committerGreg Kroah-Hartman <gregkh@linuxfoundation.org>2021-05-22 10:59:47 +0200
commitc70877c0fa032cc7efebe013d67827cdf9032dd8 (patch)
treeb799fdeb03976a4e0b77e9d0d62305b9d08bc48a /arch/mips/include
parent1506af3076817306f61c64fd67b180746b14c7fe (diff)
downloadlinux-stable-c70877c0fa032cc7efebe013d67827cdf9032dd8.tar.gz
linux-stable-c70877c0fa032cc7efebe013d67827cdf9032dd8.tar.bz2
linux-stable-c70877c0fa032cc7efebe013d67827cdf9032dd8.zip
MIPS: Avoid DIVU in `__div64_32' is result would be zero
commit c1d337d45ec0a802299688e17d568c4e3a585895 upstream. We already check the high part of the divident against zero to avoid the costly DIVU instruction in that case, needed to reduce the high part of the divident, so we may well check against the divisor instead and set the high part of the quotient to zero right away. We need to treat the high part the divident in that case though as the remainder that would be calculated by the DIVU instruction we avoided. This has passed correctness verification with test_div64 and reduced the module's average execution time down to 1.0445s and 0.2619s from 1.0668s and 0.2629s respectively for an R3400 CPU @40MHz and a 5Kc CPU @160MHz. Signed-off-by: Maciej W. Rozycki <macro@orcam.me.uk> Signed-off-by: Thomas Bogendoerfer <tsbogend@alpha.franken.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Diffstat (limited to 'arch/mips/include')
-rw-r--r--arch/mips/include/asm/div64.h6
1 files changed, 4 insertions, 2 deletions
diff --git a/arch/mips/include/asm/div64.h b/arch/mips/include/asm/div64.h
index b252300e299d..a882a7bdf63e 100644
--- a/arch/mips/include/asm/div64.h
+++ b/arch/mips/include/asm/div64.h
@@ -68,9 +68,11 @@
\
__high = __div >> 32; \
__low = __div; \
- __upper = __high; \
\
- if (__high) { \
+ if (__high < __radix) { \
+ __upper = __high; \
+ __high = 0; \
+ } else { \
__asm__("divu $0, %z1, %z2" \
: "=x" (__modquot) \
: "Jr" (__high), "Jr" (__radix)); \