This intrinsic inserts an SMLALDX instruction
into the instruction stream generated by the compiler. It enables
you to exchange the halfwords of the second operand, and perform
two signed 16-bit multiplications, adding both results to a 64-bit
accumulate operand. Overflow is only possible as a result of the
64-bit addition. This overflow is not detected if it occurs. Instead, the
result wraps around modulo 2^64.
unsigned long long __smlaldx(unsigned int val1, unsigned int val2, unsigned long long val3)
where:
val1
holds the first halfword operands for each multiplication.
val2
holds the second halfword operands for each multiplication.
val3
holds the accumulate value.
The __smlaldx intrinsic returns the sum of the two
products and the 64-bit accumulate value.
Example:
unsigned long long dual_multiply_accumulate(unsigned int val1, unsigned int val2, unsigned long long val3)
{
    unsigned long long res;
    res = __smlaldx(val1, val2, val3); /* p1 = val1[15:0] × val2[31:16]
                                          p2 = val1[31:16] × val2[15:0]
                                          sum = p1 + p2 + val3[63:0]
                                          res[63:32] = sum[63:32]
                                          res[31:0] = sum[31:0]
                                       */
    return res;
}
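The arithmetic described above can also be modelled in portable C, which may be useful for checking results on a host that has no SMLALDX instruction. The following is a minimal sketch, not the compiler's implementation: the names smlaldx_model and sext16 are hypothetical helpers introduced here for illustration and are not part of any ARM header.

```c
#include <stdint.h>

/* Hypothetical helper: sign-extend the low 16 bits of h to a 32-bit value. */
static int32_t sext16(uint32_t h)
{
    h &= 0xFFFFu;
    return (int32_t)(h ^ 0x8000u) - 0x8000;
}

/* Hypothetical reference model of the operation described for __smlaldx. */
static uint64_t smlaldx_model(uint32_t val1, uint32_t val2, uint64_t val3)
{
    /* The halfwords of the second operand are exchanged before multiplying. */
    int64_t p1 = (int64_t)sext16(val1)       * sext16(val2 >> 16); /* val1[15:0]  x val2[31:16] */
    int64_t p2 = (int64_t)sext16(val1 >> 16) * sext16(val2);       /* val1[31:16] x val2[15:0]  */

    /* Unsigned 64-bit addition wraps modulo 2^64, matching the undetected overflow. */
    return val3 + (uint64_t)p1 + (uint64_t)p2;
}
```

For example, with val1 = 0x00020003, val2 = 0x00040005 and val3 = 100, the model computes 3 × 4 + 2 × 5 + 100 = 122. With val3 = 0xFFFFFFFFFFFFFFFF, val1 = 0x00000001 and val2 = 0x00010000, the sum wraps around to 0.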