Complex Matrix Multiplication

riscv_status riscv_mat_cmplx_mult_f16(const riscv_matrix_instance_f16 *pSrcA, const riscv_matrix_instance_f16 *pSrcB, riscv_matrix_instance_f16 *pDst)
riscv_status riscv_mat_cmplx_mult_f32(const riscv_matrix_instance_f32 *pSrcA, const riscv_matrix_instance_f32 *pSrcB, riscv_matrix_instance_f32 *pDst)
riscv_status riscv_mat_cmplx_mult_q15(const riscv_matrix_instance_q15 *pSrcA, const riscv_matrix_instance_q15 *pSrcB, riscv_matrix_instance_q15 *pDst, q15_t *pScratch)
riscv_status riscv_mat_cmplx_mult_q31(const riscv_matrix_instance_q31 *pSrcA, const riscv_matrix_instance_q31 *pSrcB, riscv_matrix_instance_q31 *pDst)
group CmplxMatrixMult

Complex Matrix multiplication is only defined if the number of columns of the first matrix equals the number of rows of the second matrix. Multiplying an M x N matrix with an N x P matrix results in an M x P matrix.

When matrix size checking is enabled, the functions check:

  • that the inner dimensions of pSrcA and pSrcB are equal;

  • that the size of the output matrix equals the outer dimensions of pSrcA and pSrcB.

Functions

riscv_status riscv_mat_cmplx_mult_f16(const riscv_matrix_instance_f16 *pSrcA, const riscv_matrix_instance_f16 *pSrcB, riscv_matrix_instance_f16 *pDst)

Floating-point Complex matrix multiplication.

Floating-point, complex, matrix multiplication.

Parameters
  • pSrcA[in] points to first input complex matrix structure

  • pSrcB[in] points to second input complex matrix structure

  • pDst[out] points to output complex matrix structure

Returns

execution status

  • RISCV_MATH_SUCCESS : Operation successful

  • RISCV_MATH_SIZE_MISMATCH : Matrix size check failed

riscv_status riscv_mat_cmplx_mult_f32(const riscv_matrix_instance_f32 *pSrcA, const riscv_matrix_instance_f32 *pSrcB, riscv_matrix_instance_f32 *pDst)

Floating-point Complex matrix multiplication.

Floating-point, complex, matrix multiplication.

Parameters
  • pSrcA[in] points to first input complex matrix structure

  • pSrcB[in] points to second input complex matrix structure

  • pDst[out] points to output complex matrix structure

Returns

execution status

  • RISCV_MATH_SUCCESS : Operation successful

  • RISCV_MATH_SIZE_MISMATCH : Matrix size check failed

riscv_status riscv_mat_cmplx_mult_q15(const riscv_matrix_instance_q15 *pSrcA, const riscv_matrix_instance_q15 *pSrcB, riscv_matrix_instance_q15 *pDst, q15_t *pScratch)

Q15 Complex matrix multiplication.

Q15, complex, matrix multiplication.

Conditions for optimum performance

Input, output and state buffers should be aligned by 32-bit

Scaling and Overflow Behavior

The function is implemented using an internal 64-bit accumulator. The inputs to the multiplications are in 1.15 format and multiplications yield a 2.30 result. The 2.30 intermediate results are accumulated in a 64-bit accumulator in 34.30 format. This approach provides 33 guard bits and there is no risk of overflow. The 34.30 result is then truncated to 34.15 format by discarding the low 15 bits and then saturated to 1.15 format.

Parameters
  • pSrcA[in] points to first input complex matrix structure

  • pSrcB[in] points to second input complex matrix structure

  • pDst[out] points to output complex matrix structure

  • pScratch[in] points to an array for storing intermediate results

Returns

execution status

  • RISCV_MATH_SUCCESS : Operation successful

  • RISCV_MATH_SIZE_MISMATCH : Matrix size check failed

riscv_status riscv_mat_cmplx_mult_q31(const riscv_matrix_instance_q31 *pSrcA, const riscv_matrix_instance_q31 *pSrcB, riscv_matrix_instance_q31 *pDst)

Q31 Complex matrix multiplication.

Q31, complex, matrix multiplication.

Scaling and Overflow Behavior

The function is implemented using an internal 64-bit accumulator. The accumulator has a 2.62 format and maintains full precision of the intermediate multiplication results but provides only a single guard bit. There is no saturation on intermediate additions. Thus, if the accumulator overflows it wraps around and distorts the result. The input signals should be scaled down to avoid intermediate overflows. The input is thus scaled down by log2(numColsA) bits to avoid overflows, as a total of numColsA additions are performed internally. The 2.62 accumulator is right shifted by 31 bits and saturated to 1.31 format to yield the final result.

Parameters
  • pSrcA[in] points to first input complex matrix structure

  • pSrcB[in] points to second input complex matrix structure

  • pDst[out] points to output complex matrix structure

Returns

execution status

  • RISCV_MATH_SUCCESS : Operation successful

  • RISCV_MATH_SIZE_MISMATCH : Matrix size check failed