TY - JOUR

T1 - New systolic arrays for matrix multiplication

AU - Chen, Sau-Gee

AU - Lee, Jiann Cherng

AU - Li, Chieh Chih

PY - 1994/1/1

Y1 - 1994/1/1

N2 - In this paper, three new systolic arrays for matrix multiplication are proposed. The first systolic array completes a matrix multiplication in 3n-2 clock cycles, the minimum among the known structures, using n^2 processing elements (PEs). This is achieved by applying a new input data flow and deposition scheme. The second array is derived by combining this data flow technique with Blahut's simple matrix multiplication algorithm. Not only does the second array have the least processing time of 3n-2 clock cycles, it also has the least area complexity of about n^2/2 PEs. By further modifying its input data flow patterns, the third array is obtained. Its processing time is further reduced to 2.5n-2 clock cycles. The proposed architectures exhibit better performance than the known structures, according to several standard performance measures.

AB - In this paper, three new systolic arrays for matrix multiplication are proposed. The first systolic array completes a matrix multiplication in 3n-2 clock cycles, the minimum among the known structures, using n^2 processing elements (PEs). This is achieved by applying a new input data flow and deposition scheme. The second array is derived by combining this data flow technique with Blahut's simple matrix multiplication algorithm. Not only does the second array have the least processing time of 3n-2 clock cycles, it also has the least area complexity of about n^2/2 PEs. By further modifying its input data flow patterns, the third array is obtained. Its processing time is further reduced to 2.5n-2 clock cycles. The proposed architectures exhibit better performance than the known structures, according to several standard performance measures.

UR - http://www.scopus.com/inward/record.url?scp=4243593467&partnerID=8YFLogxK

U2 - 10.1109/ICPP.1994.134

DO - 10.1109/ICPP.1994.134

M3 - Conference article

AN - SCOPUS:4243593467

VL - 2

JO - Proceedings of the International Conference on Parallel Processing

JF - Proceedings of the International Conference on Parallel Processing

SN - 0190-3918

M1 - 5727789

Y2 - 15 August 1994 through 19 August 1994

ER -