TY - JOUR
T1 - New systolic arrays for matrix multiplication
AU - Chen, Sau-Gee
AU - Lee, Jiann Cherng
AU - Li, Chieh Chih
PY - 1994/1/1
Y1 - 1994/1/1
N2 - In this paper, three new systolic arrays for matrix multiplication are proposed. The first systolic array completes a matrix multiplication in 3n-2 clock cycles, the minimum among the known structures, using n^2 processing elements (PEs). This is achieved by applying a new input data flow and deposition scheme. The second array is derived by combining the data flow technique with Blahut's simple matrix multiplication algorithm. Not only does the second array have the least processing time of 3n-2 clock cycles, it also has the least area complexity of about n^2/2 PEs. By further modifying its input data flow patterns, the third array is obtained. Its processing time is further reduced to 2.5n-2 clock cycles. The proposed architectures exhibit better performance than the known structures, according to several standard performance measures.
AB - In this paper, three new systolic arrays for matrix multiplication are proposed. The first systolic array completes a matrix multiplication in 3n-2 clock cycles, the minimum among the known structures, using n^2 processing elements (PEs). This is achieved by applying a new input data flow and deposition scheme. The second array is derived by combining the data flow technique with Blahut's simple matrix multiplication algorithm. Not only does the second array have the least processing time of 3n-2 clock cycles, it also has the least area complexity of about n^2/2 PEs. By further modifying its input data flow patterns, the third array is obtained. Its processing time is further reduced to 2.5n-2 clock cycles. The proposed architectures exhibit better performance than the known structures, according to several standard performance measures.
UR - http://www.scopus.com/inward/record.url?scp=4243593467&partnerID=8YFLogxK
U2 - 10.1109/ICPP.1994.134
DO - 10.1109/ICPP.1994.134
M3 - Conference article
AN - SCOPUS:4243593467
VL - 2
JO - Proceedings of the International Conference on Parallel Processing
JF - Proceedings of the International Conference on Parallel Processing
SN - 0190-3918
M1 - 5727789
Y2 - 15 August 1994 through 19 August 1994
ER -