Cost-effective VLSI architectures and buffer size optimization for full-search block matching algorithms

Yuan Hau Yeh, Chen-Yi Lee

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

This paper presents two efficient very large scale integration (VLSI) architectures and buffer size optimization for full-search block matching algorithms. Starting from an overlapped data flow of search area, both systolic- and semisystolic-array architectural solutions are derived. By means of exploiting stream memory banks, not only input/output (I/O) bandwidth can be minimized, but also processor element efficiency can be improved. In addition, the controller structure for both solutions are very straightforward, making them very suitable for VLSI implementation to meet computational requirements. Moreover, by exploring the dependency graph, we focus on the problem of reducing the internal buffer size under minimal I/O bandwidth constraint to derive guidelines on reducing redundant internal buffer as well as to achieve area-efficient VLSI architectures. Simulation results show that, for N = P = 16 (N is the reference block size and P is the search range), I/O bandwidth can be reduced by 2.4 times, while buffer size increases less than 38%. Two prototype chips for N = P = 16 have been designed and fabricated. Test results show that clock rate can be up to 90 MHz.

Original languageEnglish
Pages (from-to)345-358
Number of pages14
JournalIEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume7
Issue number3
DOIs
StatePublished - 1 Jan 1999

Fingerprint Dive into the research topics of 'Cost-effective VLSI architectures and buffer size optimization for full-search block matching algorithms'. Together they form a unique fingerprint.

Cite this