Optimized memory access support for data layout conversion on heterogeneous multi-core systems

Chia Chen Hsu, Cheng Yen Lin, Shin Kai Chen, Chih-Wei Liu, Jenq Kuen Lee

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

3 Scopus citations

Abstract

Heterogeneous multi-core systems that contain multiple CPUs and GPUs are gaining momentum because they provide different kinds of computation power to meet the performance demands of modern applications. On such systems, developers try to fully utilize the computation power of both CPUs and GPUs by using emerging programming models such as CUDA and OpenCL. To achieve maximal performance, developers must carefully offload the appropriate workload to each compute device according to the characteristics of the target architecture. In such a scenario, seamless data movement between different processors becomes crucial. Additionally, reorganizing the data layout to fit the target architecture addresses this concern, for example array-of-structures (AOS) for CPUs, structure-of-arrays (SOA) for GPUs, and conversion from coordinate (COO) format to ELLPACK (ELL) for sparse computation. In this paper, we propose a hardware memory manager that efficiently optimizes the conversion of data layouts for heterogeneous multi-core systems on the fly. Our design addresses both coalescing and sparse format conversion. A novel ping-pong transpose architecture is devised to reorganize non-coalesced access patterns, and a histogram unit and a sparse address generator are presented to handle sparse storage format transformation. Our design reduces the overhead of data transfer and layout transformation between CPU and GPU. In our experiments, our design achieves a speedup of 2.19 to 68.5 times over a software-based library, depending on data size.
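
The two layout conversions named in the abstract can be illustrated with a minimal software sketch. This is not the hardware memory manager the paper proposes, and it is not the software library the paper compares against. It only shows what AOS to SOA and COO to ELL conversion mean. The function names `aos_to_soa` and `coo_to_ell`, the padding values, and the list-based representation are assumptions made for illustration. The counting pass in `coo_to_ell` is loosely analogous to the role a histogram could play in sizing the ELL rows, but how the paper's histogram unit actually works is not stated in the abstract.

```python
def aos_to_soa(records, fields):
    """Convert an array of structures (list of dicts) into a
    structure of arrays (dict of lists), one list per field."""
    return {f: [r[f] for r in records] for f in fields}


def coo_to_ell(rows, cols, vals, n_rows, pad_col=-1, pad_val=0.0):
    """Convert a sparse matrix from COO triples to ELL arrays.

    pad_col and pad_val are hypothetical padding markers chosen
    for this sketch; real libraries may use other conventions.
    """
    # Counting pass: number of nonzeros in each row.
    counts = [0] * n_rows
    for r in rows:
        counts[r] += 1
    # ELL stores every row with the same width (the longest row).
    width = max(counts, default=0)

    ell_cols = [[pad_col] * width for _ in range(n_rows)]
    ell_vals = [[pad_val] * width for _ in range(n_rows)]

    # Scatter pass: put each entry into the next free slot of its row.
    fill = [0] * n_rows
    for r, c, v in zip(rows, cols, vals):
        k = fill[r]
        ell_cols[r][k] = c
        ell_vals[r][k] = v
        fill[r] = k + 1
    return ell_cols, ell_vals, width


if __name__ == "__main__":
    points = [{"x": 1, "y": 2}, {"x": 3, "y": 4}]
    print(aos_to_soa(points, ["x", "y"]))
    print(coo_to_ell([0, 0, 2], [0, 2, 1], [1.0, 2.0, 3.0], n_rows=3))
```

In the SOA form, all values of one field sit next to each other, which is the property that lets adjacent GPU threads read adjacent addresses. In the ELL form, every row has the same length, at the cost of padding short rows.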

Original language: English
Title of host publication: 2014 IEEE 12th Symposium on Embedded Systems for Real-Time Multimedia, ESTIMedia 2014
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 128-137
Number of pages: 10
ISBN (Electronic): 9781479963072
State: Published - 18 Nov 2014
Event: 12th IEEE Symposium on Embedded Systems for Real-Time Multimedia, ESTIMedia 2014 - Greater Noida, India
Duration: 16 Oct 2014 to 17 Oct 2014

Publication series

Name: 2014 IEEE 12th Symposium on Embedded Systems for Real-Time Multimedia, ESTIMedia 2014

Conference

Conference: 12th IEEE Symposium on Embedded Systems for Real-Time Multimedia, ESTIMedia 2014
Country: India
City: Greater Noida
Period: 16/10/14 to 17/10/14
