Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Data-flow analysis and optimization for data coherence in heterogeneous architectures

Full text
Author(s):
Sousa, Rafael [1] ; Pereira, Marcio [1] ; Quintao Pereira, Fernando Magno [2] ; Araujo, Guido [1]
Total Authors: 4
Affiliation:
[1] Univ Estadual Campinas, UNICAMP, Inst Comp, Campinas, SP - Brazil
[2] Univ Minas Gerais, Dept Comp Sci, UFMG, Belo Horizonte, MG - Brazil
Total Affiliations: 2
Document type: Journal article
Source: JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING; v. 130, p. 126-139, AUG 2019.
Web of Science Citations: 1
Abstract

Although heterogeneous computing has enabled developers to achieve impressive program speed-ups, the cost of moving and keeping data coherent between host and device may easily eliminate any performance gains achieved by acceleration. To deal with this problem, this paper introduces DCA: a pair of two data-flow analyses that determine how variables are used by host/device at each program point. It also introduces DCO, a code optimization technique that uses DCA information to: (a) allocate OpenCL shared buffers between host and devices; and (b) insert appropriate OpenCL function calls into program points so as to minimize the number of data coherence operations. We have used the AClang compiler to measure the impact of DCA and DCO when generating code from Parboil, Polybench and Rodinia benchmarks for a set of discrete/integrated CPUs. The experimental results showed speed-ups of up to 5.25x (average of 1.39x) on an ARM Mali-T880 and up to 8.87x (average of 1.66x) on an NVIDIA GPU Pascal Titan X. (C) 2019 Elsevier Inc. All rights reserved. (AU)

FAPESP's process: 13/08293-7 - CCES - Center for Computational Engineering and Sciences
Grantee:Munir Salomao Skaf
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC