Advanced search
Start date
Betweenand


Improving Speculative taskloop in Hardware Transactional Memory

Full text
Author(s):
Salamanca, Juan ; Baldassin, Alexandro ; McIntoshSmith, S ; DeSupinski, BR ; Klinkenberg, J
Total Authors: 5
Document type: Journal article
Source: OPENMP: ENABLING MASSIVE NODE-LEVEL PARALLELISM, IWOMP 2021; v. 12870, p. 15-pg., 2021-01-01.
Abstract

Previous work proposed and evaluated Speculative taskloop (STL) on Intel Core implementing new clauses and constructs in OpenMP. The results indicated that, despite achieving some speed-ups, there was a phenomenon called the Lost-Thread Effect that caused the performance degradation of STL parallelization. This issue is caused by the nonmonotonic scheduling implemented in the LLVM OpenMP Runtime Library. This paper presents an improvement in the STL mechanism by modifying the OpenMP runtime to allow monotonic scheduling of tasks generated by taskloop. We perform an evaluation with two different versions of the OpenMP runtime, both optimized for STL revealing that, for certain loops, infinite slowdowns (deadlocks) using the original OpenMP runtime can be transformed in speed-ups by applying monotonic scheduling. The experimental results show the performance improvement of STL using the modified version of the runtime, reaching speed-ups of up to 2.18x. (AU)

FAPESP's process: 18/15519-5 - Performance optimizations for multicore architectures
Grantee:Alexandro José Baldassin
Support Opportunities: Research Grants - Young Investigators Grants - Phase 2
FAPESP's process: 18/07446-8 - Integrating Speculative Execution to Loop and Task-based Parallelization
Grantee:Juan Jesús Salamanca Guillén
Support Opportunities: Scholarships in Brazil - Post-Doctoral