Computer Science Technical Reports
CS at VT

CoreTSAR: Task Scheduling for Accelerator-aware Runtimes

Scogland, Thomas R. W. and Feng, Wu-chun and Rountree, Barry and de Supinski, Bronis R. (2012) CoreTSAR: Task Scheduling for Accelerator-aware Runtimes. Technical Report TR-12-20, Computer Science, Virginia Tech.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.
TechReport-PPoPP13.pdf (412380)

Abstract

Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasingly popular due to their high peak performance, energy efficiency and comparatively low cost. Unfortunately, the programming models and frameworks designed to extract performance from all computational units still lack the flexibility of their CPU-only counterparts. Accelerated OpenMP improves this situation by supporting natural migration of OpenMP code from CPUs to a GPU. However, these implementations currently lose one of OpenMP’s best features, its flexibility: typical OpenMP applications can run on any number of CPUs. GPU implementations do not transparently employ multiple GPUs on a node or a mix of GPUs and CPUs. To address these shortcomings, we present CoreTSAR, our runtime library for dynamically scheduling tasks across heterogeneous resources, and propose straightforward extensions that incorporate this functionality into Accelerated OpenMP. We show that our approach can provide nearly linear speedup to four GPUs over only using CPUs or one GPU while increasing the overall flexibility of Accelerated OpenMP.

Item Type:Departmental Technical Report
Keywords:GPU, heterogeneous computing, parallel programming models, run-time systems
Subjects:Computer Science > Parallel Computation
ID Code:1212
Deposited By:Administrator, Eprints
Deposited On:12 November 2012