CoreTSAR: Task Scheduling for Accelerator-aware Runtimes
2012) CoreTSAR: Task Scheduling for Accelerator-aware Runtimes. Technical Report TR-12-20, Computer Science, Virginia Tech. (
Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasingly popular due to their high peak performance, energy efficiency and comparatively low cost. Unfortunately, the programming models and frameworks designed to extract performance from all computational units still lack the flexibility of their CPU-only counterparts. Accelerated OpenMP improves this situation by supporting natural migration of OpenMP code from CPUs to a GPU. However, these implementations currently lose one of OpenMP’s best features, its flexibility: typical OpenMP applications can run on any number of CPUs. GPU implementations do not transparently employ multiple GPUs on a node or a mix of GPUs and CPUs. To address these shortcomings, we present CoreTSAR, our runtime library for dynamically scheduling tasks across heterogeneous resources, and propose straightforward extensions that incorporate this functionality into Accelerated OpenMP. We show that our approach can provide nearly linear speedup to four GPUs over only using CPUs or one GPU while increasing the overall flexibility of Accelerated OpenMP.
|Item Type:||Departmental Technical Report|
|Keywords:||GPU, heterogeneous computing, parallel programming models, run-time systems|
|Subjects:||Computer Science > Parallel Computation|
|Deposited By:||Administrator, Eprints|
|Deposited On:||12 November 2012|