Concurrency Using CUDA Streams and Events

Richard Ansorge

doi:10.1017/9781108855273.008

7 - Concurrency Using CUDA Streams and Events

Published online by Cambridge University Press: 04 May 2022

Richard Ansorge

Show author details

Richard Ansorge: Affiliation:
University of Cambridge

Book contents

Get access

Summary

Chapter 7 explores the ability of GPUs to perform multiple tasks simultaneously, including overlapping IO with computation and the simultaneous running of multiple kernels. CUDA streams and events are advanced features that allow users to manage multiple asynchronous tasks running on the GPU. Examples are given and the NVIDIA visual profiler (NVVP) is used to visualise the timeline for tasks in multiple CUDA streams. Asynchronous disk IO on the host PC can also be performed and examples using the C++ <threads> are given. Finally, the new CUDA graphs feature is introduced. This provides a wrapper for efficiently launching large numbers of kernel calls for complex workloads.

Keywords

concurrent execution concurrent disk IO CUDA events CUDA streams CUDA graphs

Type: Chapter
Information: Programming in Parallel with CUDA
A Practical Guide
, pp. 209 - 238

DOI: https://doi.org/10.1017/9781108855273.008 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2022

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

7 - Concurrency Using CUDA Streams and Events

Summary

Keywords

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive