Fundamentals of Accelerated Computing with CUDA Python (FACCP)
This course explores how to use Numba—the just-in-time, type-specializing Python function compiler—to accelerate Python programs to run on massively parallel NVIDIA GPUs. You’ll learn how to: · Use...
This course explores how to use Numba—the just-in-time, type-specializing Python function compiler—to accelerate Python programs to run on massively parallel NVIDIA GPUs. You’ll learn how to: · Use Numba to compile CUDA kernels from NumPy universal functions (ufuncs). Use Numba to create and launch custom CUDA kernels · Apply key GPU memory management techniques Upon completion, you’ll be able to use Numba to compile and launch CUDA kernels to accelerate your Python applications on NVIDIA GPUs.
…
Il n'y a pour le moment aucune question fréquente sur ce produit. Si vous avez besoin d'aide ou une question, contactez notre équipe support.
This course explores how to use Numba—the just-in-time, type-specializing Python function compiler—to accelerate Python programs to run on massively parallel NVIDIA GPUs. You’ll learn how to: · Use...
This course explores how to use Numba—the just-in-time,
type-specializing Python function compiler—to accelerate Python
programs to run on massively parallel NVIDIA GPUs. You’ll learn how
to: · Use Numba to compile CUDA kernels from NumPy universal
functions (ufuncs). Use Numba to create and launch custom CUDA
kernels · Apply key GPU memory management techniques Upon
completion, you’ll be able to use Numba to compile and launch CUDA
kernels to accelerate your Python applications on NVIDIA GPUs.
Please note that once a booking has been confirmed, it is
non-refundable. This means that after you have confirmed your seat
for an event, it cannot be cancelled and no refund will be issued,
regardless of attendance.
At the conclusion of the workshop, you’ll have an understanding
of the fundamental tools and techniques for GPU-accelerated Python
applications with CUDA and Numba:
- GPU-accelerate NumPy ufuncs with a few lines of code.
- Configure code parallelization using the CUDA thread
hierarchy.
- Write custom CUDA device kernels for maximum performance and
flexibility.
- Use memory coalescing and on-device shared memory to increase
CUDA kernel bandwidth.
Introduction
- Meet the instructor.
- Create an account at https://learn.nvidia.com/join
Introduction to CUDA Python with Numba
- Begin working with the Numba compiler and CUDA programming in
Python.
- Use Numba decorators to GPU-accelerate numerical Python
functions.
- Optimize host-to-device and device-to-host memory transfers.
Custom CUDA Kernels in Python with Numba
- Learn CUDA’s parallel thread hierarchy and how to extend parallel
program possibilities.
- Launch massively par...
Fast Lane werkt met Nederlandse trainers die didactische vaardigheden combineren met veel practische ervaring.
Il n'y a pour le moment aucune question fréquente sur ce produit. Si vous avez besoin d'aide ou une question, contactez notre équipe support.

