Migrating and Optimizing the Convolution Separable from CUDA to SYCL

Overview

The convolution separable is a process in which a single convolution can be divided into two or more convolutions to produce the same output. The original CUDA* source code is migrated to SYCL for portability across GPUs from multiple vendors.

Area	Description
What you will learn	Migrate convolution separable sample from CUDA to SYCL
Time to complete	15 minutes
Category	Code Optimization

Key Implementation Details

This sample implements a separable convolution filter of a 2D image with an arbitrary kernel. There are two functions in the code named convolutionRowsGPU and convolutionColumnsGPU in which the kernel functions (convolutionRowsKernel & convolutionColumnsKernel) are called where the loading of the input data and computations are performed. We validate the results with reference CPU separable convolution implementation by calculating the relative L2 norm.

For more information on the convolutionSeparable SYCL migrated sample and build details on CPU and GPU, refer here.

Original CUDA source files: convolutionSeparable.

Migrated SYCL source files including step by step instructions: guided_convolutionSeparable_SYCLmigration.

选择您的语言

使用 Intel.com 搜索

快速链接

最近搜索

高级搜索

仅搜索

Migrating and optimizing the convolution separable from CUDA* to SYCL*

Overview

Key Implementation Details

References

产品和性能信息