Profile Heterogeneous Computing Performance with Intel® VTune™ Profiler

Overview

Programming heterogeneous platforms requires a deep understanding of the system architecture at all levels, so that the application design can take advantage of the best data and work decomposition between CPUs and accelerators such as GPUs. In many cases, however, applications are being converted from a conventional CPU programming language (like C++) or from an accelerator-friendly but still low-level language (like OpenCL™ code). The main problem is determining which parts of the application benefit from being offloaded to a GPU. Another is estimating how much performance a particular GPU device can add. Each platform has unique limitations that affect the performance of offloaded computing tasks, for example: data transfer tax, task initialization overhead, memory latency, and bandwidth limitations. To account for these constraints, software developers need tools that collect the right information and produce recommendations for making the best design and optimization decisions.
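To make the offload trade-off concrete, here is a minimal back-of-the-envelope sketch. It is not how Intel VTune Profiler computes anything. It assumes a simple additive cost model (launch overhead plus transfer time plus kernel time), and the class name, field names, and all numbers are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class OffloadEstimate:
    """Hypothetical additive cost model for offloading one code region."""
    cpu_time_s: float            # measured time of the region on the CPU
    gpu_kernel_time_s: float     # projected kernel time on the GPU
    bytes_transferred: float     # total bytes moved between host and device
    link_bandwidth_bps: float    # host-device link bandwidth, bytes per second
    launch_overhead_s: float     # task initialization overhead per offload

    def transfer_time_s(self) -> float:
        # The "data transfer tax": time spent only moving data.
        return self.bytes_transferred / self.link_bandwidth_bps

    def offload_time_s(self) -> float:
        # Assumes transfers and compute do not overlap (a pessimistic choice).
        return (self.launch_overhead_s
                + self.transfer_time_s()
                + self.gpu_kernel_time_s)

    def speedup(self) -> float:
        # Values above 1.0 mean the offloaded region is projected to be faster.
        return self.cpu_time_s / self.offload_time_s()


# Same kernel, two data sizes: the transfer tax decides the outcome.
small = OffloadEstimate(0.100, 0.010, 100e6, 10e9, 0.001)
large = OffloadEstimate(0.100, 0.010, 1e9, 10e9, 0.001)

print(f"small transfer: speedup {small.speedup():.2f}")
print(f"large transfer: speedup {large.speedup():.2f}")
```

Under these made-up numbers, the region with the smaller transfer is projected to gain from offload, while the same kernel with ten times more data to move is projected to run slower than on the CPU. Real measurements from a profiler replace every input here.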

This presentation introduces two new GPU performance analysis types in Intel® VTune™ Profiler and a methodology for profiling heterogeneous applications that these analyses support. Intel VTune Profiler is an established tool for performance characterization on CPUs. It includes GPU offload analysis and GPU hot spot analysis for applications written with most offloading models: OpenCL code, SYCL* (Data Parallel C++), and OpenMP* Offload.

 

Vladimir Tsymbal

Senior technical consulting engineer, Intel Corporation

Vladimir specializes in teaching customers how to use various Intel® Software Development Tools to develop, tune, and optimize their parallel applications on Intel® architecture. In particular, his focus is on the Intel® Parallel Studio XE product suite and the analysis tools it contains, including Intel VTune Profiler (which he helped develop), Intel® Advisor, and Intel® Inspector.

Prior to joining Intel in 2005, Vladimir worked as a research assistant, and developed hardware graphics accelerators and software and hardware systems for medical diagnostics. He holds a PhD in mathematics and computer science from Taganrog State University of Radio Engineering, Russia.

You May Also Like
 

Intel® VTune™ Profiler

Find and fix performance bottlenecks and optimize application and system performance and system configuration for HPC, cloud, IoT, media, storage, and more.

 

Related Article

Optimize LLVM* Code Generation for Data Analytics Using Vectorization
