跳转至主要内容
英特尔标志 - 返回主页
我的工具

选择您的语言

  • Bahasa Indonesia
  • Deutsch
  • English
  • Español
  • Français
  • Português
  • Tiếng Việt
  • ไทย
  • 한국어
  • 日本語
  • 简体中文
  • 繁體中文
登录 以访问受限制的内容

使用 Intel.com 搜索

您可以通过多种方式轻松搜索整个 Intel.com 站点。

  • 品牌: 酷睿 i9
  • 文件号: 123456
  • Code Name: Emerald Rapids
  • 特殊作符: “Ice Lake”, Ice AND Lake, Ice OR Lake, Ice*

快速链接

您也可以尝试使用以下快速链接查看最受欢迎搜索的结果。

  • 产品信息
  • 支持
  • 驱动程序和软件

最近搜索

登录 以访问受限制的内容

高级搜索

仅搜索

Sign in to access restricted content.

不建议本网站使用您正在使用的浏览器版本。
请考虑通过单击以下链接之一升级到最新版本的浏览器。

  • Safari
  • Chrome
  • Edge
  • Firefox

Intel® VTune™ Profiler

Find and Fix Performance Bottlenecks Quickly and Realize All the Value of Your Hardware

    

  • Overview
  • Download
  • Documentation & Resources

Performance Analysis for Applications & Systems

Intel® VTune™ Profiler optimizes application performance, system performance, and system configuration for AI, HPC, cloud, IoT, media, storage, and more.

  • CPU, GPU, and NPU: Tune the entire application’s performance―not just the accelerated portion.
  • Multilingual: Profile SYCL*, C, C++, C#, Fortran, OpenCL™ code, Python*, Google Go* programming language, Java*, .NET, Assembly, or any combination of languages.
  • System or Application: Get coarse-grained system data for an extended period or detailed results mapped to source code.
  • Power: Optimize performance while avoiding power- and thermal-related throttling.
Download as Part of the Toolkit

Intel VTune Profiler is included in the Intel® oneAPI Base Toolkit, which is a core set of tools and libraries for developing high-performance, data-centric applications across diverse architectures.

Get It Now
Download the Stand-Alone Version

A stand-alone download of Intel VTune Profiler is available. You can download binaries from Intel or choose your preferred repository.

Download

      

Features

Algorithm Optimization

  • Locate hot spots—the most time-consuming parts of your code.
  • Visualize hot code paths and time spent in each function and with its callees with Flame Graph.

Analyze Hot Code Paths

Analyze Hot Spots

 

Microarchitecture and Memory Bottlenecks

  • Identify the most significant hardware issues that affect the performance of your application with microarchitecture exploration analysis.
  • Pinpoint memory-access-related issues such as cache misses and high-bandwidth problems.

Code-Tuning Methods for Intel CPU Microarchitecture

Profile a Memory-Bound Application

Accelerators and XPUs

  • Optimize GPU offload schema and data transfers for SYCL, OpenCL code, Microsoft DirectX*, or OpenMP* offload code. Identify the most time-consuming GPU kernels for further optimization.
  • Analyze GPU-bound code for performance bottlenecks caused by microarchitectural constraints or inefficient kernel algorithms.
  • Understand how much data is transferred between a neural processing unit (NPU) and DDR memory and identify the most time-consuming tasks running on the NPU.

Optimize Software for Intel GPUs

Profile OpenMP Offload Code on a GPU

显示更多 显示较少

Parallelism

  • Examine how efficiently the code is threaded. Identify threading issues that impact performance.
  • Evaluate compute-intensive or throughput HPC applications for efficient CPU use, vectorization, and memory use.

Method for OpenMP Code Analysis

Schedule Overhead in Intel® oneAPI Threading Building Blocks (oneTBB) Applications

Platform and I/O

  • Locate performance bottlenecks in I/O-intensive applications. Explore how effectively the hardware processes I/O traffic generated by external PCIe* devices or integrated accelerators.
  • Get a fine-grained overview for short-running workloads with System Overview.

Effective Use of Intel® Data Direct I/O Technology

Multi-Node

  • Characterize performance aspects of large-scale message passing interface (MPI) and OpenMP workloads.
  • Identify scalability issues and get recommendations for in-depth analysis.

Profile MPI Applications

显示更多 显示较少

What's New in 2025.1

  • Identify performance bottlenecks of AI workloads that are calling DirectML or Windows* Machine Learning (WinML) APIs.
  • Understand the overall accelerator performance by seeing GPU and NPU offload bottlenecks in one view.
  • Pinpoint the most time-consuming code sections and critical code paths for Python 3.12.


For a more complete and up-to-date list, see the release notes. 

Get Started

Download

Get Intel VTune Profiler as a stand-alone tool or as part of the Intel oneAPI Base Toolkit.

Get Intel VTune Profiler Only

Get the Intel oneAPI Base Toolkit

System Requirements

Try It Out

Get started with Intel VTune Profiler and use an introductory code sample to see how it works.

Get Started Guide

Learn Analysis Techniques

Use these learning tools and workflows to understand and analyze performance bottlenecks in your application.

Tutorials and Videos

Intel VTune Profiler Cookbook

Profiling GPUs

Profile Machine Learning Applications

Profile OpenVINO™ Toolkit Applications

GPU Optimization Workflow

Workflow to Offload and Optimize OpenMP Applications 

显示更多 显示较少

What Customers Are Saying

"Ensuring the best possible performance of systems for our users is a top priority for us. Intel VTune Amplifier helps us do that with effective workload management."

— Dennis O’Connell, senior director of performance engineering, Verizon*

Optimize Application Performance with Powerful Profiling

"Intel VTune Profiler is an invaluable tool for identifying hotspots when optimizing code. Its user interface is easy to use and informative, quickening the pace of development. Without access to Intel VTune's line-by-line performance counters, we would never have been able to identify the reasons why our mixed-precision code was running slower than our original double-precision code."

— Dr. Perri Needham, postdoctoral researcher, Walker Molecular Dynamic Laboratory

"We recommend using Intel® MPI for best performance and tools such as Intel VTune Profiler and Intel® Advisor to help better understand performance optimizations and how to best migrate your workloads to the cloud."

— Ilias Katsardis, HPC solution lead, Google Cloud*

"Intel VTune Profiler [helped us] to analyze code performance and further enhance it to run optimally on our products."

— Won-Chul Bang, PhD, vice president and head of product strategy, Samsung Medison*

"The Application Performance Snapshot feature of Intel VTune Profiler helped us analyze HemeLB running at 96K MPI ranks on SuperMUC-NG of the Leibniz Supercomputing Centre. It was straightforward and effective in its operation and analysis output."

— Dr. Jon McCullough, University College London

"We are always looking for new methods to accelerate workloads in our data center. Our teams used Intel VTune Profiler’s Flame Graph feature and found it intuitive to use and practical for interpreting performance data. This tool [part of the Intel oneAPI Base Toolkit] has become essential to optimizing code and workflows, and its ability to work across Intel CPUs and GPUs adds to our productivity and performance optimization efforts."

— Dr. Markus Rampp, head of HPC Applications Division and deputy director, Max Planck Computing & Data Facility

"We rely super heavily on Intel VTune Profiler and some of the other Intel products that are our primary way to understand performance at very large scale."

— Dan Stanzione, executive director, Texas Advanced Computing Center (TACC)

"Intel® Advanced Vector Extensions 512 (Intel® AVX-512) and Vector Neural Network Instructions (VNNI) acceleration techniques and advanced debugging and profiling capabilities of Intel VTune Profiler helped Netflix* optimize and boost performance in a variety of use cases such as video encoding, microservices latency and throughput improvements, and accelerating machine learning inference tasks."
– Amer Ather, senior cloud architect, Netflix

显示更多 显示较少

Case Studies

Specifications

Processor:
  • Intel® Xeon® processor family (based on formerly code named Ice Lake)
  • 3rd generation Intel® Xeon® Scalable processor family (or later)
  • 10th generation Intel® Core™ processor (or later)
GPUs:
  • Intel® UHD Graphics for 11th generation Intel processors or newer
  • Intel® Iris® Xe graphics
  • Intel® Arc™ graphics
  • Intel® Server GPU
  • Intel® Data Center GPU Flex Series
  • Intel® Data Center GPU Max Series
Languages:
  • SYCL
  • C and C++
  • C#
  • Fortran
  • OpenCL code
  • Google Go programming language
  • Java
  • Python
  • .NET
Development environments:
  • Windows*: Microsoft Visual Studio*, Visual Studio Code
  • Linux*: Eclipse*
  • Virtual machine support: Kernel-based virtual machine (KVM), Hyper-V*, VMware*
  • Container support: Docker*, Singularity*, LXC, Apache Mesos*
  • Interface: Desktop or web GUI, command line

For more information, see the system requirements.

Host operating systems:
  • Windows
  • Linux
Target operating systems:
  • Windows
  • Linux
  • FreeBSD*
Compilers:
  • Intel® compilers
  • Microsoft* compilers
  • GNU Compiler Collection (GCC)*
Threading analysis:
  • OpenMP
  • Intel® oneAPI Threading Building Blocks
  • Native threads
Distributed environments:
  • MPI (MPICH-based, OpenMPI)

Get Help

Your success is our success. Access these support resources when you need assistance.

  • Intel VTune Profiler Forum
  • General oneAPI Support

Related Tools

Intel® Advisor

This design and analysis tool achieves high application performance through efficient threading, vectorization, memory use, and GPU offload on current and future Intel hardware. It supports C, C++, Fortran, DPC++, OpenMP, and Python.

  • Offload Advisor: Get your code ready for efficient GPU offload even before you have the hardware
  • Automated Roofline Analysis: See performance headroom against hardware limitations and get insights for an effective optimization roadmap
  • Vectorization Advisor: Enable more vector parallelism and get guidance to improve its efficiency
  • Threading Advisor: Model, tune, and test threading design options
  • 公司信息
  • 英特尔资本
  • 企业责任部
  • 投资者关系
  • 联系我们
  • 新闻发布室
  • 网站地图
  • 招贤纳士 (英文)
  • © 英特尔公司
  • 沪 ICP 备 18006294 号-1
  • 使用条款
  • *商标
  • Cookie
  • 隐私条款
  • 请勿分享我的个人信息 California Consumer Privacy Act (CCPA) Opt-Out Icon

英特尔技术可能需要支持的硬件、软件或服务激活。// 没有任何产品或组件能够做到绝对安全。// 您的成本和结果可能会有所不同。// 性能因用途、配置和其他因素而异。请访问 intel.cn/performanceindex 了解更多信息。// 请参阅我们的完整法律声明和免责声明。// 英特尔致力于尊重人权,并避免成为侵犯人权行为的同谋。请参阅英特尔的《全球人权原则》。英特尔产品和软件仅可用于不会导致或有助于任何国际公认的侵犯人权行为的应用。

英特尔页脚标志