跳转至主要内容
英特尔标志 - 返回主页
我的工具

选择您的语言

  • Bahasa Indonesia
  • Deutsch
  • English
  • Español
  • Français
  • Português
  • Tiếng Việt
  • ไทย
  • 한국어
  • 日本語
  • 简体中文
  • 繁體中文
登录 以访问受限制的内容

使用 Intel.com 搜索

您可以使用几种方式轻松搜索整个 Intel.com 网站。

  • 品牌名称: 酷睿 i9
  • 文件号: 123456
  • Code Name: Emerald Rapids
  • 特殊操作符: “Ice Lake”、Ice AND Lake、Ice OR Lake、Ice*

快速链接

您也可以尝试使用以下快速链接查看最受欢迎搜索的结果。

  • 产品信息
  • 支持
  • 驱动程序和软件

最近搜索

登录 以访问受限制的内容

高级搜索

仅搜索

Sign in to access restricted content.

不建议本网站使用您正在使用的浏览器版本。
请考虑通过单击以下链接之一升级到最新版本的浏览器。

  • Safari
  • Chrome
  • Edge
  • Firefox

Machine Learning Using oneAPI

Learn how to accelerate machine learning workloads using packages like scikit-learn*,
XGBoost, NumPy, SciPy, and pandas—all powered by oneAPI.
Realize performance gains with a few extra lines of code on the latest Intel® CPUs and GPUs.
Use stock versions or Intel® Extension for Scikit-learn*, which is part of AI Tools from Intel.

Overview

AI Tools give data scientists, AI developers, and researchers familiar Python* tools and frameworks to accelerate end-to-end data science and analytics pipelines on Intel architecture. The components are built using oneAPI libraries for low-level compute optimizations. AI Tools maximize performance from preprocessing through machine learning and provides interoperability for efficient model development.

This learning path enables you to:

  • Achieve drop-in acceleration for data preprocessing and machine learning workflows with compute-intensive Python packages, scikit-learn*, and XGBoost, optimized for Intel.
  • Gain direct access to analytics and AI optimizations from Intel to ensure that your software works together seamlessly.

Who is this for?

Data scientists, data engineers, and software developers who want to learn how to accelerate machine learning workloads.

What will I be able to do?

  • Adapt common scikit-learn algorithms to offload computation to accelerator devices like CPUs and GPUs.
  • Apply and describe how to engage XGBoost, powered by oneAPI.
  • Analyze Python code to find low-performing Python loops and list comprehensions. Replace these slow methods with faster vectorized equivalents that are more readable, more performant, and easier to adapt to new Intel innovations in libraries and hardware instruction sets.

Modules

Machine Learning Using a Notebook

Use a Jupyter Notebook to modify and run code as part of the exercises.

To begin, open the file: TeacherKit.ipynb. This is a hyperlink-driven course for the following modules.

Program Structure

  • Describe the dataset and algorithms used.
  • Use follow-me instructions in select cells in hands-on lab exercises to learn the basics.
  • Secure your knowledge with practicums without the follow-me instructions.

Intel® Extension for Scikit-learn* for CPUs

  • Describe the basics of AI Tools components and where the Intel Extension for Scikit-learn fits in the broader package.
  • Describe where to download and how to install the tools.
  • Describe the advantages of one component in AI Tools and Intel Extension for Scikit-learn that is invoked via the sklearnex library.
  • Apply the patch and unpatch functions with varying granularities to Python scripts and within Jupyter cells, from whole-file applications to more surgical patches applied to a single algorithm.
  • List the optimized scikit-learn algorithms.

Applied Patching for a CPU

  • Build a scikit-learn implementation of K-means that targets a CPU using patching.
  • Apply patching:
    • With dynamic versus lexical scope approaches
    • To the support vector classifier (SVC) algorithm
  • Describe and apply the correct surgical patching method to patch pairwise_distance.
  • Recall that Intel Extension for Scikit-learn does not optimize the Euclidean metric, but that it does optimize the metrics cosine and correlation.
  • Describe the application of pairwise_distance to the problem of finding all-time series charts that are similar to a chosen pattern.
  • Solidify your knowledge:
    • Apply code changes to try different classifiers optimized with Intel Extension for Scikit-learn.
    • Use the target decision tree for replacement by a classifier (x2) of your choice.
    • Apply patching to the Principal Components Analysis (PCA) and K-means.
    • Synthesize your learning by applying patching to the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm

Image Clustering for CPUs

  • Perform image clustering using PCA, K-means, and DBSCAN:
    • Examine the following Jupyter Notebook to identify opportunities to apply patching_sklearn() to algorithms applied to tabular data.
    • Explore and interpret the image dataset.
    • Apply Intel Extension for Scikit-learn patches to PCA, K-means, and DBSCAN algorithms.
    • Synthesize your understanding by searching for ways to patch or unpatch any applicable cell to maximize the performance of each cell.

Galaxy Classification for a CPU

  • Apply multiple classification algorithms on the CPU to determine the most accurate model for classifying the stars that belong to each galaxy within a combined super galaxy.
  • Apply an Intel Extension for Scikit-learn patch and SYCL context to compute on the CPU.
  • Synthesize your comprehension by searching for opportunities in each cell to maximize performance.
  • Investigate adding pairwise distance as a means for all the stars that are within three light years distance.

Introduction to Using a GPU with Intel Extension for Scikit-learn

This current notebook is not intended to demonstrate performance but rather the functionality of how to target current and future Intel GPUs with scikit-learn algorithms powered by oneAPI.

  • Learn how to apply patching while targeting an Intel GPU.
  • Apply Intel Extension for Scikit-learn to a Random Forest classifier on an Intel GPU.
  • Describe how to apply data parallel control (dpctl) compute follows data with patching.
  • Use the compute follows data methodology using the dpctl library from Intel to target an Intel GPU.
  • Apply dpctl and patching to a variety of scikit-learn algorithms in a simple test harness structure.

Image Clustering for a GPU

  • Explore and interpret the image dataset.
  • Apply Intel Extension for Scikit-learn patches to Principal Components Analysis (PCA), K-means, and DBSCAN algorithms.
  • Solidify your understanding by searching for ways to patch or unpatch any applicable cells to maximize the performance of each cell.
  • Apply a q.sh script to submit a job.

 

Galaxy Classification for a GPU

  • Apply multiple classification algorithms with a GPU to classify stars belonging to each galaxy within a combined super galaxy to determine the most accurate model.
  • Apply an Intel Extension for Scikit-learn patch and SYCL context to compute on an available GPU resource.
  • Synthesize your comprehension by searching for opportunities in each cell to maximize performance.
  • Investigate adding pairwise distance as a means for all the stars that are within three light years distance.

Introduction to NumPy Powered by oneAPI

  • Describe why replacing inefficient code (such as time-consuming loops) wastes resources and time.
  • Describe why using Python for highly repetitive small tasks is inefficient.
  • Describe the additive value of using packages such as NumPy, which are powered by oneAPI in a cloud.
  • Describe the importance of keeping oneAPI and a third-party package such as NumPy or SciPy.
  • Describe and apply NumPy universal functions (ufuncs), aggregations, and broadcasting.
  • Apply NumPy Where or Select clauses to conditional loops in a fast vectorized way.
  • Describe several domain areas spanned by SciPy.
  • Apply the SciPy Floyd's algorithm to accelerate an all-pairs shortest-path task.
  • Apply and compare various methods of accelerating matrix multiplication, including NumPy broadcasting, NumPy dot, NumPy matrix multiplication, and SciPy linear algebra.
  • Apply and compare various methods of accelerating pairwise distances.
  • Describe where Intel Extension for Scikit-learn applies to the subset of metrics.
  • Overview
  • Modules
  • 公司信息
  • 英特尔资本
  • 企业责任部
  • 投资者关系
  • 联系我们
  • 新闻发布室
  • 网站地图
  • 招贤纳士 (英文)
  • © 英特尔公司
  • 沪 ICP 备 18006294 号-1
  • 使用条款
  • *商标
  • Cookie
  • 隐私条款
  • 请勿分享我的个人信息 California Consumer Privacy Act (CCPA) Opt-Out Icon

英特尔技术可能需要支持的硬件、软件或服务激活。// 没有任何产品或组件能够做到绝对安全。// 您的成本和结果可能会有所不同。// 性能因用途、配置和其他因素而异。请访问 intel.cn/performanceindex 了解更多信息。// 请参阅我们的完整法律声明和免责声明。// 英特尔致力于尊重人权,并避免成为侵犯人权行为的同谋。请参阅英特尔的《全球人权原则》。英特尔产品和软件仅可用于不会导致或有助于任何国际公认的侵犯人权行为的应用。

英特尔页脚标志