跳转至主要内容
英特尔标志 - 返回主页
我的工具

选择您的语言

  • Bahasa Indonesia
  • Deutsch
  • English
  • Español
  • Français
  • Português
  • Tiếng Việt
  • ไทย
  • 한국어
  • 日本語
  • 简体中文
  • 繁體中文
登录 以访问受限制的内容

使用 Intel.com 搜索

您可以使用几种方式轻松搜索整个 Intel.com 网站。

  • 品牌名称: 酷睿 i9
  • 文件号: 123456
  • Code Name: Emerald Rapids
  • 特殊操作符: “Ice Lake”、Ice AND Lake、Ice OR Lake、Ice*

快速链接

您也可以尝试使用以下快速链接查看最受欢迎搜索的结果。

  • 产品信息
  • 支持
  • 驱动程序和软件

最近搜索

登录 以访问受限制的内容

高级搜索

仅搜索

Sign in to access restricted content.

不建议本网站使用您正在使用的浏览器版本。
请考虑通过单击以下链接之一升级到最新版本的浏览器。

  • Safari
  • Chrome
  • Edge
  • Firefox

Natural Language Processing

Summary

This course provides an overview of natural language processing (NLP) on modern Intel® architecture. Topics include:

  • How to manipulate text for language models
  • Text generation and topic modeling
  • The basics of machine learning through more advanced concepts

By the end of this course, students will have practical knowledge of:

  • Application of string preprocessing techniques
  • How to apply machine learning algorithms for text classification and other language tasks

The course is structured around eight weeks of lectures and exercises. Each week requires three hours to complete.

Prerequisites

Python* programming

Calculus

Linear algebra

Week 1

This class introduces the uses and history of NLP. Topics include: 

  • The history of NLP and how it is used in the industry today
  • How to parse strings using powerful regular expression tools in Python
Download
Week 2

This class teaches how to use NLP toolkits and preprocessing techniques. Topics include:

  • Explore techniques such as tokenization, stop-word removal, and punctuation manipulation
  • Implement such techniques using Python libraries such as NLTK, TextBlob, spaCy, and Gensim
Download
Week 3

This class introduces how to measure similarity between words. Learn more about:

  • Levenshtein distance, which is used to compare the similarity of two words
  • How computers encode pieces of text into a document-term matrix and what the bag of words assumption is
Download
Week 4

This class shows how machine learning is used for basic text classification. Topics include:

  • The basics of machine learning and a refresher on the terminology
  • A typical machine learning workflow for two different machine learning approaches to classify emails as either spam or not spam
Download
Week 5

This class teaches an algorithm for natural language understanding and topic modeling. Learn more about:

  • How to use the latent Dirichlet allocation algorithm to extract topics from the document-term matrices
Download
Week 6

This class continues to teach how to model and extract topics in text. Learn more about:

  • Alternative algorithms for discovering the topics embedded in texts
Download
Week 7

This week teaches machine learning algorithms for NLP. Topics include:

  • How to use a neural network to transform words into vectors
  • Potential applications of these vectors (such as text classification and information retrieval)
Download
Week 8

Continuing with the topic of machine learning, this class teaches more about applying neural networks. Topics include:

  • Text generation using Markov chains and recurrent neural networks
  • Advanced topics in NLP, such as seq2seq
Download
  • 公司信息
  • 英特尔资本
  • 企业责任部
  • 投资者关系
  • 联系我们
  • 新闻发布室
  • 网站地图
  • 招贤纳士 (英文)
  • © 英特尔公司
  • 沪 ICP 备 18006294 号-1
  • 使用条款
  • *商标
  • Cookie
  • 隐私条款
  • 请勿分享我的个人信息 California Consumer Privacy Act (CCPA) Opt-Out Icon

英特尔技术可能需要支持的硬件、软件或服务激活。// 没有任何产品或组件能够做到绝对安全。// 您的成本和结果可能会有所不同。// 性能因用途、配置和其他因素而异。请访问 intel.cn/performanceindex 了解更多信息。// 请参阅我们的完整法律声明和免责声明。// 英特尔致力于尊重人权,并避免成为侵犯人权行为的同谋。请参阅英特尔的《全球人权原则》。英特尔产品和软件仅可用于不会导致或有助于任何国际公认的侵犯人权行为的应用。

英特尔页脚标志