Alibaba Cloud: POLARDB*, A Cloud-Hosted Database

Software-defined storage powered by Intel® Optane™ SSDs for data centers improves performance and reduces latency.

Relational database management systems (RDBMSs) are the foundation of enterprises worldwide. They are also a prerequisite for company operations and digital transformation.

An on-premises RDBMS, though, can require significant investment in infrastructure, time, cost, and personnel to keep modernized, scale to meet business growth, and ensure it’s capable of supporting digital transformation. One solution to these obstacles is cloud computing, which has evolved into a mainstream technology used by organizations of all sizes.

Relational databases are widely available as cloud-hosted services, known as database-as-a-service (DBaaS) solutions. But a cloud-based RDBMS can present its own challenges, so cloud-hosted relational database solutions themselves need to evolve. Popular databases include MySQL* and Oracle Database*, but whether they are hosted on-premises or managed as a service in the cloud, they can be difficult to secure, manage, and scale, and they might not deliver the needed performance or quality of service (QoS).

Alibaba Cloud has developed an innovative, commercial-grade, cloud-hosted relational database, POLARDB*, that addresses these issues and offers enterprises a new DBaaS model. At the foundation of POLARDB is a distributed shared-storage architecture, which was engineered to use Intel® Optane™ Solid State Drives (SSDs) and Intel® 3D NAND SSDs.

Traditional Challenges of Cloud-Hosted Databases
In a public cloud environment, growth in the number of users, user services, and data volumes can result in issues with backups, performance, migration, upgrades, disk capacity, and delays caused by the binary log. The time needed for scaling, backing up, and migrating data increases as the data volume grows. It can take hours, or even days, to back up terabytes of data. Upgrades, scaling, backing up, and data migration can require that a database be taken offline and restarted, which can impact the business, its users, and its IT personnel. Migrating data can slow the performance of mission-critical applications. Maintaining everyday business operations can require significant CPU resources, which can cause delays and reduce price/performance when compounded over millions of transactions.
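A rough estimate shows why backup time grows into hours or days. The sketch below simply divides data volume by sustained throughput. The function name and the 200 MB/s rate are illustrative assumptions, not figures from Alibaba Cloud or Intel.

```python
def backup_hours(data_tb: float, throughput_mb_per_s: float) -> float:
    """Estimate the hours needed to copy `data_tb` terabytes.

    Uses decimal units (1 TB = 1,000,000 MB). The throughput is an
    assumption supplied by the caller, not a measured value.
    """
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    seconds = data_tb * 1_000_000 / throughput_mb_per_s
    return seconds / 3600


if __name__ == "__main__":
    # 200 MB/s is a hypothetical sustained backup rate.
    for size_tb in (1, 10, 100):
        hours = backup_hours(size_tb, 200)
        print(f"{size_tb:>3} TB at 200 MB/s: {hours:.1f} hours")
```

At this assumed rate, the estimate scales linearly with data volume, so a tenfold increase in data means a tenfold longer backup window unless throughput grows with it.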

Intel Optane SSDs power a 30-percent improvement in queries per second (QPS), for an improved customer experience with POLARDB;¹ a 76-percent improvement in 95th-percentile latency with POLARDB;¹ and 100 TB of capacity for each POLARDB database instance.

POLARDB Redefines Hosting a Database in the Cloud
POLARDB builds on the existing DBaaS model. It uses a software-defined scale-out system that incorporates fast, low-latency Intel Optane SSDs and low-cost, high-capacity Intel 3D NAND SSDs. Enterprises that have large, growing volumes of data and that require consistent low latency, high throughput, and high QoS, combined with a good price/performance ratio, can benefit from POLARDB.

The storage paradigm of POLARDB, combined with other innovations, delivers the performance and availability of a commercial-grade database in a DBaaS model. Alibaba Cloud engineered POLARDB with a focus on business customers’ on-cloud costs, online transaction processing (OLTP) performance, business continuity, business growth, and security. All of the hardware and software, including the underlying Intel hardware, are engineered to work together in a cloud-native design developed for high reliability and high availability. The architecture provides high performance at roughly one-tenth of the cost of a traditional on-premises relational database.²

The architecture of POLARDB enables it to deliver up to six times the throughput of standard MySQL 5.6 and 5.7 when MySQL is run on NAND-based Peripheral Component Interconnect Express* (PCIe*) and NVM Express* (NVMe*) SSDs. POLARDB also delivers throughput of up to one million input/output operations per second (IOPS) with latency of just milliseconds.² POLARDB can scale up or down, or out or in, on demand in less than 10 minutes. It supports a capacity of up to 100 TB per database instance, compared to just 64 TB for similar solutions. Storage capacity and clones can scale automatically. And POLARDB is 100-percent backward compatible with MySQL 5.6 and 5.7.

Intel Optane SSDs Optimize Storage Performance
The POLARDB distributed storage design with Intel Optane SSDs and Intel 3D NAND SSDs provides excellent storage efficiency, high QoS, high IOPS, high throughput, and consistent performance. Intel® SSDs also help balance cost and performance. Intel Optane SSDs help Alibaba Cloud eliminate data center storage bottlenecks, accommodate bigger, more affordable datasets, accelerate applications, reduce transaction costs for latency-sensitive workloads, and improve the overall data center total cost of ownership (TCO) for POLARDB.

By combining high-speed, low-latency Intel Optane SSDs with low-cost, high-capacity Intel 3D NAND SSDs at the software layer, POLARDB implements a hybrid storage layer that can deliver consistent low latency, high throughput, high QoS, and a better overall price/performance ratio than a storage layer that uses only Intel 3D NAND SSDs.

With Intel Optane SSDs and Intel 3D NAND SSDs, Alibaba Cloud has realized:

  • A 30-percent improvement in queries per second (QPS), for an improved customer experience with POLARDB¹
  • A 76-percent improvement in 95th-percentile latency with POLARDB¹
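
Both metrics in the list above can be derived from raw benchmark samples. The following is a minimal sketch, assuming the nearest-rank definition of a percentile (the source does not state which definition the tests used). The function names are hypothetical, and any sample values passed in would be the reader's own measurements.

```python
import math


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile of samples using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(samples)
    # 1-based rank of the smallest value that covers pct percent of samples.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def percent_improvement(baseline, new, higher_is_better=True):
    """Relative change versus baseline, positive when the new value is better.

    Use higher_is_better=True for QPS and False for latency.
    """
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    change = (new - baseline) * 100 / baseline
    return change if higher_is_better else -change
```

For QPS, a higher value is better, so the improvement is the relative increase. For 95th-percentile latency, a lower value is better, so the improvement is the relative decrease between the two runs.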

The POLARDB Architecture
The software-defined POLARDB architecture uses distributed storage, the Storage Performance Development Kit (SPDK), remote direct memory access (RDMA), and other innovations to create a high-performance, scalable, cloud-based database. The POLARDB architecture separates the compute and storage pools. When CPU resources or memory are insufficient, the compute resource pool is expanded separately from the storage pool; when capacity or IOPS runs low, the storage resource pool scales independently of the compute pool.
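
The independent scaling described above can be pictured as two separate decisions, one per pool. This is a minimal sketch under stated assumptions: the `PoolMetrics` fields, the `pools_to_scale` function, and the 0.8 utilization threshold are hypothetical and are not part of any Alibaba Cloud API.

```python
from dataclasses import dataclass
from typing import Set


@dataclass
class PoolMetrics:
    # All values are utilization fractions between 0.0 and 1.0 (assumed units).
    cpu: float        # compute pool
    memory: float     # compute pool
    capacity: float   # storage pool
    iops: float       # storage pool


def pools_to_scale(metrics: PoolMetrics, threshold: float = 0.8) -> Set[str]:
    """Return the pools that need more resources.

    Each pool is evaluated on its own signals, so a compute shortage never
    forces a storage expansion, and vice versa.
    """
    pools = set()
    if metrics.cpu > threshold or metrics.memory > threshold:
        pools.add("compute")
    if metrics.capacity > threshold or metrics.iops > threshold:
        pools.add("storage")
    return pools
```

With coupled compute and storage, either shortage would require adding a whole server. Keeping the two checks separate is what lets each pool grow only when its own resources run short.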

The POLARDB distributed storage architecture uses three replicas, or nodes, instead of a local store. One replica handles reads and writes, and the other two are read-only. When a new read-only instance is needed, no data needs to be copied; no matter how large the data volume is, the instance can be added within five minutes, compared to the 70 hours it can take to add an instance for a traditional database with more than 3 TB of data.¹ The primary instance and the read-only instances share the same storage resources, which increases read-only copy performance at no additional cost for new storage. Users pay only for the CPU and memory consumed by the read-only instances. POLARDB uses this distributed storage configuration to locate data and scale faster than a traditional database.
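
The reason a read-only instance can be added without copying data is that every instance attaches to the same storage. The toy model below illustrates that idea only; the class names are hypothetical, and the sketch does not represent POLARDB's actual implementation or its storage replication.

```python
class SharedStorage:
    """Stand-in for the shared storage layer: a single copy of the data."""

    def __init__(self):
        self.pages = {}

    def size(self):
        return len(self.pages)


class Instance:
    """A compute node that holds a reference to shared storage, not a copy."""

    def __init__(self, storage, writable):
        self.storage = storage
        self.writable = writable

    def write(self, key, value):
        if not self.writable:
            raise PermissionError("read-only instance cannot write")
        self.storage.pages[key] = value

    def read(self, key):
        return self.storage.pages[key]


class Cluster:
    """One read/write primary plus any number of read-only instances."""

    def __init__(self):
        self.storage = SharedStorage()
        self.primary = Instance(self.storage, writable=True)
        self.read_only = []

    def add_read_only(self):
        # No data is copied: the new instance attaches to the same storage,
        # so the cost of adding it does not depend on the data volume.
        instance = Instance(self.storage, writable=False)
        self.read_only.append(instance)
        return instance
```

In this model, a write made through the primary is immediately visible to every read-only instance, and adding an instance leaves the amount of stored data unchanged.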

Figure 1. The Alibaba Cloud POLARDB* architecture with Intel® SSDs.

Each POLARDB storage node uses one Intel® Optane™ SSD DC P4800X Series drive for fast journaling and indexing, alongside Intel SSD DC P4500 Series drives, which are used for data storage. The SPDK writes data to each Intel 3D NAND SSD’s NVMe interface without having to access synchronized memory. Before Alibaba Cloud implemented Intel Optane SSDs, journaling and data were collocated on NVMe-based Intel SSDs for the data center, which required accessing synchronized memory. Bypassing synchronized memory allows POLARDB to increase performance and reduce latency.
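
The split between a fast journal device and larger data devices can be sketched as a simple write path. The source states only which drive holds the journal and which drives hold the data. The acknowledge-then-flush behavior below is a common write-ahead journaling pattern assumed here for illustration, and the class and method names are hypothetical.

```python
class HybridStorageNode:
    """Toy storage node with a separate journal device and data device."""

    def __init__(self):
        self.journal = []  # stands in for the Intel Optane SSD (journal)
        self.data = {}     # stands in for the Intel 3D NAND SSDs (data)

    def write(self, key, value):
        """Append the write to the journal and acknowledge it.

        The data device is not touched on this path, so write latency
        depends only on the journal device.
        """
        self.journal.append((key, value))
        return True

    def flush(self):
        """Apply journaled writes to the data device in order, then truncate."""
        applied = len(self.journal)
        for key, value in self.journal:
            self.data[key] = value
        self.journal.clear()
        return applied

    def read(self, key):
        """Return the newest value, checking the journal before the data device."""
        for journaled_key, value in reversed(self.journal):
            if journaled_key == key:
                return value
        return self.data[key]
```

Under this assumption, the low-latency drive absorbs every incoming write, while the high-capacity drives receive data in the background, which is one way a hybrid layer can combine low latency with low cost per gigabyte.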

Further performance benefits with POLARDB result from:

  • Using the RDMA over Converged Ethernet (RoCE) network protocol to lower latency, reduce the CPU load, and increase bandwidth
  • Bypassing the Linux* kernel with a software-defined stack that enables POLARDB to run in user mode, which helps decrease overhead
  • Removing the need for the binary log, which reduces transaction times and input/output (I/O) overhead

Access a Commercial-Grade Database as a Service
A DBaaS implementation can free an enterprise IT team from managing databases in-house and from buying and maintaining the hardware that an on-premises database requires. The IT team can then focus on the core business needs that help the enterprise digitally transform and improve its competitive position. Where other DBaaS solutions can present challenges, POLARDB from Alibaba Cloud can help eliminate them with a commercial-grade DBaaS solution. POLARDB gives enterprises an on-demand solution that can help maintain a good price/performance ratio. As a commercial-grade database, it helps enterprises manage large volumes of data while enabling low latency, high throughput, high QoS, and rapid scaling.

Intel Optane SSDs and Intel® 3D NAND SSDs for POLARDB
POLARDB was originally developed to meet the needs of Alibaba Cloud’s online shopping center. Alibaba Cloud initially made use of Intel Optane SSDs and Intel 3D NAND SSDs for POLARDB for shopping center peak usage. Alibaba Cloud then realized a performance increase of up to six times for MySQL compared to running MySQL only on NAND-based PCIe NVMe SSDs.¹

Today, by using the Intel Optane SSD DC P4800X Series for the POLARDB offering, Alibaba Cloud makes use of SSDs that:

  • Are the most responsive data center SSDs available with Intel® Optane™ technology³
  • Offer a capacity of up to 750 GB
  • Have enabled increased performance for POLARDB

Using Intel 3D NAND SSDs, in addition to Intel Optane SSDs, brings economic data storage with outstanding quality, reliability, and advanced manageability and serviceability to minimize disruptions for POLARDB.

By working closely with customers like Alibaba Cloud to innovate databases, Intel is helping improve the enterprise database experience for customers worldwide.

About Alibaba Cloud
Alibaba Cloud is the cloud computing arm of Alibaba Group, the largest e-commerce business in China, and it is a rapidly growing cloud service provider. Alibaba Cloud delivers a range of global cloud computing services that power online businesses worldwide. The company also manages and maintains the e-commerce ecosystem for its parent company, Alibaba Group.

Alibaba Cloud’s global operations are registered and headquartered in Singapore. It also maintains offices in Dubai, Frankfurt, Hong Kong, London, New York, Paris, San Mateo, Seoul, Singapore, Sydney, and Tokyo.

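Notices and Disclaimers

Intel® technologies’ features and benefits depend on system configuration and may require enabled hardware, software, or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer, or learn more at www.intel.cn. // Software and workloads used in performance tests have been optimized for performance only on Intel® microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations, and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information, visit www.intel.cn/benchmarks. // Performance results are based on testing as of the date set forth in the configurations and may not reflect all publicly available security updates. See the configuration disclosure for details. No product or component can be absolutely secure. // Cost reduction scenarios described are intended only as examples of how a given Intel®-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. // Intel does not control or audit third-party benchmark data or the websites referenced in this document. You should visit the referenced websites and confirm whether the referenced data are accurate. // In some test cases, results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and are provided for informational purposes only. Any differences in your system hardware, software, or configuration may affect your actual performance.

Product and Performance Information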

¹ Testing completed by Alibaba Cloud on two 2-socket CPUs with 32 cores each, 256 GB DRAM, and dual 25 gigabit Ethernet (GbE) network interface controllers (NICs). 1 x Alibaba Cloud Elastic Compute Service* (ECS*), 1 x POLARDB* (1 x database and 3 x PolarStore* replica). Benchmark: Amazon Aurora* (sysbench*), thread = 1, iodepth = 1, 100 percent write, 4 TB database, request blocksize = 16 KB.
² Alibaba Cloud. “100TB Capacity and 6x Performance Improvement with Alibaba Cloud PolarDB.” April 2018. alibabacloud.com/blog/100tb-capacity-and-6x-performance-improvement-with-alibaba-cloud-polardb_578294.
³ Responsiveness defined as average read latency measured at queue depth 1 during a 4K random write workload. Measured using FIO 2.15*. Common configuration: 2 x Intel® Xeon® processor E5-2699 v4 (2.20 GHz, 22 cores), CentOS 7.2*, kernel 3.10.0-327.el7.x86_64, 396 GB DDR RAM at 2,133 MHz. Intel® drives evaluated: 375 GB Intel® Optane™ SSD DC P4800X Series and 1,600 GB Intel® SSD DC P3700 Series. Samsung* drives evaluated: Samsung SSD PM1725a*, Samsung SSD PM1725*, Samsung SSD PM963*, Samsung SSD PM953*. Micron* drive evaluated: Micron 9100 PCIe NVMe SSD*. Toshiba* drive evaluated: Toshiba ZD6300*. Test: queue depth 1, random read, 4K latency; queue depth 1, random read/write, 4K 70-percent read latency; queue depth 1, random write, 4K latency using FIO 2.15.