Home > Articles

Supporting Multiple Page Sizes in the Solaris Operating System

  • PrintPrint
  • Share ThisShare This
  • DiscussDiscuss
Close Window

Sun Microsystems

Learn more…

IPsec -- A Secure Deployment Option
Sep 24, 2004
Using pGINA to Authenticate Users in Microsoft Windows Environments
Aug 27, 2004
Best Practices for Deploying the Sun StorADE Utility
Aug 20, 2004
Performing Network Solaris Installations Without a Local Boot Server
Aug 13, 2004
Using Solaris Resource Manager With Sun Ray
Aug 6, 2004
N1 Grid Architecture Realized: Strategic Flexibility
Jul 16, 2004
Global Grid Connectivity Using Globus Toolkit With Solaris Operating System
Jun 25, 2004
Building a Bootable DVD to Deploy a Solaris Flash Archive
Jun 18, 2004
Building OpenSSH--Tools and Tradeoffs, Updated for OpenSSH 3.7.1p2
Jun 18, 2004
Maximizing the Performance a Gigabit Ethernet NIC Interface
Jun 18, 2004
Dynamic Reconfiguration for High-End Servers: Part 2--Implementation Phase
Jun 11, 2004
Supporting Multiple Page Sizes in the Solaris Operating System Appendix
Jun 11, 2004
Dynamic Reconfiguration for High-End Servers: Part 1 --- Planning Phase
Jun 4, 2004
Supporting Multiple Page Sizes in the Solaris Operating System
Jun 4, 2004
Data Center Best Practices for High-End Servers
May 28, 2004
Understanding Tuning TCP
May 28, 2004
Sun Ray Deployment On Shared Networks
Apr 30, 2004
LDAP Triggers: A Framework for Sun Java System Directory Server
Apr 23, 2004
Taming Your Emu to Improve Application Performance
Apr 23, 2004
Best Practices for Deploying the Sun StorADE Utility
Apr 16, 2004
Sun Fire 15K/12K Auto Diagnosis and Recovery
Apr 16, 2004
Dynamic Reconfiguration and Oracle 9i Dynamically Resizeable SGA
Apr 9, 2004
Solaris Operating System Availability Features
Apr 2, 2004
Design, Features, and Applicability of Solaris File Systems
Mar 26, 2004
Securing the Sun Fire 12K/15K System Controller
Mar 19, 2004
Securing the Sun Fire 12K/15K Domains
Mar 12, 2004
Enterprise Network Design Patterns: High Availability
Feb 20, 2004
Performance Forensics
Feb 13, 2004
Migrating to the Solaris Operating System: Migrating From Tru64 UNIX
Feb 6, 2004
Tuning ORACLE to Minimize Recovery Time: For Solaris Operating System on SPARC
Feb 6, 2004
Securing Linux Systems With Host-Based Firewalls Implemented With Linux iptables
Jan 30, 2004
Securing Web Applications through a Secure Reverse Proxy
Jan 30, 2004
Hardware Replication Challenges
Jan 23, 2004
Solaris Volume Manager Performance Best Practices
Jan 23, 2004
Sun Fire 6800/4810/4800/3800 Systems Auto Diagnosis and Recovery Enhancements
Jan 16, 2004
Responding to a Customer's Security Incidents, Part 4: Processing Incident Data
Jan 9, 2004
Desktop Architecture Selection Guide
Dec 31, 2003
Sun ONE Portal Server 6 Best Practices
Dec 23, 2003
Migrating to the Solaris Operating System: Migration Strategies
Oct 31, 2003
Responding to Customer's Security Incidents--Part 3: Following Up After an Incident
Oct 31, 2003
Minimizing Domains for Sun Fire V1280, 6800, 12K, and 15K Systems, Part II
Oct 24, 2003
Using the LDAP to NIS+ Gateway
Oct 24, 2003
Deploying the Solaris Operating Environment Using a Solaris Security Toolkit CD
Oct 17, 2003
Minimizing Domains for Sun Fire V1280, 6800, 12K, and 15K Systems, Part I
Oct 17, 2003
Building Secure Sun Fire Link Interconnect Networks Using Sun Fire 15K and Sun Fire 12K Servers
Sep 26, 2003
Linux Overview for Solaris Users
Sep 26, 2003
Securing Sun Linux Systems: Part II, Network Security
Sep 26, 2003
Sun Fire V1280/Netra 1280 Server Considerations for Improving RAS
Sep 26, 2003
Sun ONE Portal Server and Lotus iNotes Integration Recipe
Sep 26, 2003
Transition Guide--Upgrading From the iPlanet Directory Server 5.1 Software to the Sun ONE Directory Server 5.2 Software
Sep 26, 2003
Capacity Planning as a Performance Tuning Tool—Case Study for a Very Large Database Environment
Sep 19, 2003
Securing Sun Linux Systems: Part I, Local Access and File Systems
Sep 19, 2003
Sun Fire 15K/12K Server Preferred Practices
Sep 19, 2003
Sun Grid Engine, Enterprise Edition—Configuration Use Cases and Guidelines
Sep 19, 2003
The IT Utility Model—Part I
Sep 19, 2003
Using filesync for Disaster Recovery, Business Continuance, and Mobility
Sep 19, 2003
Role Based Access Control and Secure Shell—A Closer Look At Two Solaris Operating Environment Security Features
Sep 12, 2003
Solaris Operating Environment Network Settings for Security: Updated for Solaris 9 Operating Environment
Sep 12, 2003
Using NTP on the Sun Fire 15K/12K Server
Sep 12, 2003
Consolidation Methodology
Sep 5, 2003
Using the Sun ONE Application Server 7 to Enable Collaborative B2B Transactions
Sep 5, 2003
An Architecture for Creating and Managing Integrated Software Stacks
Aug 29, 2003
Auditing System Security
Aug 29, 2003
Integrating the Secure Shell Software
Aug 29, 2003
Sun Cluster 3.0 Series: Guide to Installation—Part 2
Aug 29, 2003
Sun ONE Portal Server and Microsoft Exchange Integration Cookbook
Aug 29, 2003
Building a Global Compute Grid - Two Examples Using the Sun ONE Grid Engine and the Globus Toolkit
Aug 22, 2003
Configuring the Secure Shell Software
Aug 22, 2003
Responding to Customer's Security Incidents—Part 2: Executing a Policy
Aug 22, 2003
Sun Cluster 3.0 Series: Guide to Installation—Part 1
Aug 22, 2003
Sun Fire 6800/4810/4800/3800 Auto Diagnosis and Recovey Features
Aug 22, 2003
Provisioning in Replicated, Mission-Critical Environments
Aug 15, 2003
Responding to Customer's Security Incidents, Part 1: Establishing Teams and a Policy
Aug 15, 2003
Securing the Sun Fire 12K and 15K System Controllers
Aug 15, 2003
Writing an Authentication Plug-in for a Sun ONE Directory Server
Aug 15, 2003
Securing the Sun Cluster 3.x Software
Aug 8, 2003
Securing the Sun Fire 12K and 15K Domains
Aug 8, 2003
Understanding Gigabit Ethernet Performance on Sun Fire Servers
Aug 8, 2003
Using Midframe Servers to Build Secure Sun Fire Link Interconnect Networks
Aug 8, 2003
BluePrint for Benchmarking Success
Aug 1, 2003
System Management Services Software: An Inside Look
Aug 1, 2003
A Patch Management Strategy for the Solaris Operating Environment
May 23, 2003
Building OpenSSH—Tools and Tradeoffs
May 23, 2003
Configuring Databases Using Soft Links
May 23, 2003
Managing Shared Storage in a Sun Cluster 3.0 Environment With Solaris Volume Manager Software
May 23, 2003
Modeling Sun Cluster Availability
May 23, 2003
Performance Oriented System Administration For Solaris
May 23, 2003
A Strategy for Managing Performance
Apr 18, 2003
Solaris Operating Environment Security: Updated for Solaris 9 Operating Environment
Apr 18, 2003
Trust Modeling for Security Architecture Development
Apr 18, 2003
Understanding Solaris 9 Operating Environment Directory Services
Apr 18, 2003
A New Open Resource Management Architecture in the Sun HPC ClusterTools Environment
Feb 21, 2003
Campus Clusters Based on Sun Cluster Software
Feb 14, 2003
Memory Hierarchy in Cache-Based Systems
Feb 14, 2003
Designing Highly Available Architectures: A Methodology
Feb 7, 2003
Internet Protocol Network Multipathing (Update)
Feb 7, 2003
Minimizing the Solaris Operating Environment for Security: Updated for Solaris 9 Operating Environment
Feb 7, 2003
Configuring Boot Disks With Solaris Volume Manager Software
Jan 24, 2003
Managing Data Centers With Sun Management Center Change Manager
Jan 24, 2003
SQL*Net Performance Tuning Using Underlying Network Protocols
Jan 24, 2003
Extending Authentication in the Solaris 9 Operating Environment Using Pluggable Authentication Modules (PAM): Part II
Jan 17, 2003
HPC Administration Tips and Techniques
Jan 17, 2003
Sun Fire Midframe Server Best Practices for Firmware Update 5.13.x
Jan 17, 2003
Extending Authentication in the Solaris 9 Operating Environment Using Pluggable Authentication Modules: Part I
Dec 27, 2002
Sun Fire Systems Design and Configuration Guide
Dec 27, 2002
Consolidation in the Data Center
Dec 20, 2002
Enterprise Network Design Patterns: High Availability
Dec 20, 2002
Introduction to the Solaris Cluster Grid - Part 2
Dec 20, 2002
Introduction to the Sun Cluster Grid, Part 1
Sep 26, 2002
Sun's Quality, Engineering, and Deployment (QED) Test Train Model
Sep 26, 2002
Customizing JumpStart Framework for Installation and Recovery
Sep 20, 2002
Sun StorEdge Instant Image 3.0 and Oracle8i Database Best Practices
Sep 20, 2002
Windows NT Server Consolidation and Performance Improvements with Solaris PC NetLink 2.0 Software
Sep 20, 2002
Sun ONE Portal Server 3.0 Rewriter Configuration and Management Guide
Sep 13, 2002
Securing the Sun Fire 12K and 15K Domains, Updated for SMS 1.2
Sep 6, 2002
Securing the Sun Fire 12K and 15K System Controllers, Updated for SMS 1.2
Sep 6, 2002
An Information Technology Management Reference Architecture Implementation
Aug 30, 2002
Reducing the Backup Window With Sun StorEdge Instant Image Software
Aug 30, 2002
An Information Technology Management Reference Architecture
Aug 16, 2002
Drill-Down Monitoring of Database Servers
Aug 16, 2002
LAN-Free Backups Using the Sun StorEdge Instant Image 3.0 Software
Aug 16, 2002
Network Storage Evaluations Using Reliability Calculations
Aug 16, 2002
Securing LDAP Through TLS/SSL: A Cookbook
Aug 16, 2002
Securing the Sun Fire Midframe System Controller
Aug 16, 2002
Deployment Considerations for Data Center Management Tools
Aug 9, 2002
Guide to Installation-Part II: Sun Cluster 3.0 Software Management Services
Aug 9, 2002
How Hackers Do It: Tricks, Tools, and Techniques
Aug 9, 2002
Metropolitan Area Sun Ray Services
Aug 9, 2002
Securing the Sun Cluster 3.0 Software
Aug 9, 2002
Guide to Installation, Part I: Sun Cluster Management Services
May 24, 2002
Service Level Agreement in the Solaris OE Data Center
May 24, 2002
Solaris OE Enterprise Management Systems Part I: Architectures and Standards
May 24, 2002
Solaris OE Storage Resource Management: A Practitioner's Approach
May 24, 2002
Sun Fire 3800-6800 Servers Dynamic Reconfiguration
May 24, 2002
Using Live Upgrade 2.0 With JumpStart Technology and Web Start Flash
May 24, 2002
Enterprise Quality of Service Part II: Enterprise Solution using Solaris Bandwidth Manager 1.6 Software
May 17, 2002
Introduction to SunTone Clustered Database Platforms
May 17, 2002
Securing the Sun Enterprise 10000 System Service Processors
May 17, 2002
Service Level Management in the Data Center
May 17, 2002
Solaris Application Performance Optimization
May 17, 2002
Using Live Upgrade 2.0 With a Logical Volume Manager
May 17, 2002
Establishing a Solaris OE Architectural Model
Apr 5, 2002
Configuring OpenSSH for the Solaris Operating Environment
Mar 22, 2002
Data Center Design Philosophy
Mar 22, 2002
Enterprise Quality of Service (QoS): Part I - Internals
Mar 22, 2002
Issues in Selecting a Job Management System
Mar 22, 2002
Managing Solaris Operating Environment Upgrades With Live Upgrade 2.0
Mar 22, 2002
Securing Sun Fire 15K Domains
Mar 22, 2002
Server Virtualization Using Trusted Solaris 8 Operating Environment
Mar 22, 2002
Sun Cluster 3.0 Implementation Guide: Hardware Setup
Mar 22, 2002

Sorry, this author hasn't posted any blogs.

The Solaris 9 Operating System contains a feature to enable the use of larger memory page sizes for the heap and stack segments of a program. The use of larger page sizes is often able to deliver significant performance gain for a large range of applications. This article explains how to engage the MPSS feature and how to analyze the performance effect. This article requires an intermediate to advanced level reader.
Editor's Note: This article has an Appendix containing additional data.

The availability of both processor and operating system (OS) support for 64-bit address spaces has enabled applications to take a quantum leap in the size and efficiency with which they manipulate data. UltraSPARC® processor-based servers from Sun Microsystems paired with the SolarisTM Operating System (Solaris OS) have enabled applications that were once limited to a virtual memory size of 4 gigabytes to a virtually unlimited span. This dramatic change in the size of virtual memory has changed the way developers create their applications and how organizations use them. Database management systems can have larger tables than ever before and often, they can fit entirely in main memory, significantly reducing their I/O requirements. Large-scale simulations can run with fewer restrictions on array sizes. In fact, working set sizes of several gigabytes are now common.

The performance of a memory-intensive application is dependent on the performance of the underlying memory management infrastructure. Time spent converting virtual addresses into physical addresses will slow down an application, often in a manner that is not evident to standard performance tools. In many cases, there is an opportunity to increase the performance of the underlying memory management infrastructure, resulting in higher application performance.

We can often increase an application's performance by increasing the memory management page size. The memory management unit (MMU) in Sun's UltraSPARC processors typically has just a few hundred entries, each of which can translate 8 kilobytes of address space by default, resulting in access to only a few megabytes of memory before performance is affected. Fortunately, recent improvements in the Solaris OS allow these limitations to be overcome.

Beginning with the Solaris 9 OS, multiple page sizes can be supported on UltraSPARC processors so administrators can optimize performance by changing the page size on behalf of an application. Typical performance measurement tools do not provide sufficient detail for evaluating the impact of page size and do not provide the needed support to make optimal page size choices.

This article explains how to use new tools to determine the potential performance gain. In addition, it explains how to configure larger page sizes using the multiple page size support (MPSS) feature of the Solaris 9 OS. The article addresses the following topics:

  • "Understanding Why Virtual-to-Physical Address Translation Affects Performance"

  • "Working With Multiple Page Sizes in the Solaris OS"

  • "Configuring for Multiple Page Sizes"

Understanding Why Virtual-to-Physical Address Translation Affects Performance

The faster the microprocessor converts virtual addresses into physical addresses, the faster the application can run. Ideally, the MMU converts virtual addresses into physical addresses quickly enough (every microprocessor cycle) that the microprocessor won't stall and wait. Under certain circumstances, however, the translation process can take considerably longer. In fact, this can typically take from tens to hundreds of cycles.

We can often minimize the time taken to translate virtual addresses for applications with large working sets by increasing the page size used by the MMU. However, some applications that work well with small pages might see performance degradation when they are forced to use large pages. For example, processes that are short-lived, have small working sets, or have memory access patterns with poor spatial locality could suffer from the overhead of using large pages. Additionally, copy-on-write (COW) faults for a large page require a significant amount of time to process, as does first-touch access, which involves allocating and zeroing the entire page. For these reasons, we must analyze the application to determine whether the use of large pages is beneficial. Methods for determining when applications will benefit from large page sizes are presented later in this article.

Solaris OS Address Translation

Memory is abstracted so that applications only need to deal with virtual addresses and virtual memory. Behind the scenes, the OS and hardware, in a carefully choreographed dance, transparently translate the application's virtual addresses into the physical addresses for use by the hardware memory system.

The task of translating a virtual address into a physical address is accomplished by software and hardware modules. The software translates the mappings within an address space of a process (or the kernel) into hardware commands to program the microprocessor's MMU. The hardware then translates requests for virtual memory from the running instructions into physical addresses in real time. Optimally, this happens in the time of one microprocessor cycle. Some microprocessors, such as UltraSPARC, require assistance from the OS to manage this process, facilitated by a hardware-generated exception into system software where these helper tasks are performed.

Figure 1FIGURE 1 Solaris OS Virtual-to-Physical Memory Management

The combined virtual memory system translates virtual addresses into physical addresses in page-size chunks of memory as depicted in the preceding figure.

The hardware uses a table known as the translation lookaside buffer (TLB) in the microprocessor to convert virtual addresses to physical addresses on-the-fly. The software programs the microprocessor's TLB with entries identifying the relationship of the virtual and physical addresses. Because the size of the TLB is limited by hardware, the TLB is typically supplemented by a larger (but slower) in-memory tables of virtual-to-physical translations. On UltraSPARC processors, these tables are known as the translation storage buffer (TSB); on most other architectures, they are known as a page table. When the microprocessor needs to convert a virtual address into a physical address, it searches the TLB (a hardware search), and if a physical address is not found (for example, the hardware encounters a TLB miss), the microprocessor searches the larger in-memory table. The following figure illustrates the relationship of these components.

Figure 2FIGURE 2 Virtual Address Translation Hardware and Software Components

UltraSPARC I–IV microprocessors use a software TLB replacement strategy. For example, when a TLB miss occurs, software is invoked to search the in-memory table (the TSB) for the required translation entry.

Let's step through a simple example. Suppose a process allocates some memory within its heap by calling malloc(). Further suppose that malloc() returns to the program a virtual address of the requested memory. When that memory is first referenced, the virtual memory layer requests a physical memory page from the system's free lists. The newly acquired page has an associated physical address within physical memory. The virtual memory system then constructs a translation entry in memory containing the virtual address (the start of the page returned by malloc) and the physical address of the new page. The newly created translation entry is then inserted into the TSB and programmed into an available slot in the microprocessor's TLB. The entry is also kept in software, linked to the address space of the process to which it belongs. Later, when the program accesses the virtual address, if the new TLB entry is still in the TLB virtual-to-physical address is translated on-the-fly. However, if the TLB entry has been evicted by other activity, a TLB miss occurs. The corresponding hardware exception looks up the translation entry in the larger TSB and reloads it into the TLB.

The TSB is also limited in size. In extreme circumstances, a TSB miss can occur. Translating the virtual address of a TSB miss requires a lengthy search of the software structures associated with the process.

The mechanism is similar for other processors using hardware TLB-miss strategies, including the Intel x86, except that a hardware TLB replacement strategy is used to refill the TLB rather than software. When a TLB miss occurs, an in-hardware engine is invoked to search the page table.

TLB Reach and Application Performance

The objective of the TLB is to cache as many recent page translations in hardware as possible so that it can satisfy a process's memory accesses by performing all of the virtual-to-physical translations on-the-fly. Most TLBs are limited in size because of the amount of transistor space available on the CPU die. For example, the UltraSPARC I and II TLBs are only 64 entries. This means that the TLB can address no more than 64 pages of translations at any time; therefore, on UltraSPARC, the TLB can address 64 \ 8 kilobytes (512 kilobytes).

The amount of memory the TLB can concurrently address is known as the TLB reach. The UltraSPARC I and II have a TLB reach of 512 kilobytes. If an application makes heavy use of less than 512 kilobytes of memory, the TLB will be able to cache the entire set of translations. However, if the application were to make heavy use of more than 512 kilobytes of memory, the TLB will begin to miss, and translations will have to be loaded from the larger TSB.

The following table shows the TLB miss rate and the amount of time spent servicing TLB misses from a study of older SPARCTM architectures. We can see from the table that only a small range of compute-bound applications fit well in the SuperSPARCTM TLB (gcc, ML, and pthor), whereas the others applications spend a significant amount of their time in the TLB miss handlers.

TABLE 1 Sample TLB Miss Data From a SuperSPARC Processor Study

Workload

Total Time (secs)

User Time (secs)

# User TLB Misses

% User Time in TLB Miss Handling

Cache Misses ('000s)

Peak Memory Usage (MB)

coral

177

172

85974

50

71053

19.9

nasa7

387

385

152357

40

64213

3.5

compress

99

77

21347

28

21567

1.4

fftpde

55

53

11280

21

14472

14.7

wave5

110

107

14510

14

4583

14.3

mp3d

37

36

4050

11

5457

4.8

spice

620

617

41923

7

81949

3.6

pthor

48

35

2580

7

6957

15.4

ML

945

917

38423

4

314137

32.0

gcc

118

105

2440

2

9980

5.6


TLB effectiveness has become a larger issue in the past few years because the average amount of memory used by applications has grown significantly (almost doubling year upon per year according to recent statistical data). The easiest way to increase the effectiveness of the TLB is to increase the TLB reach so that the working set of the application fits within the TLB's reach.

The TLB reach can be improved using either of the following methods:

  • Increase the number of entries in the TLB. This approach adds complexity to the TLB hardware and increases the number of transistors required; therefore, requiring more valuable die space.

  • Increase the page size for each entry. This approach increases the TLB reach without the need to increase the size of the TLB.

One trade-off of increasing the page size is that doing so might boost the performance of some applications at the expense of slower performance elsewhere. This trade-off is caused by wasted space that results from larger memory allocation units. We would almost certainly increase the memory usage of many applications.

Luckily, a solution is at hand. Some of the newer processor architectures allow us to use two or more different page sizes at the same time. For example, UltraSPARC provides hardware support concurrently to select 8 kilobyte, 64 kilobyte, 512 kilobyte, or 4 megabyte pages. If we were to use 4 megabyte pages to map all memory, then the TLB would have a theoretical reach of 64 \ 4 megabytes (256 megabytes).

  • Share ThisShare This
  • Your Account

Discussions

Make a New Comment

You must log in in order to post a comment.

Informit Network