Home > Articles > Programming > Web Services/ XML/ SOA/ WebSphere/ WCF

Memory Hierarchy in Cache-Based Systems

  • PrintPrint
  • Share ThisShare This
  • DiscussDiscuss
Close Window

Sun Microsystems

Learn more…

IPsec -- A Secure Deployment Option
Sep 24, 2004
Using pGINA to Authenticate Users in Microsoft Windows Environments
Aug 27, 2004
Best Practices for Deploying the Sun StorADE Utility
Aug 20, 2004
Performing Network Solaris Installations Without a Local Boot Server
Aug 13, 2004
Using Solaris Resource Manager With Sun Ray
Aug 6, 2004
N1 Grid Architecture Realized: Strategic Flexibility
Jul 16, 2004
Global Grid Connectivity Using Globus Toolkit With Solaris Operating System
Jun 25, 2004
Building a Bootable DVD to Deploy a Solaris Flash Archive
Jun 18, 2004
Building OpenSSH--Tools and Tradeoffs, Updated for OpenSSH 3.7.1p2
Jun 18, 2004
Maximizing the Performance a Gigabit Ethernet NIC Interface
Jun 18, 2004
Dynamic Reconfiguration for High-End Servers: Part 2--Implementation Phase
Jun 11, 2004
Supporting Multiple Page Sizes in the Solaris Operating System Appendix
Jun 11, 2004
Dynamic Reconfiguration for High-End Servers: Part 1 --- Planning Phase
Jun 4, 2004
Supporting Multiple Page Sizes in the Solaris Operating System
Jun 4, 2004
Data Center Best Practices for High-End Servers
May 28, 2004
Understanding Tuning TCP
May 28, 2004
Sun Ray Deployment On Shared Networks
Apr 30, 2004
LDAP Triggers: A Framework for Sun Java System Directory Server
Apr 23, 2004
Taming Your Emu to Improve Application Performance
Apr 23, 2004
Best Practices for Deploying the Sun StorADE Utility
Apr 16, 2004
Sun Fire 15K/12K Auto Diagnosis and Recovery
Apr 16, 2004
Dynamic Reconfiguration and Oracle 9i Dynamically Resizeable SGA
Apr 9, 2004
Solaris Operating System Availability Features
Apr 2, 2004
Design, Features, and Applicability of Solaris File Systems
Mar 26, 2004
Securing the Sun Fire 12K/15K System Controller
Mar 19, 2004
Securing the Sun Fire 12K/15K Domains
Mar 12, 2004
Enterprise Network Design Patterns: High Availability
Feb 20, 2004
Performance Forensics
Feb 13, 2004
Migrating to the Solaris Operating System: Migrating From Tru64 UNIX
Feb 6, 2004
Tuning ORACLE to Minimize Recovery Time: For Solaris Operating System on SPARC
Feb 6, 2004
Securing Linux Systems With Host-Based Firewalls Implemented With Linux iptables
Jan 30, 2004
Securing Web Applications through a Secure Reverse Proxy
Jan 30, 2004
Hardware Replication Challenges
Jan 23, 2004
Solaris Volume Manager Performance Best Practices
Jan 23, 2004
Sun Fire 6800/4810/4800/3800 Systems Auto Diagnosis and Recovery Enhancements
Jan 16, 2004
Responding to a Customer's Security Incidents, Part 4: Processing Incident Data
Jan 9, 2004
Desktop Architecture Selection Guide
Dec 31, 2003
Sun ONE Portal Server 6 Best Practices
Dec 23, 2003
Migrating to the Solaris Operating System: Migration Strategies
Oct 31, 2003
Responding to Customer's Security Incidents--Part 3: Following Up After an Incident
Oct 31, 2003
Minimizing Domains for Sun Fire V1280, 6800, 12K, and 15K Systems, Part II
Oct 24, 2003
Using the LDAP to NIS+ Gateway
Oct 24, 2003
Deploying the Solaris Operating Environment Using a Solaris Security Toolkit CD
Oct 17, 2003
Minimizing Domains for Sun Fire V1280, 6800, 12K, and 15K Systems, Part I
Oct 17, 2003
Building Secure Sun Fire Link Interconnect Networks Using Sun Fire 15K and Sun Fire 12K Servers
Sep 26, 2003
Linux Overview for Solaris Users
Sep 26, 2003
Securing Sun Linux Systems: Part II, Network Security
Sep 26, 2003
Sun Fire V1280/Netra 1280 Server Considerations for Improving RAS
Sep 26, 2003
Sun ONE Portal Server and Lotus iNotes Integration Recipe
Sep 26, 2003
Transition Guide--Upgrading From the iPlanet Directory Server 5.1 Software to the Sun ONE Directory Server 5.2 Software
Sep 26, 2003
Capacity Planning as a Performance Tuning Tool—Case Study for a Very Large Database Environment
Sep 19, 2003
Securing Sun Linux Systems: Part I, Local Access and File Systems
Sep 19, 2003
Sun Fire 15K/12K Server Preferred Practices
Sep 19, 2003
Sun Grid Engine, Enterprise Edition—Configuration Use Cases and Guidelines
Sep 19, 2003
The IT Utility Model—Part I
Sep 19, 2003
Using filesync for Disaster Recovery, Business Continuance, and Mobility
Sep 19, 2003
Role Based Access Control and Secure Shell—A Closer Look At Two Solaris Operating Environment Security Features
Sep 12, 2003
Solaris Operating Environment Network Settings for Security: Updated for Solaris 9 Operating Environment
Sep 12, 2003
Using NTP on the Sun Fire 15K/12K Server
Sep 12, 2003
Consolidation Methodology
Sep 5, 2003
Using the Sun ONE Application Server 7 to Enable Collaborative B2B Transactions
Sep 5, 2003
An Architecture for Creating and Managing Integrated Software Stacks
Aug 29, 2003
Auditing System Security
Aug 29, 2003
Integrating the Secure Shell Software
Aug 29, 2003
Sun Cluster 3.0 Series: Guide to Installation—Part 2
Aug 29, 2003
Sun ONE Portal Server and Microsoft Exchange Integration Cookbook
Aug 29, 2003
Building a Global Compute Grid - Two Examples Using the Sun ONE Grid Engine and the Globus Toolkit
Aug 22, 2003
Configuring the Secure Shell Software
Aug 22, 2003
Responding to Customer's Security Incidents—Part 2: Executing a Policy
Aug 22, 2003
Sun Cluster 3.0 Series: Guide to Installation—Part 1
Aug 22, 2003
Sun Fire 6800/4810/4800/3800 Auto Diagnosis and Recovey Features
Aug 22, 2003
Provisioning in Replicated, Mission-Critical Environments
Aug 15, 2003
Responding to Customer's Security Incidents, Part 1: Establishing Teams and a Policy
Aug 15, 2003
Securing the Sun Fire 12K and 15K System Controllers
Aug 15, 2003
Writing an Authentication Plug-in for a Sun ONE Directory Server
Aug 15, 2003
Securing the Sun Cluster 3.x Software
Aug 8, 2003
Securing the Sun Fire 12K and 15K Domains
Aug 8, 2003
Understanding Gigabit Ethernet Performance on Sun Fire Servers
Aug 8, 2003
Using Midframe Servers to Build Secure Sun Fire Link Interconnect Networks
Aug 8, 2003
BluePrint for Benchmarking Success
Aug 1, 2003
System Management Services Software: An Inside Look
Aug 1, 2003
A Patch Management Strategy for the Solaris Operating Environment
May 23, 2003
Building OpenSSH—Tools and Tradeoffs
May 23, 2003
Configuring Databases Using Soft Links
May 23, 2003
Managing Shared Storage in a Sun Cluster 3.0 Environment With Solaris Volume Manager Software
May 23, 2003
Modeling Sun Cluster Availability
May 23, 2003
Performance Oriented System Administration For Solaris
May 23, 2003
A Strategy for Managing Performance
Apr 18, 2003
Solaris Operating Environment Security: Updated for Solaris 9 Operating Environment
Apr 18, 2003
Trust Modeling for Security Architecture Development
Apr 18, 2003
Understanding Solaris 9 Operating Environment Directory Services
Apr 18, 2003
A New Open Resource Management Architecture in the Sun HPC ClusterTools Environment
Feb 21, 2003
Campus Clusters Based on Sun Cluster Software
Feb 14, 2003
Memory Hierarchy in Cache-Based Systems
Feb 14, 2003
Designing Highly Available Architectures: A Methodology
Feb 7, 2003
Internet Protocol Network Multipathing (Update)
Feb 7, 2003
Minimizing the Solaris Operating Environment for Security: Updated for Solaris 9 Operating Environment
Feb 7, 2003
Configuring Boot Disks With Solaris Volume Manager Software
Jan 24, 2003
Managing Data Centers With Sun Management Center Change Manager
Jan 24, 2003
SQL*Net Performance Tuning Using Underlying Network Protocols
Jan 24, 2003
Extending Authentication in the Solaris 9 Operating Environment Using Pluggable Authentication Modules (PAM): Part II
Jan 17, 2003
HPC Administration Tips and Techniques
Jan 17, 2003
Sun Fire Midframe Server Best Practices for Firmware Update 5.13.x
Jan 17, 2003
Extending Authentication in the Solaris 9 Operating Environment Using Pluggable Authentication Modules: Part I
Dec 27, 2002
Sun Fire Systems Design and Configuration Guide
Dec 27, 2002
Consolidation in the Data Center
Dec 20, 2002
Enterprise Network Design Patterns: High Availability
Dec 20, 2002
Introduction to the Solaris Cluster Grid - Part 2
Dec 20, 2002
Introduction to the Sun Cluster Grid, Part 1
Sep 26, 2002
Sun's Quality, Engineering, and Deployment (QED) Test Train Model
Sep 26, 2002
Customizing JumpStart Framework for Installation and Recovery
Sep 20, 2002
Sun StorEdge Instant Image 3.0 and Oracle8i Database Best Practices
Sep 20, 2002
Windows NT Server Consolidation and Performance Improvements with Solaris PC NetLink 2.0 Software
Sep 20, 2002
Sun ONE Portal Server 3.0 Rewriter Configuration and Management Guide
Sep 13, 2002
Securing the Sun Fire 12K and 15K Domains, Updated for SMS 1.2
Sep 6, 2002
Securing the Sun Fire 12K and 15K System Controllers, Updated for SMS 1.2
Sep 6, 2002
An Information Technology Management Reference Architecture Implementation
Aug 30, 2002
Reducing the Backup Window With Sun StorEdge Instant Image Software
Aug 30, 2002
An Information Technology Management Reference Architecture
Aug 16, 2002
Drill-Down Monitoring of Database Servers
Aug 16, 2002
LAN-Free Backups Using the Sun StorEdge Instant Image 3.0 Software
Aug 16, 2002
Network Storage Evaluations Using Reliability Calculations
Aug 16, 2002
Securing LDAP Through TLS/SSL: A Cookbook
Aug 16, 2002
Securing the Sun Fire Midframe System Controller
Aug 16, 2002
Deployment Considerations for Data Center Management Tools
Aug 9, 2002
Guide to Installation-Part II: Sun Cluster 3.0 Software Management Services
Aug 9, 2002
How Hackers Do It: Tricks, Tools, and Techniques
Aug 9, 2002
Metropolitan Area Sun Ray Services
Aug 9, 2002
Securing the Sun Cluster 3.0 Software
Aug 9, 2002
Guide to Installation, Part I: Sun Cluster Management Services
May 24, 2002
Service Level Agreement in the Solaris OE Data Center
May 24, 2002
Solaris OE Enterprise Management Systems Part I: Architectures and Standards
May 24, 2002
Solaris OE Storage Resource Management: A Practitioner's Approach
May 24, 2002
Sun Fire 3800-6800 Servers Dynamic Reconfiguration
May 24, 2002
Using Live Upgrade 2.0 With JumpStart Technology and Web Start Flash
May 24, 2002
Enterprise Quality of Service Part II: Enterprise Solution using Solaris Bandwidth Manager 1.6 Software
May 17, 2002
Introduction to SunTone Clustered Database Platforms
May 17, 2002
Securing the Sun Enterprise 10000 System Service Processors
May 17, 2002
Service Level Management in the Data Center
May 17, 2002
Solaris Application Performance Optimization
May 17, 2002
Using Live Upgrade 2.0 With a Logical Volume Manager
May 17, 2002
Establishing a Solaris OE Architectural Model
Apr 5, 2002
Configuring OpenSSH for the Solaris Operating Environment
Mar 22, 2002
Data Center Design Philosophy
Mar 22, 2002
Enterprise Quality of Service (QoS): Part I - Internals
Mar 22, 2002
Issues in Selecting a Job Management System
Mar 22, 2002
Managing Solaris Operating Environment Upgrades With Live Upgrade 2.0
Mar 22, 2002
Securing Sun Fire 15K Domains
Mar 22, 2002
Server Virtualization Using Trusted Solaris 8 Operating Environment
Mar 22, 2002
Sun Cluster 3.0 Implementation Guide: Hardware Setup
Mar 22, 2002

Sorry, this author hasn't posted any blogs.

Resource Management

Like this article? We recommend
Resource Management

This article will help the reader understand the architecture of modern microprocessors by introducing and explaining the most common terminology and addressing some of the performance related aspects. Written for programmers and people who have a general interest in microprocessors, this article presents introductory information on caches and is designed to provide understanding on how modern microprocessors work and how a cache design impacts performance.

This article is to help the reader understand the architecture of modern microprocessors. It introduces and explains the most common terminology and addresses some of the performance related aspects.

This is an introductory article on caches. After reading this article you should understand how modern microprocessors work and how a cache design impacts performance.

This article is written for programmers and people who have a general interest in microprocessors.

Despite improvements in technology, microprocessors are still much faster than main memory. Memory access time is increasingly the bottleneck in overall application performance. As a result, an application might spend a considerable amount of time waiting for data. This not only negatively impacts the overall performance, but the application cannot benefit much from a processor clock-speed upgrade either.

One way to overcome this problem is to insert a small high-speed buffer memory between the processor and main memory. Such a buffer is generally referred to as cache memory, or cache for short.

The application can take advantage of this enhancement by fetching data from the cache instead of main memory. Thanks to the shorter access time to the cache, application performance is improved. Of course, there is still traffic between memory and the cache, but it is minimal. This relatively simple concept works out well in practice. The vast majority of applications benefit from caches.

This article describes how the basic idea of caches is implemented, what sort of caches are found in most modern systems, and their impact on performance.

Because this article is accessible to a relatively large group of readers many important details are omitted.

Cache Hierarchy

As FIGURE 1 shows, the cache [Handy] is placed between the CPU and the main memory.

Figure 1FIGURE 1 Example of a Cache-Based Memory System.


The system first copies the data needed by the CPU from memory into the cache, and then from the cache into a register in the CPU. Storage of results is in the opposite direction. First the system copies the data into the cache. Depending on the cache architecture details, the data is then immediately copied back to memory (write-through), or deferred (write-back). If an application needs the same data again, data access time is reduced significantly if the data is still in the cache.

To amortize the cost of the memory transfer, more than one element is loaded into the cache. The unit of transfer is called a cache block or cache line.1 Access to a single data element brings an entire line into the cache. The line is guaranteed to contain the element requested.

Related to this is the concept of sub-blocking. With sub-blocking, a cache allocates a line/block with a length that is a multiple of the cache line. The slots within the larger block are then filled with the individual cache lines (or sub-blocks). This design works well if lines are accessed consecutively, but is less efficient in case of irregular access patterns, because not all slots within one block may be filled.

So far, we have only applied caches to data transfer. There is, however, no reason why you could not use caches for other purposes—to fetch instructions, for example. Cache Functionality and Organization explores these other purposes in more detail.

Thanks to advances in chip process technology, it is possible to implement multiple levels of cache memory. Some of these levels will be a part of the microprocessor (they are said to be on-chip), whereas other levels may be external to the chip.

To distinguish between these caches, a level notation is used. The higher the level, the farther away the cache is from the CPU. FIGURE 2 shows an example. The level 1 (L1) cache is on-chip, whereas the level 2 (L2) cache is external to the microprocessor.

Note that in FIGURE 2, and in the remainder of this article, we distinguish between the CPU and microprocessor. CPU refers to the execution part of the processor, whereas microprocessor refers to the entire chip, which includes more than the CPU.

Figure 2FIGURE 2 Multiple Levels of Cache Memory

In FIGURE 2, the size of the cache increases from left to right, but the speed decreases. In other words, the capacity increases, but it takes longer to move the data in and out.

In some designs, there are three levels of cache. To complicate matters even further, caches at a certain level can also be shared between processors. This topic however is beyond the scope of this paper.

Latency and Bandwidth

Latency and bandwidth are two metrics associated with caches and memory. Neither of them is uniform, but is specific to a particular component of the memory hierarchy.

The latency is often expressed in processor cycles or in nanoseconds, whereas bandwidth is usually given in megabytes per second or gigabytes per second.

Although not entirely correct, in practice the latency of a memory component is measured as the time it takes to fetch one unit of transfer (typically a cache line). As the speed of a component depends on its relative location in the hierarchy, the latency is not uniform. As a rule of thumb, it is safe to say that latency increases when moving from left to right in FIGURE 2.

Some of the memory components, the L1 cache for example, may be physically located on the microprocessor. The advantage is that their speed will scale with the processor clock. It is, therefore, meaningful to express the latency of such components in processor clock cycles, instead of nanoseconds.

On some microprocessors, the integrated (on-chip) caches do not always run at the speed of the processor. They operate at a clock rate that is an integer quotient (1/2, 1/3, and so forth) of the processor clock.

Cache components external to the processor do not usually, or only partially2, benefit from a processor clock upgrade. Their latencies are often given in nanoseconds. Main memory latency is almost always expressed in nanoseconds.

Bandwidth is a measure of the asymptotic speed of a memory component. This number reflects how fast large bulks of data can be moved in and out. Just as with latency, the bandwidth is not uniform. Typically, bandwidth decreases the further one moves away from the CPU.

Virtual Memory

Although not considered in detail in this article, virtual memory is mentioned for reasons of completeness and to introduce the TLB cache. For more details, refer to [CocPet] and [MauMcDl]. The latter covers the virtual memory in the Solaris™ operating environment (Solaris OE) in great detail.

On a virtual memory system, memory extends to disk. Addresses need not fit in physical memory. Certain portions of the data and instructions can be temporarily stored on disk, in the swap space. The latter is disk space set aside by the Solaris OE and used as an extension of physical memory. The system administrator decides on the size of the swap space. The Solaris OE manages both the physical and virtual memory.

The unit of transfer between virtual memory and physical memory is called a page. The size of a page is system dependent3.

If the physical memory is completely used up, but another process needs to run, or a running process needs more data, the Solaris OE frees up space in memory by moving a page out of the memory to the swap space to make room for the new page. The selection of the page that has to move out is controlled by the Solaris OE. Various page replacement policies are possible. These replacement policies are, however, beyond the scope of this article.

Certain components in the system (the CPU for example) use virtual addresses. These addresses must be mapped into the physical RAM memory. This mapping between a virtual and physical address is relatively expensive. Therefore, these translated addresses (plus some other data structures) are stored in an entry in the so-called Translation Lookaside Buffer (TLB). The TLB is a cache and behaves like a cache. For example, to amortize the cost of setting up an entry, you would like to re-use it as often as possible.

The unit of virtual management is a page; one entry in the TLB corresponds to one page.

  • Share ThisShare This
  • Your Account

Discussions

Make a New Comment

You must log in in order to post a comment.

Related Resources

Danny KalevMinutes from the October 2009 Meeting
By Danny Kalev on November 19, 2009 No Comments

The minutes from the Santa Cruz (October 2009) meeting are available here. Even if you're not a language layer at heart, I encourage you to read them.

Danny KalevA Reader's Opinion on Attributes
By Danny Kalev on October 20, 2009 No Comments

In August I dedicated a series to the debate about C++0x attributes. I believe that it covered the subject in a balanced and detailed way, but I keep getting complaints from C++ users who don't like attributes for various reasons. Here's a recent email I received from a Polish C++ programmer. While it  doesn't represent my opinion about attributes -- I'm rather neutral about this feature and consider it a "solution waiting for a problem" -- but it suggests that attributes are still a highly controversial issue that will haunt C++ for a long time. The email is quoted here with minor edits that and as usual, with all private details removed.

Danny KalevFollowup: The Web 2.0 Guy I Ain't
By Danny Kalev on October 16, 2009 1 Comment

Almost a year ago, I posted here The Web 2.0 Guy I Ain't. People wonder whether I still resist all those Web 2.0 features and technologies at the end of 2009.

See All Related Blogs

Informit Network