Home > Articles > Software Development & Management

Like this article? We recommend

Tools Implementation Details

The following sections describe the details of the tools implemetation.

IronView

The IronView product set monitors and manages the networking equipment from Foundry Networks. IronView provides a console for direct access to the network topology. Its main function is the administration of the network topology. It was implemented on the deladmin server.

The Foundry Network devices issue SNMP traps directly to Netcool for the Network Operations Center (NOC) and SLA processing. In this implementation, IronView is installed co-resident with the Message Server (Sun ONE) delegated administrator services because the IronView processing requirements are very low.

Sun Management Center 3.0

The SunMC 3.0 software is an element manager that details and monitors the operation of Sun servers, peripherals, and applications. In the context of the iFRC program, SunMC is used primarily to provide low-level monitoring of the application servers to report on hardware and operating system stresses and faults.

Loren Pearce from SunPS collected and described the following details on Sun Management Center 3.0.

SunMC 3.0 has a three-tier architecture. The first tier includes the "agents" that collect information and alerts from the individual systems being monitored. The second tier is a server component that collects information from the monitoring agents and reports this information to the clients. This tier also provides automation capabilities.

The third tier is the client tier that is usually provided by the server, console, and an agent component all executing on one system. Additional agents are deployed to the application servers for monitoring of these systems. The console is viewed using Sun Ray™ appliance displays. While this is not unusual, it is not necessary to combine the three components of SunMC 3.0 on one system.

In addition to monitoring and reporting on the application servers, SunMC 3.0 agents are deployed on the systems management servers in order to monitor their operation and alert the systems administrator to any problems within the monitoring complex.

Figure 3FIGURE 3 SunMC 3.0 Configuration

TABLE 2 details how the various components of SunMC 3.0 are distributed throughout the system topology.

TABLE 2 SunMC 3.0 Component Distribution

Component

System

Port(s)

Notes

SunMC server

sunmc

162

163

164

165

166

167

168

2099

 

SunMC console

sunmc

Variable

 

SunMC agent

sunmc

161

SunMC agent replaces standard snmp daemon

 

ncdb

161

 

 

ncimp

161

 

 

teamq

161

 

 

smtpr01

161

 

 

smtpr02

161

 

 

smtps01

161

 

 

smtps02

161

 

 

ha01

161

 

 

ha02

161

 

 

ldaps01

161

 

 

ldaps02

161

 

 

ldapm01

161

 

 

ldapm02

161

 

 

mmp01–mmp06

161

 

 


The Netcool SunMC probe collects alerts for reporting to the NOC and the SLA manager. This probe acts as a client to SunMC to capture the system level events and forwards them to the Netcool database.

SunMC also provides investigation and alert management through its console. The console is made available as part of the system administration view so that a system operator can obtain additional details pertaining to outages detected in the environment. With this information, the system administrator can determine the correct course of action for resolving the operational impact.

TeamQuest

TeamQuest performance software monitors the performance metrics of the environment against the best practices as defined in the Sun Professional ServicesSM (SunPS) program performance tuning and capacity planning methodology, outlined in the "Implementation Requirements" section of the previous article. When performance target violations are detected, alerts are sent to Netcool in the form of SNMP traps that are then processed and forwarded to the NOC and SLA views.

TeamQuest also provides capacity planning and performance tuning capabilities.

This section describes the details of how the iFRC implemented certain aspects of performance management. It was provided by Scott Johnson, Senior Engineer and Joe Rich, Manager Corporate Technical Relations, from TeamQuest Corporation, who provided the technical expertise and tools.

TeamQuest View and TeamQuest Model were the two products used in the iFRC to provide a mechanism to capture and report on the performance and capacity of the systems in the architecture.

TeamQuest View is a comprehensive, analytical tool that reports system performance and helps locate bottlenecks. Performance data is collected from several different sources on the system and stored into a local database for real-time or historical analysis.

TeamQuest Model provides predictive analysis required for long-term capacity planning. This tool, using data retrieved from the architecture systems, was used to build models of the systems and applications. These models were validated against the benchmark data and were used to experiment with "what-if" questions relating to the number of users and the effects of equipment changes.

TeamQuest View

TeamQuest View consists of a server-based component (framework) that is responsible for (among other things) performance data collection and storage and threshold violation notification. TeamQuest View also has a Motif-based client GUI (TQView) which, when connected over TCP/IP to the server component, produces graphical displays of both real-time and historical performance data. In the iFRC, the client GUI with associated reports is located on a separate server with automatic login capability from any one of several management stations connected to a SunRay server. Upon login, TQView automatically establishes a connection to the desired architecture server and produces a series of predefined performance reports (operational views). The reports consist of a number of statistics that were selected from baseline performance monitoring metrics as defined in the SunPS performance tuning and capacity planning methodology. FIGURE 4 is one example, showing important CPU metrics, the top 10 busiest disks, the number of RPC client calls per second and the buffer read and write cache hit percentages. In addition to these reports, TeamQuest provides a large number of pre-defined reports and also supports ad hoc report generation.

Figure 4FIGURE 4 Operational View Example of the Message Server (Sun ONE) from TeamQuest View

Threshold Violation Detection and Notification

The TeamQuest Framework component samples data at a preset interval and stores that data into its performance database. default sampling rate of one minute is implemented. As the data is collected, TeamQuest automatically checks the values of the statistics against any defined threshold levels. A set of thresholds, based on baseline performance monitoring metrics as defined in the SunPS performance tuning and capacity planning methodology, was established for all of the servers in the architecture.

When a threshold criterion is met, an alert is issued. The default action is to log the event into the performance database in the alarm log. In addition, the iFRC team chose to send all alerts through an SNMP trap to the MicroMuse Netcool Omnibus console. MicroMuse assisted in the integration of the TeamQuest View client as a tool under Netcool so that it could be launched in context to the server that issued the alert. In this way, a report showing a graph of the statistic in violation can automatically be generated. FIGURE 5 and FIGURE 6 show drilling down from an alert event sent to Netcool to the statistic in violation on the server under stress.

Figure 5FIGURE 5 MicroMuse/TeamQuest View Integration

Figure 6FIGURE 6 TeamQuest View Report as a Result of the "Start in Context" by MicroMuse

Application Level Resource Reporting

When monitoring, tuning, and predicting capacity of systems, identifying the critical applications (or workloads) of the system is imperative. SLAs are usually created for these critical applications, and thus dictate which workloads must be monitored.

TeamQuest provides a mechanism, through identification of the processes that comprise an application, to track the resource consumption of that group of processes. Through a definitional language, the user builds process groups that are called workloads.

For the IDC Mail and Messaging Architecture, a common set of workload definitions is created and loaded into the TeamQuest performance database on each system.

NOTE

To enhance the data collection, Solaris OE process accounting was enabled on all of the systems. Thus when a sample was taken, both the running process activity and the completed process activity was merged to provide the best capture ratio.

TeamQuest reduction processing was deactivated so that each and every process captured was recorded into the performance database.

As TeamQuest process-level data was collected, it was compared to the workload definitions and deposited into the appropriate workload container. This allowed the iFRC team to do several very important functions:

  • Monitor (and alert on) the resource consumption by workload.

  • Provide the basic operational entities for the capacity planning exercises described later. In addition to the prestored workload definitions, TeamQuest has the capability to reprocess the raw process data after the fact into workloads, thus allowing for "tuning" and further refinement of the workloads. TABLE 3 lists the workloads and their definitions for the MAIL servers.

TABLE 3 Sample and Workload Definitions

Workload name

Description

Definition

Management Services

Processes associated with TeamQuest Performance software or SunMC monitoring agents

command = /tq.*/ or command = esd

Cluster

Processes associated with Sun Cluster

fullcmd = /.*cluster.*/

Stored

Message store maintenance program (deadlock checks, message deletion, and so forth)

command = stored and

login = mailsrv

Post Office Protocol (POP)

Processes that handle POP3 messages to/from MMP servers

command = popd and

login = mailsrv

(Internet Mail Access Protocol (IMAP)

Processes that handle IMAP4 messages to/from MMP servers

command = imapd and

login = mailsrv

Simple Mail Transfer Protocol (SMTP)

Processes that handle the MTA queue for (SMTP) messages

login = mailsrv and

((command = tcp_smtp_server) or (command = ims_master))

Other Message Server (Sun ONE)

Any other processes associated with the Message Server (Sun ONE) application

login = mailsrv

Other

Any processes not explicitly included in the preceding workloads

 


FIGURE 7 is a screen display of one of the mail servers showing the overall CPU utilization along with the same utilization broken out by course application workloads and finally by a finer granularity set of workloads. It is easy to see how much of the resource each application is consuming and, within the Message Server 5.1 (Sun ONE) application, which specific mail protocol is using the most resources (POP).

Figure 7FIGURE 7 Overall CPU Utilization By Workload

Capacity Planning

As mentioned earlier, TeamQuest Model is the capacity planning tool that was used in this study. TeamQuest Model applies operational analysis principles to measured computer system performance data to build baseline performance models. You can then modify the baseline models to predict system performance when configurations and/or workloads change.

The models are known as queuing network models (QNMs). A QNM consists of a set of resources (CPU, disks, terminals, and so forth) that can be visited by customers from one or more workloads. Workloads can have unique intensities (arrival rates or populations), priorities, and average service demands (visits and service times per visit) at resources. Solving a QNM produces estimates of throughputs, response times, queue lengths, and utilizations for each workload at each resource, as well as estimates of overall workload throughputs and response times.

Results are presented in spreadsheet like displays and charts. For example, components of response time charts allow performance analysts to quickly assess impacts of queuing delays at resources on workload response times.

TeamQuest Model provides two methods to solve QNMs—approximate mean value analysis (MVA) and simulation. Approximate MVA is normally used, since it is much faster than simulation. Approximate MVA iterates solving for response times, throughputs and queue lengths until convergence occurs for all workloads at all resources. Approximate MVA is the method that was used to model the messaging IDC reference architecture systems.

Baseline models of several representative benchmarks were created, including an enterprise model, built by including a representative system from each server class in the messaging system.

Since actual transaction completion rates were not available for the user activity, a synthetic transaction rate was constructed based on the email sender test profile and the number of users. FIGURE 8 shows the throughput of the two major workloads for the enterprise. The throughput rate is the transactions per second based on the synthetic transaction definition discussed previously.

Figure 8FIGURE 8 Major Workload Throughput

The SMTP workload throughput increases as the number of email senders increases. However, the throughput for the POP workload increases steadily only to the 54000/29000 user level and reaches a plateau by the 64000/34000 user level. This indicates that something is constraining the activity of the POP workload.

Further analysis indicates that the constraint occurred in the MMP server. FIGURE 9 shows that the CPU for mmp01 is about 95 percent utilized and is most likely the constraining factor.

Figure 9FIGURE 9 Five Most Active Resources for System mmp01

In the benchmark, the configuration consisted of six MMP servers, which would appear to be able to support about 8000 email readers. The models suggest that to support the 94,000 email reader level would require 12 Netra T1 systems.

Since TeamQuest Model allows you to play "what-if" scenarios both on population growth and on configuration changes, you can see the results of the same workload run on a two-CPU or a four-CPU Netra t1405. (The relative performance characteristics of the CPUs were provided by SunPS).

With the MMP servers upgraded to a two-CPU Netra t1405, the enterprise is able to adequately process the work. FIGURE 10 shows the throughput for both the POP and SMTP workloads steadily increasing as the number of users increases.

Figure 10FIGURE 10 POP and SMTP Throughputs on MMP Servers as Two-CPU Netra t1405 Servers

The active resource utilization for each system indicates that there are not any bottlenecks in any of the systems (MMP – 8 percent, messaging – 39 percent, MTA – 18 percent). However, if we continue our modeling exercise, and double the number of active users, the MMP server becomes saturated. Then, increasing the hardware on the MMP server is necessary. By modeling a number of "what-if" scenarios, we arrive at a four-CPU configuration that adequately supports the doubling of the population. FIGURE 11, FIGURE 12, and FIGURE 13 show the active resource utilization for each system type, supporting up to a 194000/99000 user level.

Figure 11FIGURE 11 mmp01 Active Resource Utilization (4-CPU Netra t1405 Server)

Figure 12FIGURE 12 smtp01 Active Resource Utilization (4-CPU Netra t1405 Server)

Figure 13FIGURE 13 ha01 Active Resource Utilization (12-CPU Sun Fire 6800 Server)

The ha01 system, being one of the message store systems, is handling the mail demands of the increased user population easily.

Through the TeamQuest Performance software, the iFRC team could provide accurate server sizing information based on the various end-user usage requirements.

Micromuse

The Micromuse Netcool product suite is implemented to collect events and information from the other system management tools, consolidate this information, assess the impact on the environment, and produce service level reports and information against the application set. This is accomplished using the following Micromuse products

  • Object Server: The Netcool Object Server is an in-memory database that provides event collection, filter design, and view services. The Object Server provides the foundation upon which the other products in the Netcool suite operate. In the iFRC configuration, redundant object servers are installed in order to provide fault tolerance to the collection system.

  • Internet Service Monitors: The Micromuse Internet Service Monitors (ISM) allow monitoring of 17 different protocols and constructing a simulated transaction. They can be set up in a distributed master/slave relationship, and do not require agents on the remote host that provides the service being monitored. Using the ISMs, you can monitor the Message Server 5.1 (Sun ONE) functionality (LDAP, SMTP, POP, and IMAP) and report their availability for SLA monitoring purposes.

  • Impact: The Micromuse Impact product allows super-imposition of business logic onto the data that has been captured by probes and monitors and is in the Micromuse Object Server. Impact also allows sophisticated event correlation over time, execution of tests to verify events, event consolidation and event enrichment to name a few possibilities.

  • Visionary: The Micromuse Visionary product is an analysis tool that utilizes weighted measurements of various SNMP collected values to assert certain fault conditions exist. It is an attempt to be an engineer in a box. Visionary uses weighted MIB expressions to define the state of something measurable via SNMP (BGP health, Host status, and so forth). You can create your own MIB expressions to be used in the polling and analysis.

  • WebTop: The Micromuse WebTop product is a two-directional Java_ interface into the Netcool Object Server Event List. WebTop can be configured to provide a one-directional (read-only) view to events pertinent to a specific customer.

Netcool Topology

As previously mentioned, the Netcool Object server has been installed on two servers in the IDC Mail and Messaging Reference Architecture implementation in order to provide some level of redundancy to the Netcool system. These Object servers are connected to each other using a bidirectional gateway. With this gateway, a Virtual Object Server is created with the two "real" Object Servers as the participants. Clients need only reference the name of the Virtual Object Server. Object Server operations are performed by one of the two real Object Servers and then synchronized with the other. This scheme provides fault tolerance for the object server.

Monitoring of the Message server 5.1 (Sun ONE) application environment is accomplished using probes provided with the Micromuse suite of products. These probes collect information from the management tools, normalize the information into a standard format, and then inject the resulting record into the database. Probes installed in this environment are:

Syslog: The syslog probe monitors the syslog output of the servers and reports potential problems and failures to the SLA manager and the NOC.

mttrapd: The mmtrapd facility listens for and handles traps. In this implementation, this facility is receiving traps from TeamQuest and normalizing the information. From these traps, the following information is collected:

  • System name that generated alarm

  • User text added to alarm

  • TeamQuest severity

  • Sampled value

  • Normal/Previous/Current threshold values

  • Timestamp of alarm occurrence (from TeamQuest)

  • User text description line 1

  • User text description line 2

  • User text description line 3

  • Current TeamQuest database name and hostname where TeamQuest detected the alarm, main alert bucket, subgroup1, subgroup2, subgroup3, application statistics

SunMC: The SunMC probe connects to the SunMC 3.0 server in order to receive SunMC events and passes filtered and normalized information to the Netcool object server. The contents of the specific record passed back to the Netcool object server are dependent upon the type of alert that was generated by SunMC.

The Netcool components are distributed in the iFRC as follows:

TABLE 4 Netcool Component Distribution

Server

Component

Notes

Ncdb

Object Server

Part of virtual object server

 

ISM

Monitors Message Server (Sun ONE) applications

 

mttrapd

Gets traps from TeamQuest

 

syslog probe

 

 

Process Automation

 

 

License Server

 

Ncimp

Object Server

Part of virtual object server

 

syslog probe

 

 

Bidirectional Gateway

Combines with ncdb to create a virtual object server

 

Process Automation

 

SunMC

SunMC probe

 

 

Process Automation

Monitors SunMC probe

 

License Server

With SunMC probe license


Netcool Console Services

The Netcool implementation provides a NOC view of the environment that is a consolidation of the alerts and information from the other monitoring components. This is a standard feature of Netcool and is provided on the NOC console for Netcool.

In addition to the NOC view, Netcool Impact generates a Service Level Management (SLM) view of the environment. By monitoring the information being generated by the other monitoring tools and the ISMs, Impact can determine how service levels on the application are affected and generate alarms on the SLM view.

In cases where there are performance impacts to the monitored applications, you have the ability to interrogate live TeamQuest graphs and reports directly from the Netcool SLM view. This is accomplished by creating a customized menu tool on the Netcool display and linking that directly to the TeamQuest agent that sent the alarm.

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.

Overview


Pearson Education, Inc., 221 River Street, Hoboken, New Jersey 07030, (Pearson) presents this site to provide information about products and services that can be purchased through this site.

This privacy notice provides an overview of our commitment to privacy and describes how we collect, protect, use and share personal information collected through this site. Please note that other Pearson websites and online products and services have their own separate privacy policies.

Collection and Use of Information


To conduct business and deliver products and services, Pearson collects and uses personal information in several ways in connection with this site, including:

Questions and Inquiries

For inquiries and questions, we collect the inquiry or question, together with name, contact details (email address, phone number and mailing address) and any other additional information voluntarily submitted to us through a Contact Us form or an email. We use this information to address the inquiry and respond to the question.

Online Store

For orders and purchases placed through our online store on this site, we collect order details, name, institution name and address (if applicable), email address, phone number, shipping and billing addresses, credit/debit card information, shipping options and any instructions. We use this information to complete transactions, fulfill orders, communicate with individuals placing orders or visiting the online store, and for related purposes.

Surveys

Pearson may offer opportunities to provide feedback or participate in surveys, including surveys evaluating Pearson products, services or sites. Participation is voluntary. Pearson collects information requested in the survey questions and uses the information to evaluate, support, maintain and improve products, services or sites, develop new products and services, conduct educational research and for other purposes specified in the survey.

Contests and Drawings

Occasionally, we may sponsor a contest or drawing. Participation is optional. Pearson collects name, contact information and other information specified on the entry form for the contest or drawing to conduct the contest or drawing. Pearson may collect additional personal information from the winners of a contest or drawing in order to award the prize and for tax reporting purposes, as required by law.

Newsletters

If you have elected to receive email newsletters or promotional mailings and special offers but want to unsubscribe, simply email information@informit.com.

Service Announcements

On rare occasions it is necessary to send out a strictly service related announcement. For instance, if our service is temporarily suspended for maintenance we might send users an email. Generally, users may not opt-out of these communications, though they can deactivate their account information. However, these communications are not promotional in nature.

Customer Service

We communicate with users on a regular basis to provide requested services and in regard to issues relating to their account we reply via email or phone in accordance with the users' wishes when a user submits their information through our Contact Us form.

Other Collection and Use of Information


Application and System Logs

Pearson automatically collects log data to help ensure the delivery, availability and security of this site. Log data may include technical information about how a user or visitor connected to this site, such as browser type, type of computer/device, operating system, internet service provider and IP address. We use this information for support purposes and to monitor the health of the site, identify problems, improve service, detect unauthorized access and fraudulent activity, prevent and respond to security incidents and appropriately scale computing resources.

Web Analytics

Pearson may use third party web trend analytical services, including Google Analytics, to collect visitor information, such as IP addresses, browser types, referring pages, pages visited and time spent on a particular site. While these analytical services collect and report information on an anonymous basis, they may use cookies to gather web trend information. The information gathered may enable Pearson (but not the third party web trend services) to link information with application and system log data. Pearson uses this information for system administration and to identify problems, improve service, detect unauthorized access and fraudulent activity, prevent and respond to security incidents, appropriately scale computing resources and otherwise support and deliver this site and its services.

Cookies and Related Technologies

This site uses cookies and similar technologies to personalize content, measure traffic patterns, control security, track use and access of information on this site, and provide interest-based messages and advertising. Users can manage and block the use of cookies through their browser. Disabling or blocking certain cookies may limit the functionality of this site.

Do Not Track

This site currently does not respond to Do Not Track signals.

Security


Pearson uses appropriate physical, administrative and technical security measures to protect personal information from unauthorized access, use and disclosure.

Children


This site is not directed to children under the age of 13.

Marketing


Pearson may send or direct marketing communications to users, provided that

  • Pearson will not use personal information collected or processed as a K-12 school service provider for the purpose of directed or targeted advertising.
  • Such marketing is consistent with applicable law and Pearson's legal obligations.
  • Pearson will not knowingly direct or send marketing communications to an individual who has expressed a preference not to receive marketing.
  • Where required by applicable law, express or implied consent to marketing exists and has not been withdrawn.

Pearson may provide personal information to a third party service provider on a restricted basis to provide marketing solely on behalf of Pearson or an affiliate or customer for whom Pearson is a service provider. Marketing preferences may be changed at any time.

Correcting/Updating Personal Information


If a user's personally identifiable information changes (such as your postal address or email address), we provide a way to correct or update that user's personal data provided to us. This can be done on the Account page. If a user no longer desires our service and desires to delete his or her account, please contact us at customer-service@informit.com and we will process the deletion of a user's account.

Choice/Opt-out


Users can always make an informed choice as to whether they should proceed with certain services offered by InformIT. If you choose to remove yourself from our mailing list(s) simply visit the following page and uncheck any communication you no longer want to receive: www.informit.com/u.aspx.

Sale of Personal Information


Pearson does not rent or sell personal information in exchange for any payment of money.

While Pearson does not sell personal information, as defined in Nevada law, Nevada residents may email a request for no sale of their personal information to NevadaDesignatedRequest@pearson.com.

Supplemental Privacy Statement for California Residents


California residents should read our Supplemental privacy statement for California residents in conjunction with this Privacy Notice. The Supplemental privacy statement for California residents explains Pearson's commitment to comply with California law and applies to personal information of California residents collected in connection with this site and the Services.

Sharing and Disclosure


Pearson may disclose personal information, as follows:

  • As required by law.
  • With the consent of the individual (or their parent, if the individual is a minor)
  • In response to a subpoena, court order or legal process, to the extent permitted or required by law
  • To protect the security and safety of individuals, data, assets and systems, consistent with applicable law
  • In connection the sale, joint venture or other transfer of some or all of its company or assets, subject to the provisions of this Privacy Notice
  • To investigate or address actual or suspected fraud or other illegal activities
  • To exercise its legal rights, including enforcement of the Terms of Use for this site or another contract
  • To affiliated Pearson companies and other companies and organizations who perform work for Pearson and are obligated to protect the privacy of personal information consistent with this Privacy Notice
  • To a school, organization, company or government agency, where Pearson collects or processes the personal information in a school setting or on behalf of such organization, company or government agency.

Links


This web site contains links to other sites. Please be aware that we are not responsible for the privacy practices of such other sites. We encourage our users to be aware when they leave our site and to read the privacy statements of each and every web site that collects Personal Information. This privacy statement applies solely to information collected by this web site.

Requests and Contact


Please contact us about this Privacy Notice or if you have any requests or questions relating to the privacy of your personal information.

Changes to this Privacy Notice


We may revise this Privacy Notice through an updated posting. We will identify the effective date of the revision in the posting. Often, updates are made to provide greater clarity or to comply with changes in regulatory requirements. If the updates involve material changes to the collection, protection, use or disclosure of Personal Information, Pearson will provide notice of the change through a conspicuous notice on this site or other appropriate way. Continued use of the site after the effective date of a posted revision evidences acceptance. Please contact us if you have questions or concerns about the Privacy Notice or any objection to any revisions.

Last Update: November 17, 2020