Home > Articles > Software Development & Management

IT Management Reference Guide

Jul 9, 2004

␡

⎙ Print

< Back Page 172 of 205 Next >

This is the third installment of a four-part section that identifies and discusses the 13 cardinal steps (see Figure 1) needed to initiate and maintain a business continuity program. In Part One I covered the first four of these steps and in the second part I discussed the steps five through eight. In this installment I explain steps nine through twelve. Steps nine and ten involve the development of business continuity recovery plans oriented toward business users and technical users, respectively. Steps eleven and twelve describe how to conduct validation and simulation tests. Step thirteen is the topic of Part Four of this series and explores operational tests, the most comprehensive and complex of the three testing exercises.

Acquire executive support.
Conduct a business impact analysis.
Perform threat analysis
Perform vulnerability analysis
Conduct risk assessment
Develop high-level recovery strategies
Develop detailed recovery strategies
Determine number and scope of recovery plans
Develop recovery plans oriented to business users
Develop recovery plans oriented to technical users
Conduct validation tests
Conduct simulation tests
Conduct operational tests

Figure 1 The 13 Cardinal Steps of a Business Continuity Program

Step 9: Develop Recovery Plans Oriented To Business Users

Up to this point I have shown you how to identify the critical business processes that need to be restored in the event of a disaster, and how to develop the high-level and detailed recovery strategies needed to enact such a restoration. We next need to develop the actual business continuity plans that will be used by business users to recover their critical processes.

I have seen a variety of methods used to develop such plans. Some shops keep it very simple and use nothing more than Word documents to prescribe their recovery steps. On the other end of the spectrum are those who use sophisticated, and expensive, tools specifically designed to this purpose. Many of my clients use a SQL relational database product from Strohl Software called the Living Disaster Recovery Planning System (LDRPS). It is very comprehensive and ideal for large shops with hundreds of plans to maintain. Many financial organizations use LDRPS because of their need to centralize and standardize plans for hundreds of branch offices.

The disaster recovery service provider Sungard also provides a tool, slightly less sophisticated than LDRPS, for developing plans. IBM and HP also supply business continuity plan development tools. Regardless of the tool selected, I believe there are six important attributes that characterize an effective business continuity plan:

Understandable – use simple wording that the reader will comprehend
Comprehensive – include all critical business processes and their dependencies
Accurate – ensure currency of phone numbers, personnel, software, hardware
Accessible – make the plans easily accessible; consider keeping copies on laptops, in thumb-drives, or at-home hardcopies
Maintainable – develop plans that are easy to update and distribute
Organized – organize the plan in a logical manner that follows actual recovery

As to the organization of the plan, it usually follows a pattern of four main sections, each with subgroups:

Response

Call trees

Internal contacts

Resources

recovery teams

suppliers

customers

software

hardware

Recovery

relocation procedures

business processes and dependencies

special supplies and telecommunications

Resumption

reverting back to permanent site

analysis of impact of the event

documentation of unique information

Business recovery plans will vary in size, complexity and scope depending on the type of environment they pertain to, but all will have these essential parts included in them.

Step 10: Develop Recovery Plans Oriented To Technical Users

Business continuity recovery plans oriented to technical users are very similar to those oriented to business users with one important exception: technical plans include steps to recover the IT infrastructure. Most business processes today depend heavily on software applications, databases, and network connections. These are the essential components of an IT infrastructure, and must be recovered in the event of a disaster in order to restore the business processes they support.

Some shops still refer to these types of IT business continuity plans as disaster recovery plans. If the components being restored are of a technical nature then this would be true. But normally there are business processes associated with the IT environment and for this reason the element of business continuity becomes a part of these plans as well.

Step 11: Conduct Validation Tests

There are primarily three types of testing, or exercises, used with business continuity plans:

Validation tests (conducted approximately every 3-6 months)
Simulation tests (conducted approximately every 6-12 months)
Operational tests (conducted approximately every 12-18 months)

This section describes validation tests, and the next two sections describe the other two. A validation test verifies the accuracy of the data within the plan. The specific data checked for includes:

employees' office telephone numbers
employees' mobile telephone numbers
employees' home telephone numbers
customers' contact information
suppliers' contact information
identification of all critical business processes
current recovery time objectives (RTOs) of all processes
current response point objectives (RPOs) of all processes
all dependencies of all critical business processes
identification of all currently needed software
current version, release and patch levels of software
identification of all currently needed hardware
current model numbers of all needed hardware

Planners usually organize telephone numbers into call trees in which a higher level person, such as a manager or a lead, calls several subordinates who in turn may call other members of the team. In this way planners can contact the maximum number of individuals in the minimum amount of time. Organizers conduct call tree tests by having each person who is assigned numbers actually call the individuals, usually off hours, and tracking if the people called and the numbers used are still accurate.

Plan owners normally contact business users to verify that business processes and their dependencies are still valid. Similarly, planners will contact appropriate IT personnel and suppliers to ensure that software versions and hardware model numbers remain current.

Step 12: Conduct Simulation Tests

A simulation test is often referred to as a Table Top Exercise because it is usually conducted with all key participants of the recovery sitting around a table (or teleconferencing in) and going through the business continuity plan step by step to assess the validity and viability of the plan. A previous segment of this Management Guide offers a detailed discussion of this topic in a four-part series under the heading of 'Conducting an Effective Table Top Exercise' in the Business Continuity Section.

This third part covered the development of business continuity plans for both business and technical users, and the conducting of two of the three types of tests: validation and simulation. Part Four is the final installment of this series on implementing a business continuity program. It explains operational testing in which business processes and their supporting software applications are functionally restored and tested by business users at recovery sites.

< Back Page 172 of 205 Next >

🔖 Save To Your Account

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.

Email Address