Skip to main content
Wednesday, May 16, 2012

Awareness and Training

Another critical component, which will determine the success of business continuity management (BCM) is awareness and training. This section will focus on the creation of a program to increase employee awareness and enhance the skills required to develop, implement, maintain, and execute the business continuity plan.

Audits...

Audits are reports that continually review plans, emergency and recovery tests and exercises. The purpose of auditing is to ensure the continuing readiness of the plan and should be conducted prior to, during, and after a test exercise. It is the responsibility of the BCM manager to make sure that the plan is periodically audited. His or her responsibilities include scheduling, conducting, coordinating and evaluating the auditing process and making sure that senior management is kept aware of the results. These audits make senior management aware of the fact that all necessary critical data is being verified and is available should the need occur.

Some examples of auditing frequencies are:

  • Off-site storage or vault inventories audits should be conducted a minimum of once every quarter both with and without notification. Calling back critical data sets from the vault, and testing/exercising the application on-site or off-site, is a viable way to conduct and report on auditing results.
  • Pyramid Call Tree audit should be conducted at least semi-annually or more frequently depending on personnel changes. This will periodically audit and verify the accuracy of the employee contact information contained in the plan.
  • A Hot-Site test (if applicable) or exercise conducted no less than semi-annually should also include an audit that reports on the problems and situations encountered while exercising the Hot-Site recovery procedures.
  • On-site or remote test and exercise audits, which could include tabletop exercises, should be conducted semi-annually.
  • Critical applications audits should be conducted a minimum of once every quarter to verify that recovery can be performed with in established recovery time objectives.
  • Lessons learned reporting is also a beneficial way to provide senior management with a current status of the BCM program.

The BCM manager should evaluate the results of every audit carried out to ensure that it is properly carried out. The manager will then formulate a written report to senior management indicating results and recommendations for plan improvement.

Types of Audits:
  • Telecommunications and work flow verification.
  • Daily maintenance and debugging of applications.
  • System problems and resolutions.
  • Required critical forms.
  • On-site/off-site storage and shipping of critical data sets.
  • Outsource recovery plans.
  • Vendor problems and resolutions.
  • Resource problems and resolutions.

Training...

No matter how well crafted a plan may be, it is only as good as the performance of the participants. The purpose of the training is to get people to perform instinctively, with the precision that a live disaster requires. In order to achieve this purpose, the participants must first know the plan. The assumption that people know what to do when "the fire bell rings" is a dangerous one. The following types of training should be considered when developing a program:

  1. Orientation - We should conduct orientation when new procedures are introduced and when new people enter the workplace. Individual employees must be trained on basic emergency procedures, such as:
    1. Locations of fire extinguishers.
    2. Recognition of alarms, emergency exits and evacuation routes.
    3. Basic first aid procedures.
    4. Who to notify in the case of an emergency.
  2. Although managers and supervisors conduct the training, the plan coordinator is responsible for the content of the training and for the follow-up to see that training does, in fact, take place.

  3. On-site Emergency Preparedness Testing - The chief means by which the plan coordinator determines the effectiveness of orientation is by conducting emergency preparedness drills. The objectives of such drills are:
    1. Validate employee readiness.
    2. Verify that equipment is maintained, handled properly, and functions as intended.
    3. Discover changes made in procedures made but not updated in the plan.

  4. Tabletop/Scenario Testing - The target audience for tabletop scenarios are those data processing and management employees who are responsible for the more detailed and technical tasks in the plan. This group writes up the circumstances or scopes of an event for consideration.


  5. On-Site Functional Application Testing - As the title suggests, the test is performed to validate application functionality. The purpose is to discover any procedural changes that may affect the plan. Functional application testing is usually conducted each quarter.


  6. Off-Site Functional Application Testing - This test is performed to test the Hot or Back-up site environment. It is conducted in the same manner as the on-site functional application testing except that the environment is off-site. Off-site testing is usually conducted semi-annually.


  7. Responsibility of the Business Continuity Planner is:
    1. Recovery team members are trained.
    2. Attend appropriate education seminars to keep abreast of latest advances in recovery procedures.
    3. Scope and method of training - Should include classroom instruction as well as facility tours to show the locations of fire extinguishers, fire alarms, emergency exits, first aid kits, and all emergency utility shut-off.
    4. Frequency - Depending on each office environment, orientation should be conducted once a month for new employees with functional departmental testing occurring every 6 months.
    5. Evaluation of results - Should be conducted after each and every test to ensure that training has been properly conducted and to measure the level of employee understanding. A formal report should be written and submitted to management indicating results and recommendations to improve the training process.

Maintenance Testing...

Emergency tests are simulations of actual emergency and disaster conditions and if conducted correctly should determine a realistic timeframe for recovery.

  1. Responsibility - The recovery coordinator is responsible for periodic testing of the plan and his/her responsibilities would include:
    1. The scope of the test is agreed on.
    2. The objectives of the test are agreed on.
    3. A change management request is raised to book time and personnel.
    4. Contracts are raised with external vendors for equipment etc.
    5. Agreements are gained from affected bodies (internal or external).
    6. Briefings of personnel are held on a regular basis.
    7. Independent observers are selected when appropriate.
    8. Preparations and support is put in place (catering, accommodations, travel, etc).
    9. Business areas briefed about the test and the potential impact to those who are left.
    10. Notification of the test to all areas of the business.
    11. The test is executed to a strict project plan with a clear cut-off time.
    12. Detailed notes are taken during the test describing, in detail, the proceedings.
    13. A post review meeting is held to discuss outcome.
    14. A test report is written collating all logs and key findings.
    15. Plans are amended and strategies altered to reflect findings.

  2. Scope and Method of Training - On-site procedures, such as exercising critical applications and testing strategies and procedures stated in the plan, must be conducted periodically both with and without notification. Preparation for a test or exercise should include:
    1. The SCOPE - The goal you set for the test or exercise and a way to measure your accomplishments.
    2. The AGENDA - Schedule of events and times you plan to follow.
    3. The LEVEL OF SUPPORT - The involvement of special groups or teams, which will aid in the success of the test or exercise.
    4. The STATISTICS - These will give you the exact times used during a recovery test or exercise and an estimate of the recovery time needed during a real recovery event.

  3. Frequency
    1. On-site/Remote testing should be conducted once a quarter.
    2. Hot-site/Off-site testing semi-annually.
    3. Pyramid Call Tree testing semi-annually.

  4. Evaluation of Results - Should be conducted after each and every test to ensure that training has been properly conducted and to measure the level of employee understanding. A formal report should be written and submitted to management indicating results and recommendations to improve the training process.


  5. Types of Testing - Typically, recovery testing begins with component testing, and through a series of successful tests, progresses to comprehensive testing.
    1. Component testing - Is narrowed in focus and designed to test the recovery capability of a specific piece of the overall recovery plan. Some examples might include:
      1. Testing the Call Tree.
      2. Auditing the off-site procedures and contents.
      3. Restoring only the operating system.
    2. Comprehensive testing - Focuses on the entire set of business recovery plans and is designed to test overall recovery capabilities. Examples of comprehensive testing might include the following:
      1. Companies mainframe environment (operating systems, software products, production and test applications, data, etc).
      2. Companies midrange environment (operating systems, software products, production and test applications, data, etc).
      3. LAN/WAN environment.
      4. Centralized services (purchasing, legal, mail services, facilities, etc).
      5. Business operations areas.

Business Continuity Planning
Additional References