During Service Design and Service Transition, IT services are designed for availability and recovery. Service Operation is responsible for actually making the IT service available to the specified users at the required time and at the agreed levels.
During Service Operation the IT teams and users are in the best position to detect whether services actually meet the agreed requirements and whether the design of these services is effective.
What seems like a good idea during the Design phase may not actually be practical or optimal. The experience of the users and operational functions makes them a primary input into the ongoing improvement of existing services and of their design.
However, there are a number of challenges with gaining access to this knowledge:
Most of the experiences of the operational teams and users are either informal, or spread across multiple sources.
The process for collecting and collating this data needs to be formalized.
Users and operational staff are usually fully occupied with their regular activities and tasks, so it is very difficult for them to be involved in regular planning and design activities. One argument often made here is that if design is improved, the operational teams will spend less time resolving problems and will therefore have more time to be involved in design activities. However, practice shows that as soon as staff are freed up, they often become the target of workforce reduction exercises.
Having said this, there are three key opportunities for operational staff to be involved in Availability Improvement, since these are generally viewed as part of their ongoing responsibility:
Review of maintenance activities. Service Design will define detailed maintenance schedules and activities, which are required to keep IT services functioning at the required level of performance and availability. Regular comparison of actual maintenance activities and times with the plans will highlight potential areas for improvement. One of the sources of this information is a review of whether Service Maintenance Objectives were met and, if not, why not.
Major problem reviews. Problems could be the result of any number of factors, one of which is poor design. Problem reviews therefore may include opportunities to identify improvements to the design of IT services, which will include availability and capacity improvement.
Involvement in specific initiatives, either as part of the follow-up to major problems or as part of an ongoing service improvement programme, in collaboration with dedicated Availability Management staff. Operational staff may contribute using techniques such as Service Failure Analysis (SFA), Component Failure Impact Analysis (CFIA) or Fault Tree Analysis (FTA), or as members of Technical Observation (TO) activities. These Availability Management techniques are explained in more detail in the Service Design publication.
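The regular comparison of actual maintenance times with the plans, described under the review of maintenance activities, could be supported by a very simple report. The following is a minimal sketch only: the `MaintenanceWindow` record, the `overruns` function, the tolerance parameter and the sample figures are all hypothetical and are not taken from any ITIL publication or tool.

```python
from dataclasses import dataclass


@dataclass
class MaintenanceWindow:
    """One completed maintenance activity (hypothetical record layout)."""
    service: str
    planned_minutes: int
    actual_minutes: int


def overruns(windows, tolerance_minutes=0):
    """Return (service, overrun_minutes) pairs, largest overrun first.

    A window is reported only when the actual duration exceeded the
    planned duration by more than the given tolerance.
    """
    flagged = []
    for w in windows:
        delta = w.actual_minutes - w.planned_minutes
        if delta > tolerance_minutes:
            flagged.append((w.service, delta))
    return sorted(flagged, key=lambda pair: pair[1], reverse=True)


# Illustrative data only; the services and durations are invented.
windows = [
    MaintenanceWindow("email", planned_minutes=60, actual_minutes=95),
    MaintenanceWindow("payroll", planned_minutes=120, actual_minutes=110),
    MaintenanceWindow("intranet", planned_minutes=30, actual_minutes=40),
]

for service, minutes in overruns(windows, tolerance_minutes=5):
    print(f"{service}: overran plan by {minutes} minutes")
```

A list of this kind only identifies where to look. The review itself still has to establish why the Service Maintenance Objectives were missed.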
There may be occasions when operational staff themselves need downtime of one or more services to conduct their operational or maintenance activities. This may impact availability if it is not properly scheduled and managed. In such cases they must liaise with SLM and Availability Management staff, who will negotiate with the business and users, often through the Service Desk, to agree and schedule such activities.