5 The Swiss Cheese model of safety incident causation
In the previous chapter, we learned that PCBUs are responsible for the health and safety of workers, wherever they undertake this work. Occupational health was identified as a specialist field requiring PCBUs to stay engaged with public health, WHS regulators, and to hire occupational health expertise as needed, ranging from consulting ergonomists through to occupational therapists. We also learned the difference between accidents, which we cannot control, and incidents which we, as HR managers involved in WHS management, seek to control to a reasonably practicable level. This chapter will specifically focus on incidents and their potential prevention through adopting James Reason’s theorical Swiss Cheese Model of safety incident causation.
Learning Objectives
This chapter introduces:
- The characteristics of large-scale WHS incidents and the lessons learned.
- James Reason’s Swiss Cheese Model of safety incident causation.
A WHS incident is “an unplanned event or chain of events that results in losses such as fatalities or injuries” (Wolters Kluwer, n.d., para. 1), however, incidents (lost time injuries/disease events) are only one of three types of potential WHS events:
- Lost time injuries/diseases: Those occurrences that resulted in a fatality, permanent disability or time lost from work of one day/shift or more.
- Injuries without lost time: Those occurrences that were not lost time injuries and for which first aid and/or medical treatment was administered.
- Near-misses: Unplanned incidents that occurred at the workplace that, although not resulting in any injury or disease, had the potential to do so.(Archer et al., 2015, p. 86)
Injuries without lost time and near misses, as we will learn across this chapter, can be useful indicators of the functionality and effectiveness of WHS management in your business.
Notably, most WHS theory has emerged from in-depth analysis of large-scale WHS incidents. These critical incidents may be defined as a “sudden, unexpected and overwhelming event, that is out of the range of expected experiences” (UNHCR, 2019, para. 1); these are low probability–high risk events that unfold rapidly across a specific point of time (Reason, 1997a).
James Reason is a preeminent WHS scholar who has examined the role of human factors in large-scale WHS incidents. He noted that most WHS managers were focused on human errors, the “errors and violations committed at the ‘sharp-end’ of the system…likely to have a direct impact on the safety of the system and, because of the immediacy of their adverse effects, these acts are termed active failures” (Reason, 1997a, p. 10). Reason, and other scholars such as Hopkins & Palser (1987) and Hudson (2007), identified that critical incidents were never just one person’s fault, there were different issues in and around organisations that affected the people involved and led to the incident. Reason wanted to prove that we really should not blame-the-victim for system failures.
Reason then observed that every workplace has hazards, but they are not injuring workers all the time or at the same rate – some organisations have catastrophes while others have low rates of injury even when workers are doing the very same job with exposure to the same hazards.
Reason concluded that it depends on the effectiveness of the organisation to identify the hazard, assess its risk—the actual potential for injury to occur to a worker—and undertake measures to reduce the risk to the workers through hazard control (Reason, 1997a). This led him to him conceptualising the Swiss Cheese Model of safety incident causation to explain this phenomenon (Figure 5.1).
Figure 5.1: Swiss Cheese Model of safety incident causation (after Reason,1997a)
Source: “Swiss Cheese Model” by Ben Aveling, Wikimedia Commons, CC BY-SA 4.0
Reason proposes that safety in a workplace may be conceptualised as a block of cheese comprised of slices that act as layers of defence against hazards. Given the internal and external factors putting pressure on organisations and their workers, weaknesses will develop in these defence layers. These weaknesses, ‘holes’, are not always predictable or static; they pop up when safety issues arise and disappear when safety issues are resolved. Weaknesses can occur in different defence layers, within different parts of the organisation. As such, Reason argues, work safety looks more like Swiss cheese (Reason, 1997a).
In the model, we can visualise a hazard as an arrow. If the arrow is able to pass through holes in all the safety defence layers, an incident occurs leading to worker injury or harm. Essentially, when multiple layers of safety defences are weak, there are more holes in the cheese and a greater probability that the hazard arrow can shoot straight through to cause more incidents (Reason, 2000). In contrast, a near miss, “an event that could have potentially resulted in…losses, but the chain of events stopped in time to prevent this” (Wolters Kluwer, n.d., para. 1), is when at least one of the safety defence layers stops the hazard passing through, therefore preventing the hazard from having contact with worker or at least minimising the harm. Ideally, as depicted in Figure 5.1, most hazards would be stopped and losses prevented.
Returning to consider Archer’s definition of WHS events (Archer et al., 2015), injuries without lost time and near misses can then be re-conceptualised as key performance indicators (KPIs) measuring the effectiveness of an organisation’s safety defence system. Reason’s Swiss Cheese Model infers that the more incidents you have, the weaker your safety defences, and that the more near misses you have, the greater the likelihood of an incident occurring, unless interventions are adopted to strengthen the safety defences (Reason, 1997a). Measuring WHS events, even the low impact ones, then becomes useful to effectively managing WHS (National Safety Council, 2013). Notably, organisations striving for good safety management would try to repair the holes in the Swiss cheese as promptly as possible to minimise the probability of all the holes lining up and the hazard arrow shooting through.
What comprises the defence layers? Reason (1997a, p. 7) states that the defences in the model are designed to:
- create understanding and awareness of the local hazards
- give clear guidance on how to operate safely
- provide alarms and warnings when danger is imminent
- restore the system to a safe state in an off-normal situation
- interpose safety barriers between the hazards and the potential losses
- contain and eliminate the hazards should they escape this barrier
- provide the means of escape and rescue should hazard containment fail.
Clearly this model is based on a blame-the-system, rather than a blame-the-victim, approach as the defence layers are comprised of more than solely addressing the errors made by individual workers.
A key assumption of the Swiss Cheese Model of safety incident causation is that:
Fallibility is an inescapable part of the human condition, it is now recognized that people working in complex systems make errors or violate procedures for reasons that generally go beyond the scope of individual psychology….poor design, gaps in supervision, undetected manufacturing defects or maintenance failures, unworkable procedures, clumsy automation, shortfalls in training, less than adequate tools and equipment…may be present for many years before they combine with local circumstances and active failures to penetrate the system’s many layers of defences. (Reason, 1997, p. 10)
Workers are not to blame, according to Reason, because it is the latent conditions that fail to protect workers and lead to incidents.
So perhaps we could consider managing active failures, the human errors, as only one layer of safety defences—a people layer—whereas the majority of safety defence layers may seek to uncover and resolve the latent conditions, issues that otherwise would lay dormant in the system until the active error occurs. If we reconsider Reason’s (1997) latent conditions, these might cluster around engineering (poor design, clumsy automation, less than adequate tools and equipment), procedural (gaps in supervision, unworkable procedures) and administrative (shortfalls in training). For our purposes, in an HR context, we might reconfigure these to think about an organisation’s internal systems as potential overarching safety defence layers:
- A people layer – safety focused workers with relevant values, behaviours and safe decision making (we will discuss safety culture in Chapter 6).
- An engineering layer – with alarms, physical barriers and automatic shutdowns.
- A procedural layer – with instructions on safe work practices.
- An administrative layer – ensuring people are hired with the right skills and provided the right training to perform tasks safely and are also, for example, given a safe work uniform.
Avoiding human error then only corresponds to only one layer of safety defences because, as Reason observed via his investigations of large-scale WHS disasters, you cannot blame-the-victim and achieve better organisational safety outcomes (Reason, 1997a). His Swiss Cheese Model illustrates that when there are weaknesses (latent conditions) in the safety defences layers (such as the engineering, procedural and administrative systems within an organisation) and these converge with human error, an incident will occur. Taking this view means that the Swiss Cheese Model of safety incident causation can see that it can be used two different ways:
- After an incident has occurred to help understand what went wrong, to learn from mistakes and improve the health and safety defences.
- Before an incident occurs to help us identify any weaknesses in our safety defences, to intervene and fix these before an incident occurs.
For example, we can now understand the preceding discussions of occupational health (Chapter 4)—particularly health monitoring—as being situated in the people safety defence layer and striving to reduce the risk of human errors (active failures), but clearly there are many other safety imperatives in our organisations that require a blame-the-system philosophy in order to manage them.
Box 5.1: Video 3, An introduction to work health and safety management
The following video reviews and conceptualises James Reason’s Swiss Cheese Model of safety incident causation.
A transcript of this video is available here.
Source: Sheridan, L. (producer, narrator). (2019). Video 3: An introduction to work health and safety management.
Preston, A. (audio engineer); Orvad, A., (artist) and Franks, R., (animator), Learning, Teaching and Curriculum. University of Wollongong, Australia. YouTube
Activity 5.1
Take this five question quick quiz to review your understanding of key concepts introduced throughout Chapter 5 and see how your learning is progressing.
When used proactively, the Swiss Cheese Model presents opportunities to examine the unique latent conditions that exist within our workplaces that, if we simply blamed the victim, we would not be able to identify, learn from and address in order to reduce the risk of hazards inflicting harm on workers. The following Contemporary WHS challenge, considers a culture of care leading to aged care staff incurring injuries. This was only discoverable by adopting a blame-the-system approach to understand why care workers were deliberately putting themselves at risk of harm when undertaking their duties of care towards the elderly.
Contemporary WHS Challenges: Aged Care work
Figure 5.2: Aged care as work
Source: pxhere.com, CC0
Due to the demographic aging of Australia, the demand for aged care services has, and will continue, to increase (Australian Government, 2023). The same trend is occurring in New Zealand (StatsNZ, 2022). The 2021 Australia Royal Commission into Aged Care Quality and Safety revealed serious concerns about the quality of care being provided to elderly people in aged care facilities, in part, because of the increased casualisation of work, the ‘gig economy’, and a lack of oversight of these workers with regards to their education and training (Royal Commission into Aged Care Quality and Safety, 2021). This lack of training, as a latent condition, has potential serious implications for managing WHS in aged care.
The aged care sector is, by its very nature, a challenging workplace with hazards and risks including “lifting, supporting and moving patients, moving and handling equipment such as beds, mattresses, trolleys and wheelchairs, occupational violence, work-related stress and bullying and harassment” (WorkSafe Victoria, n.d.). Ensuring adequate training for staff to manage these hazards, and their risks, is a challenging but important task. However, the nature of care work presents other, less obvious, challenges that still need to be managed.
Box 5.2: Neil Logan WHS in aged care
In the following video, Neil Logan discusses the conundrum of workers who injure themselves when striving to provide quality care to the elderly.
Source: Sheridan, L. (Producer). (2016). Dementia concepts for business curricula. Learning, Teaching and Curriculum, University of Wollongong, Australia. YouTube
Reflect:
Imagine you are an HR manager for an aged care organisation. How would you address the moral conundrum of ensuring care workers provide optimal care to your elderly residents while sustaining their compliance with your organisation’s WHS policies?
Further reading:
Sheridan, L. & Agim, T. (2014). Aged care safety dilemma: Caring for self versus caring for residents. Australian Journal on Aging 33(4), 283–285.
In conclusion, active failures are complex human-centric factors which makes them difficult to predict and control. However, latent conditions can be identified and fixed before an incident occurs. It is here that HR managers have a pivotal role to play in WHS management; in supporting all the policies, procedures, training and performance management that leads to effective identification of the pre-determinable hazards and the fortification of safety defences that prevent the occurrence of incidents. Keeping latent conditions under control is key to effective safety management. To achieve this, most organisations will come to choose a coordinated, cyclical, approach; they will adopt a safety management system —and these are the focus of the next chapter.
A PCBU, a person conducting a business or undertaking, “may be an individual person or an organisation. This does not include workers or officers of PCBUs, volunteer associations, or home occupiers that employ or engage a tradesperson to carry out residential work” (WorkSafe, 2019, p. 4).
Occupational health had its modest beginnings in first aid and disease controls for high risk heavy industry workplaces, such as mines, but gained greater recognition in the 1970s when the World Health Organization acknowledged its contribution to the identification of workplace-derived factors causing occupational illness and suggested its remit should be broadened to encompass public health. So, today, many occupational health specialists have more of a public health, and less of an immediate workplace, focus (Schilling, 1989).
“Public health is the science and art of preventing disease, prolonging life, and promoting physical health and efficiency through organized community efforts for the sanitation of the environment, the control of community infections, the education of the individual principles of personal hygiene, the organization of medical and nursing service for the early diagnosis and preventative treatment of disease, and the development of the social machinery which will ensure to every individual in the community a standard of living adequate for the maintenance of health” (Windslow, 1920, p. 30)
In principle, and in accordance with the Australian Model Work Health and Safety Bill, a regulator is a legally established government body whose functions are:
“ (a) to advise and make recommendations to the Minister and report on the operation and effectiveness of this Act;
(b) to monitor and enforce compliance with this Act;
(c) to provide advice and information on work health and safety to duty holders under this Act and to the community;
(d) to collect, analyse and publish statistics relating to work health and safety;
(e) to foster a co-operative, consultative relationship between duty holders and the persons to whom they owe duties and their representatives in relation to work health and safety matters;
(f) to promote and support education and training on matters relating to work health and safety;
(g) to engage in, promote and co-ordinate the sharing of information to achieve the object of this Act, including the sharing of information with a corresponding regulator;
(h) to conduct and defend proceedings under this Act before a court or tribunal;
(i) any other function conferred on the regulator by this Act”
(Safe Work Australia, 2023, section 152).
"Someone who studies the design of furniture or equipment and the way this affects people's ability to work effectively" (Cambridge Dictionary, 2023, para. 1).
"Occupational therapy is the art and science of helping people take part in everyday living through their occupations" (Occupational Therapy Board of New Zealand, n.d., para. 1)
Note: Occupation in this context is not limited to the occupations of work and employment.
An accident which is defined as “something bad that happens that is not expected or intended and that often damages something or injuries someone” (Cambridge Dictionary, 2023, para. 1) or “something that happens by chance or without expectation; an event that is without apparent or deliberate” (Oxford English Dictionary, n.d., para. 6).
In the WHS context, “An incident is an unplanned event or chain of events that results in losses such as fatalities or injuries, damage to assets, equipment, the environment, business performance or company reputation” (Wolters Kluwer, n.d., para. 1).
Australia's Model Work Health and Safety Bill defines reasonably practicable as:
"In this Act, reasonably practicable, in relation to a duty to ensure health and safety, means that which is, or was at a particular time, reasonably able to be done in relation to ensuring health and safety, taking into account and weighing up all relevant matters including:
(a) the likelihood of the hazard or the risk concerned occurring; and
(b) the degree of harm that might result from the hazard or the risk; and
(c) what the person concerned knows, or ought reasonably to know, about:
(i) the hazard or the risk; and
(ii) ways of eliminating or minimising the risk; and
(d) the availability and suitability of ways to eliminate or minimise the risk; and
(e) after assessing the extent of the risk and the available ways of eliminating or minimising the risk, the cost associated with available ways of eliminating or minimising the risk, including whether the cost is grossly disproportionate to the risk" (WorkSafe Australia, 2023, Section 18).
"James Reason proposed the image of "Swiss cheese" to explain the occurrence of system failures...According to this metaphor, in a complex system, hazards are prevented from causing human losses by a series of barriers. Each barrier has unintended weaknesses, or holes – hence the similarity with Swiss cheese. These weaknesses are inconstant – i.e., the holes open and close at random. When by chance all holes are aligned, the hazard reaches the patient and causes harm" (Perneger, 2005, p. 71).
"Theories about the causes of industrial accidents can be classified into two broad types: those which emphasize the personal characteristics of the workers themselves and those which locate the causes in the wider social, organisational or technological environment. The former approach is conveniently termed blaming-the-victim and the latter, blaming the-system" (Hopkins & Palser, 1987, p. 26).
"A group or set of related or associated things perceived or thought of as a unity or complex whole" (Oxford Dictionary, 2023, para. 1).
A hazard may be defined as "anything with the potential to harm life, health or property" (Dunn, 2012, p. 53).
"Likelihood and consequence of injury or harm occurring” (Standards Australia & Standards New Zealand, 2001, p. 5).
Hazard control is determining "...appropriate ways to eliminate the hazard, or control the risk when the hazard cannot be eliminated" (Canadian Centre for Occupational Health and Safety, n.d., para. 6).
Near misses are "unplanned incidents that occurred at the workplace that, although not resulting in any injury or disease, had the potential to do so” (Archer et al., 2015, p. 86). When contextualised in the Swiss Cheese Model of incident causation, they can be understood as "an event that could have potentially resulted in…losses, but the chain of events stopped in time to prevent this” (Wolters Kluwer, n.d., para. 1).
Three types of potential WHS events that occur at different scales and levels of severity:
“Lost time injuries/diseases: Those occurrences that resulted in a fatality, permanent disability or time lost from work of one day/shift or more.
Injuries without lost time: Those occurrences that were not lost time injuries and for which first aid and/or medical treatment was administered.
Near-misses: Unplanned incidents that occurred at the workplace that, although not resulting in any injury or disease, had the potential to do so” (Archer et al., 2015, p. 86)
“Key performance indicators (KPIs) are used to measure and monitor whether an organization is on the right track…KPIs play an important role in modern organizations improving performance is key to achieving organizational success” (Madsen and Stenheim, 2022, para. 1).
"Theories about the causes of industrial accidents can be classified into two broad types: those which emphasize the personal characteristics of the workers themselves and those which locate the causes in the wider social, organisational or technological environment. The former approach is conveniently termed blaming-the-victim and the latter, blaming the-system" (Hopkins & Palser, 1987, p. 26).
"Human beings contribute to the breakdown of such [complex technological] systems in two ways. Most obviously, it is by errors and violations committed at the 'shape end' of the system...Such unsafe acts are likely to have a direct impact on the safety of the system and, because of the immediacy of their adverse effects, these acts are termed active failures" (Reason, 1997, p. 10).
"Fallibility is an inescapable part of the human condition, it is now recognized that people working in complex systems make errors or violate procedures for reasons that generally go beyond the scope of individual psychology. These reasons are latent conditions....poor design, gaps in supervision, undetected manufacturing defects or maintenance failures, unworkable procedures, clumsy automation, shortfalls in training, less than adequate tools and equipment - may be present for many years before they combine with local circumstances and active failures to penetrate the system's many layers of defences" (Reason, 1997, p. 10).
“A culture that supports an organization’s OH&S management system is largely determined by top management and is the product of individual and group values, attitudes, managerial practices, perceptions, competencies and patters of activities that determine commitment to, and the style and proficiency of, its OH&S management system” (Standards Australia and Standards New Zealand, 2018), p. 27).
"A gig economy is a labor market that relies heavily on temporary and part-time positions filled by independent contractors and freelancers rather than full-time permanent employees" (Investopedia, 2022, para. 2)
A safety management system is “ a systematic approach to managing safety. It is designed to continuously improve safety performance through the identification of hazards, the collection and analysis of safety data and safety information, and the continuous assessment of safety risks” (Civil Aviation Authority of New Zealand, 2023, p. 12).