Major tech failures like the CrowdStrike crisis are a stark warning

A cloud storage vector illustration featuring an image of a cloud with connecting server links
(Image credit: Shutterstock)

Today’s society is more dependent on technology than ever before. Virtually every aspect of our daily lives is intertwined with IT infrastructure, from the way we communicate and work to how we access essential services like banking and healthcare. This deep integration means that even the smallest error in a system can have profound consequences, affecting millions of people and causing significant disruptions.

This was starkly illustrated by the recent IT outage caused by an update to CrowdStrike's cybersecurity software, which impacted 8.5 million Windows PCs globally. The disruption affected a wide range of critical services, from airlines and online transactions to cash machines and card payments for major retailers. It even led to the cancellation of NHS appointments and operations. Incidents like this highlight the vulnerabilities within our technological infrastructure and underscore the urgent need for organizations to take proactive steps to prevent such failures in the future.

Unfortunately, the CrowdStrike incident is not an isolated case but rather a symptom of a broader issue within our technology. In recent months, we’ve seen similar widespread outages affecting major companies like McDonald's, Greggs, Deliveroo, Tesco, and several organizations within the banking and finance sector. These outages left many people unable to access their online banking or send and receive payments. The impact on consumers is significant, leading to frustration, worry and a loss of trust in the systems they rely on. A recent survey conducted by Roq revealed that a third of Brits experienced a technology failure in the banking sector within the past year. This clearly indicates an urgent need for greater investment in ensuring that the technologies and systems we rely on every day are robust, reliable, and future-proof.

Stephen Johnson

Founder and CEO of Quality Engineering consultancy at Roq.

Quality engineering

As the CEO and founder of Roq, I have witnessed firsthand the repercussions of prioritizing speed and cost-cutting over Quality Engineering. Technology outages are becoming more frequent, more disruptive, and more damaging to both businesses and consumers. The current economic climate, with its emphasis on efficiency, has led many organizations to underinvest in quality—a short-sighted approach that is ultimately counterproductive. While cutting corners may save money in the short term, the long-term costs of repeated technology failures, in terms of financial loss and damage to reputation, can be far greater.

Unfortunately, these costs are usually borne by consumers, who are left dealing with the fallout—whether it’s being unable to access their bank accounts, missing out on critical healthcare services, or facing delays in essential transactions, the impact on consumers is both significant and unacceptable. Moreover, poor-quality technology today will still need fixing tomorrow, and by postponing necessary investments, organisations are merely kicking the can down the road. Eventually, they will have to spend even more to correct these issues. Organizations must do more to ensure the reliability of their systems by making a robust commitment to Quality Engineering, which is essential for avoiding repeated failures and protecting consumers in the long run.

Resilient systems

To prevent future incidents and build more resilient systems, I believe there are several key steps that organizations must take:

Understand business risks fully: Key stakeholders must invest time upfront to clearly define the quality standards required for each deployment and ensure these are met. Quality cannot be an afterthought.

Foster a culture of quality at speed: Shift the organizational mindset to align quality with rapid deployment, reducing the likelihood of costly errors and outages. Quality and speed should go hand in hand.

Commit to long-term quality investment: Ensure that sufficient resources are allocated for the ongoing maintenance and upgrades of core technology, preventing the consequences of underinvestment.

Encourage cross-functional collaboration: Promote collaboration between development, QA, security, and operations teams to build a unified approach to Quality Engineering and system robustness.

Implement rigorous change management processes: Thoroughly review, test, and document all changes to minimize the risk of introducing new issues into the system.

Increase delivery flow through continuous testing: Utilize extensive test automation to facilitate faster deployments without compromising on quality.

Strong commitment to quality

Ultimately, a strong commitment to quality must come from the top of the organization. Executives play a pivotal role in fostering a culture that values Quality Engineering and upholds the principles necessary to deliver high-quality, reliable technology. By acknowledging the importance of robust Quality Engineering processes and investing in the necessary tools, training, and resources, organizations can mitigate the risks associated with technology failures and potentially avoid them altogether. Until the quality of technology is seen as a serious risk factor at board level, we will continue to encounter these outages.

These failures should serve as a wake-up call for organizations worldwide. It’s time to build quality technology solutions at pace to deliver long-term resilience rather than short-term savings. Embracing Quality Engineering is not just about counting defects and preventing outages; it’s about building a resilient technological foundation that supports innovation, growth, and a commitment to consumer experience. As technology becomes increasingly critical to consumer trust, companies that invest in quality will enjoy stronger brand reputations and greater consumer confidence over time. The time to act is now. We must move from merely reacting to outages to proactively preventing them. By fortifying our systems and technology today, we can ensure that they remain robust and reliable for the consumers of tomorrow.

We've featured the best IT management tool.

This article was produced as part of TechRadarPro's Expert Insights channel where we feature the best and brightest minds in the technology industry today. The views expressed here are those of the author and are not necessarily those of TechRadarPro or Future plc. If you are interested in contributing find out more here: https://www.techradar.com/news/submit-your-story-to-techradar-pro

Stephen Johnson is founder and CEO of Quality Engineering consultancy at Roq.

Read more
A stylized depiction of a padlocked WiFi symbol sitting in the centre of an interlocking vault.
Being ready when the cyber crisis happens
A person standing in front of a rack of servers inside a data center
Is your business primed to respond to downtime?
Security
Removing software supply chain blind spots that put public sector organizations at risk
A digital representation of a lock
The true threat of business downtime
Eu
Is your business ready for DORA? Cisco ThousandEyes outlines the "three pillars" everyone needs to have in place to be resilient
Internet outage
Nearly all companies expect a major outage in 2025
Latest in Pro
cybersecurity
What's the right type of web hosting for me?
Security padlock and circuit board to protect data
Trust in digital services around the world sees a massive drop as security worries continue
Hacker silhouette working on a laptop with North Korean flag on the background
North Korea unveils new military unit targeting AI attacks
An image of network security icons for a network encircling a digital blue earth.
US government warns agencies to make sure their backups are safe from NAKIVO security issue
Laptop computer displaying logo of WordPress, a free and open-source content management system (CMS)
This top WordPress plugin could be hiding a worrying security flaw, so be on your guard
construction
Building in the digital age: why construction’s future depends on scaling jobsite intelligence
Latest in News
Quordle on a smartphone held in a hand
Quordle hints and answers for Sunday, March 23 (game #1154)
NYT Strands homescreen on a mobile phone screen, on a light blue background
NYT Strands hints and answers for Sunday, March 23 (game #385)
NYT Connections homescreen on a phone, on a purple background
NYT Connections hints and answers for Sunday, March 23 (game #651)
Google Pixel 9 Pro Fold main display opened
Apple is rumored to be prioritizing battery life on the foldable iPhone – which could also feature a liquid metal hinge for added durability
Google Pixel 9
The Google Pixel 10 just showed up in Android code – and may come with a useful speed boost
L-mount alliance
Sirui joins L-Mount Alliance to deliver its superb budget lenses for Leica, DJI, Sigma and Panasonic cameras