GuideSMB

Patch Management That Actually Works

Dephiant ResearchAugust 22, 20214 min read

Most patch programs fail not because tools are bad, but because no one owns the calendar. Here is the operating model we recommend.

Most patch management programs, despite significant investment in tooling and personnel, frequently fall short of their objectives. This widespread failure is rarely attributable to shortcomings in the sophisticated tools themselves, but rather to a fundamental flaw in operational ownership: a lack of clear accountability for the patching calendar and the associated deployment timelines. At Dephiant Consulting Inc., our experience indicates that without a well-defined operating model, organizations struggle to maintain a consistent and effective patching rhythm. This article outlines an operating model designed to address these common pitfalls, emphasizing structured timelines, foundational prerequisites, and meaningful metrics.

Three Tracks, Three Service Level Agreements (SLAs)

Effective patch management necessitates a tiered approach, recognizing that not all vulnerabilities pose the same level of immediate threat. We advocate for categorizing patches into three distinct tracks, each with its own stringent Service Level Agreement (SLA) for deployment. This structured approach ensures that resources are appropriately allocated based on risk, preventing critical vulnerabilities from languishing while less urgent ones consume disproportionate attention.

Critical (actively exploited or CVSS 9+): These vulnerabilities represent an imminent and severe threat, often indicating active exploitation in the wild or possessing a Common Vulnerability Scoring System (CVSS) score of 9 or higher. For such critical patches, we mandate a stringent 72-hour deployment window. This aggressive timeline requires pre-approved authority for after-hours deployment, minimizing delays caused by standard change control processes. The urgency here is paramount, as a delay could lead to significant breaches or operational disruption. Organizations must have crisis response protocols in place to accelerate testing and rollout for these critical updates.
High (CVSS 7-8.9): Vulnerabilities falling into this category carry a substantial risk, often characterized by a CVSS score between 7 and 8.9. While not as immediately exploitable as critical vulnerabilities, they still present a significant attack surface. For these, a 14-day deployment window is recommended. This allows for scheduled testing and deployment during regular business hours, aligning with typical sprint cycles or bi-weekly maintenance windows, without imposing undue strain on operations. The slightly extended timeline ensures thorough testing to prevent regressions while still addressing the risk promptly.
Medium and below: This category encompasses vulnerabilities with lower risk profiles, typically those with CVSS scores below 7. While these still require attention, their immediate impact is generally lower. Patches for these vulnerabilities should be incorporated into a monthly maintenance window. This approach consolidates multiple less critical updates, streamlining the deployment process and reducing the overhead associated with frequent, smaller updates. These monthly windows should be predictable and communicated well in advance to all stakeholders, allowing for proper planning and resource allocation.

The Unsexy Parts: Inventory Accuracy and Reboot Discipline

The most sophisticated patch management tools are ultimately ineffective if the underlying infrastructure of the organization is not meticulously maintained. Two fundamental, yet often overlooked, elements are paramount to the success of any patch program: inventory accuracy and reboot discipline. These "unsexy" parts form the bedrock upon which all successful patch management strategies are built.

A robust and accurate Configuration Management Database (CMDB) is non-negotiable. If your CMDB does not precisely reflect the current state of your IT environment, including all servers, workstations, network devices, and critical applications, your reported patch coverage will invariably be inflated. We have observed instances where inaccurate inventories lead to an overestimation of compliance by 20-40%, creating a false sense of security. Without precise knowledge of what assets exist and their current patching status, organizations are effectively patching in the dark, leaving significant vulnerabilities exposed. Continuous reconciliation and automated discovery tools are essential to maintain CMDB integrity.

Equally critical is the ability and willingness to reboot systems when necessary. Many patches, particularly for operating systems and core infrastructure components, require a system reboot to fully apply and activate the security fix. If organizational policies or operational constraints dictate that servers can only be rebooted during infrequent, perhaps quarterly, change windows, then your patches are effectively not applied for extended periods. This creates a dangerous incongruity where systems are "patched" on paper but remain vulnerable in practice. Developing a culture of reboot discipline, where necessary reboots are planned, communicated, and executed promptly after patch application, is vital. This may involve implementing rolling reboots, leveraging clustered environments, or improving application uptime capabilities to minimize service disruption during these essential processes.

Measure What Matters: Mean Time To Patch (MTTP)

Traditional metrics often focus on a snapshot of compliance, such as the "percentage of devices patched" at a specific point in time. While this offers some insight, it fails to capture the dynamic nature of vulnerability management and can be misleading. The truly impactful metric for assessing the efficacy of a patch program is Mean Time To Patch (MTTP), broken out by severity category.

MTTP provides a measure of how quickly, on average, vulnerabilities are remediated from discovery to full deployment, reflecting the agility and responsiveness of the patching process. By segmenting MTTP by severity (Critical, High, Medium), organizations gain granular insight into their adherence to the established SLAs and can identify bottlenecks specific to different risk levels. For instance, a high MTTP for critical vulnerabilities, despite a high overall compliance percentage, indicates a significant organizational weakness.

This metric should be trended monthly and presented to governance committees or the board. While compliance percentages tend to decay over time as new vulnerabilities are discovered and new assets are introduced, a trended MTTP tells a comprehensive story about the security posture's evolution. It highlights improvements or regressions in the patching process, providing actionable intelligence for continuous improvement. Rather than debating a static compliance number, discussions can focus on the efficiency of the patching pipeline and strategic adjustments needed to reduce remediation times across all severity levels. This shift from static snapshots to dynamic, performance-oriented metrics empowers more intelligent decision-making and fosters a proactive security culture.

← Back to all insights

Talk to our team