← Insights
StrategyGuide

Cyber Hygiene Metrics Your Engineers Will Trust

Dephiant Research4 min read

Engineering teams treat most security metrics like marketing numbers, directionally true, locally meaningless. Here are five that survive engineering scrutiny.

Cyber Hygiene Metrics Your Engineers Will Trust

Engineering teams often view traditional security metrics with a healthy dose of skepticism, frequently dismissing them as "marketing numbers", directionally true in their general sentiment but locally meaningless in their practical application. This disconnect arises because many common security metrics fail to provide actionable insights or directly reflect tangible security posture improvements. For metrics to gain traction and drive engagement within engineering organizations, they must be perceived as relevant, accurate, and indicative of outcomes that engineers can directly influence. The following five metrics are specifically chosen for their ability to withstand engineering scrutiny, offering clear, actionable objectives that resonate with technical teams and genuinely contribute to an enhanced security posture.

The Five Essential Metrics

These five metrics are designed to provide clear, actionable insights that engineering teams can actively work towards improving, fostering a culture of security accountability and measurable progress.

  1. Mean time to patch critical CVEs, measured per service and tracked weekly. This metric directly addresses the speed and efficiency with which critical vulnerabilities, identified as Common Vulnerabilities and Exposures (CVEs), are remediated across an organization's various services. Tracking this weekly allows for immediate identification of bottlenecks or underperforming teams, driving a culture of rapid response to emergent threats. A consistently low mean time to patch indicates an agile and security-conscious engineering operation, reducing the window of opportunity for attackers to exploit known weaknesses.
  2. % of production services with documented owners. The principle here is simple yet profound: if a critical production service lacks a clearly defined owner, accountability for its security and maintenance becomes diffusive and ultimately nonexistent. This metric tracks the proportion of all production services that have an assigned individual or team responsible for their upkeep, security, and incident response. A high percentage signifies clear lines of responsibility, ensuring that security issues, including vulnerabilities and misconfigurations, have a designated party to address them, preventing services from becoming security orphans.
  3. Build-pipeline secret count, trending toward zero as you adopt OIDC-based access. This metric focuses on the number of hardcoded secrets (API keys, passwords, tokens) embedded directly within build pipelines or source code repositories. These secrets represent significant security risks if exposed. The goal is to aggressively reduce this count towards zero, primarily through the adoption of more secure, dynamic secret management practices like OpenID Connect (OIDC)-based authentication for machine-to-machine communication, where identities are validated and temporary credentials are issued on demand, eliminating the need to store long-lived secrets.
  4. Coverage of phishing-resistant MFA for service accounts and human accounts separately. Multi-Factor Authentication (MFA) is a critical control against unauthorized access, but its effectiveness varies depending on the resistance to phishing attacks. This metric tracks the percentage of both human user accounts and critical service accounts that are protected by phishing-resistant MFA methods, such as FIDO2/WebAuthn or hardware security keys. Differentiating between human and service accounts is crucial as their usage patterns and attack vectors differ, allowing for tailored security strategies and accurate reporting on the adoption of the strongest available authentication mechanisms.
  5. % of incidents with a written postmortem within 10 business days. While a lagging indicator, this metric offers invaluable insight into the health and maturity of an organization's incident response process. It tracks the proportion of security incidents that are followed by a comprehensive written postmortem report within a reasonable timeframe (e.g., 10 business days). Timely postmortems ensure that lessons learned from incidents are captured, analyzed, and disseminated, preventing recurrence and driving continuous improvement in security posture, incident handling procedures, and overall resilience. A consistently high percentage reflects a commitment to learning and operational excellence.

Why These Metrics Trump the Usual Suspects

The typical security metrics often presented in compliance reports or executive summaries, such as raw counts of security controls implemented, lists of audit findings, or aggregate vulnerability totals from automated scanners, frequently fall short of providing meaningful signals to engineering teams. These common metrics are often either rate-of-work measures (how much security activity is happening) or scan artifacts (snapshot data from automated tools). While they have their place in compliance checklists, engineers quickly discern that they do not accurately reflect the actual exposure or risk profile of the organization.

For instance, a high "vulnerability count" from a scanner might be inflated by low-risk issues or false positives, making it difficult to prioritize genuine threats. Similarly, merely reporting the "number of security controls in place" doesn't indicate their effectiveness or correct configuration.

The five metrics advocated here survive engineering scrutiny precisely because they describe outcomes you can directly improve. They focus on tangible results and behaviors that technical teams can influence through their daily work:

  • Patching speed is a direct measure of response efficiency.
  • Service ownership is a fundamental organizational practice.
  • Secret management reflects secure coding and deployment standards.
  • MFA coverage indicates the strength of access controls.
  • Post-incident analysis drives learning and resilience.

These metrics offer clear goals and provide real-time feedback on the efficacy of security efforts, transforming security from a compliance burden into an integral aspect of engineering quality and operational excellence. By focusing on these outcome-oriented metrics, organizations can foster greater alignment between security and engineering, ultimately building a more robust and defensible digital infrastructure.