State of AI-Era Defense Compliance Evidence
A field report on twelve attack patterns, their detection signatures, and the NIST 800-171A objectives they intersect.
Thesis
The attack surface of AI-era defense compliance is not the tooling. It is the evidence.
Every Defense Industrial Base contractor facing CMMC Level 2 assessment now has on-tap access to tools (ChatGPT, Claude, Gemini, Copilot) that can produce a plausible-looking System Security Plan, Plan of Action and Milestones, policy, procedure, and audit-log narrative in under an hour. Assessors (C3PAOs, Certified CMMC Assessors, Registered Practitioners) do not have a corresponding tool to detect what those AI-authored artifacts look like. The Phase 2 enforcement deadline (November 10, 2026) arrives into a market where the supply of compliance theater has decoupled from the supply of detection capability.
This field report closes that gap. It catalogs twelve attack patterns observed across pre-assessment packets, ships detection signatures for seven of them (shipped as a stdlib-only Python engine under MIT license), and documents the remaining five with signatures-only so that practitioners can build their own detectors.
Every pattern is mapped to the NIST 800-171A assessment objective it intersects and the CompTIA Security+ SY0-701 domain it teaches to.
The twelve attack patterns
Detection shipping in v0.3.1 (seven)
- H1Sentence structure flatness. LLM output exhibits a coefficient of variation on sentence length roughly half that of human-authored text. Weak signal alone, corroborating in aggregate.
- H2Cross-control boilerplate clustering. The same generative shell reused across 50+ unrelated controls produces high k-shingle Jaccard similarity. Catches the MSP industrialization pattern.
- H3Synthetic audit log timestamp regularity. Real event streams follow Poisson-ish distributions with microsecond precision. Synthetic logs are too regular or too round.
- H4Citation-heavy, mechanism-light narratives. LLMs over-cite control IDs. Real implementers name the tool and its configuration.
- H5Shallow citation graphs. AI-generated packets have depth 1-2 citation chains, high orphan rates, and circular citations between artifacts.
- H6Prompt leakage. Residue phrases ("As an AI language model," "Certainly! Here is," "[INSERT COMPANY NAME HERE]") are near-certain proof of un-edited LLM paste.
- H7Artifact specificity deficit. LLMs name mechanisms but do not ground them. Grounding tokens (versions, hashes, IPs, paths, ticket IDs, dates, filenames) are rare in synthetic artifacts.
Signature-only documentation (five, in the full report)
- Hallucinated control inheritance from cloud service providers the contractor does not actually use.
- Fabricated POA&M closure evidence where the remediation narrative contradicts the original finding.
- Synthetic incident response timeline where the dwell time, root cause, and lessons learned read as generic textbook boilerplate.
- Inverted scope claims where the SSP narrows the CUI boundary below what the contract requires, to reduce the packet's apparent burden.
- Template-inheritance drift where a legitimate template has been re-prompted so many times it lost its anchor to the contractor's real environment.
Why this matters for the market
No competitor publishes a detector for this class of artifact. Vanta, Drata, Secureframe, Prevail, and Apptega are selling more AI generation (auto-drafted SSPs). The generation is the attack surface. Detection is the defense.
The Phase 2 enforcement deadline will produce a cohort of failed first-cycle assessments. The public narrative about why they failed is not yet written. This report writes it.
How to read the full report
The full ~40-page report ships Sunday, April 27, 2026 at this URL. Each of the twelve sections follows the same format:
- Pattern description. What the attack looks like.
- Example. A redacted artifact exhibiting the pattern.
- Detection signature. The rule, regex, or statistical test.
- NIST 800-171A mapping. The assessment objective it intersects.
- Security+ domain. The training the pattern teaches to.
- Recommended control. What a contractor can do to avoid producing this pattern.
The report includes the full seven-heuristic engine as shipped in the companion repository, reproducible against the samples/ fixtures, with expected outputs committed in examples/.
What to do until April 27
- Clone the companion repository. Run
python3 -m unittest test_mismatch_engine_ai test_template_guard test_risk_delta. Confirm 65 tests pass. - Run the engine against the three sample packets. Confirm the expected outputs (CLEAN at 0.16, LIKELY_SYNTHETIC at 1.00, CLEAN at 0.00 with template guard).
- Read
KNOWN_BYPASSES.mdto understand what the engine does and does not catch. - Run the engine against one of your own real packets. Read the
USAGE_GUIDE.mdsection on the three workflows. - If you find a pattern not listed here, open an issue. We credit contributors in the next release.
Commitment
This summary is the committed pre-release. The full report on April 27 will not retract any pattern listed here, will not change any NIST mapping, and will not weaken any detection signature. It will add examples, a quantitative section on detection accuracy on an internal test corpus of 200 packets, and the five signature-only patterns in full.
The commitment bundle hash for v0.2 of the engine is:
Verify with python3 verify_commitment.py.