AI and LLM Usage Disclosure

How HailBytes products use large language models internally, what leaves your deployment, and what does not.

This page answers the AI-shaped questions HailBytes receives during enterprise procurement: where LLMs sit inside HailBytes products, which providers we use, what customer data crosses the network boundary to those providers, and what HailBytes does and does not train any model on. The companion MCP reference covers the inverse direction: how your own AI agents call HailBytes.

Where LLMs Sit Inside HailBytes Products

Remediation Guidance Generation

For each vulnerability finding, HailBytes ASM can generate a plain-language description, impact summary, and remediation guidance via an LLM call. The output is cached per finding with full provenance (provider, model, model version, generation timestamp) and stored alongside the deterministic finding payload.

Model record: LLMVulnerabilityReport
Cached: per finding, replayable

Phishing-Email Body Drafting (SAT)

HailBytes SAT can draft phishing-email bodies for simulation campaigns from a customer-provided brief. Drafts are reviewable and editable before any campaign sends. Send-side machinery is deterministic; LLM involvement ends at the template-drafting step.

Scope: template drafting only
Review gate: human-edit before send

What HailBytes Does NOT Use LLMs For

Several procurement questionnaires ask whether ML or LLM-based systems make security-relevant decisions. The honest answer is: not today.

Asset Attribution

Attribution of discovered assets to a target organization is deterministic. Inputs: hostname matching, certificate subject and SAN, BGP and ASN ownership, exclude lists, and project-scoped uniqueness constraints. No ML inference in the attribution pipeline.

False Positive Reduction

FP handling is deterministic: deduplication on write, explicit STATUS_FALSE_POSITIVE with audit trail, and per-engine EXCLUDED_SUBDOMAINS or EXCLUDE_EXTENSIONS lists. ML-based FP reduction is on the roadmap and is not in production today. Honest gap.

Risk Prioritization

The 0-to-100 composite risk score per vulnerability is rule-based. Inputs: severity, CVSS base score, EPSS score, CISA KEV status, Domain.business_criticality propagation, and threat-intel enrichment from named providers. No ML model rerank step.

Customer-Selectable LLM Providers

Under HailBytes' BYOC posture every deployment chooses its own LLM provider. The customer decides whether any LLM call leaves the deployment perimeter at all.

Hosted Providers

For deployments that accept hosted LLM calls, HailBytes ASM supports OpenAI, Anthropic, and Google Gemini. The provider and model name are configured per deployment. Per-call provenance is recorded with each cached output.

  • OpenAI (model selectable)
  • Anthropic (Claude family, model selectable)
  • Google Gemini (model selectable)
  • Provider switch is a deployment-time setting

Self-Hosted / Air-Gapped (Ollama)

For deployments that cannot send any data to a hosted LLM, HailBytes ASM ships an Ollama backend with NVIDIA CUDA and AMD ROCm GPU support. The LLM runs inside the customer's own deployment perimeter; no outbound LLM traffic crosses the BYOC boundary. This is the recommended posture for regulated and classified deployments.

  • Fully self-hosted, no external API calls
  • GPU acceleration via CUDA or ROCm
  • Model choice is customer's (Llama, Mistral, etc.)
  • Compatible with AWS GovCloud and Azure Government

What Crosses The Network Boundary

When a deployment is configured for a hosted LLM provider, the prompts sent to that provider contain:

  • Public CVE-shaped context: CVE ID, CWE ID, CVSS vector, the source nuclei-template URL, and the matcher name that triggered the finding.
  • Minimal asset context: the affected subdomain or hostname, the affected port, and the asset's business-criticality classification.
  • The vulnerability evidence excerpt: a bounded slice of the captured request and response, sufficient for the LLM to describe what was observed.

The prompts do NOT contain:

  • Customer SIEM credentials, API keys, OAuth tokens, LDAP bind passwords, or any value from the encrypted-at-ORM-layer credential store.
  • Customer user account data (email addresses, SSO group memberships, audit-log payloads).
  • Full scan history or the corpus of past findings.
  • Any data from other Projects or Organizations inside the same deployment (per-project query scoping is enforced before any LLM call constructs its prompt).

For deployments on the Ollama / self-hosted path, no data crosses the BYOC perimeter at all. The LLM call is local loopback traffic inside the customer's own VPC or subscription.

Training and Retention

HailBytes Does Not Train Models

HailBytes does not train, fine-tune, or distill any LLM, ML model, or embedding model on customer scan data, customer findings, customer reports, or any input the customer provides to HailBytes products. There is no proprietary "HailBytes model" built from aggregated customer telemetry. The models referenced on this page are the third-party hosted or self-hosted models the customer selects at deployment time.

Hosted Provider Retention

HailBytes deployments call hosted provider APIs through the provider's standard enterprise endpoints. Provider-side retention is governed by the provider's own data-processing terms (OpenAI, Anthropic, and Google Gemini all publish enterprise terms that exclude API traffic from training corpora by default). Customers procuring HailBytes via marketplace private offer can layer their existing enterprise agreement with the provider for additional guarantees.

Audit and Replay

Every LLM-generated remediation guidance entry is stored with:

  • Provider identifier (e.g., anthropic, openai, ollama).
  • Model identifier (e.g., claude-sonnet-4-6, gpt-4o-2024-08-06).
  • Generation timestamp.
  • Token-count attribution to the per-Project billing rollup at /billing/projects/.

This makes LLM output reproducible and reviewable. A compliance auditor or a customer security team can replay any generated guidance against a known model version and verify the output is consistent with the recorded provenance.