Cloud Spectra Gateway - User Guide (v1 Configuration Reference)

Introduction & how configuration works

Cloud Spectra Gateway runs entirely inside your own AWS account. It is a software networking and AI appliance that replaces metered Amazon VPC networking services and metered LLM-API spend with a fixed EC2 cost -- Your Cloud, Off the Meter. There is no vendor control plane: traffic, configuration, logs and (for local inference) prompt data stay in your account at all times.

This page is the full v1 configuration reference. It is organized into the three product tiers. The tiers are cumulative -- each higher tier is a strict superset of the one below it:

Network

Network Gateway

Replaces NAT Gateway, NLB, Elastic IPs and a forward proxy at a flat fee.

7 features:

Source NAT (sNAT)
Destination NAT / port forwarding (dNAT)
Network Load Balancer (IPVS L4)
TLS termination (ACM)
Per-AZ Auto Scaling
Vertical + horizontal scaling (GWLB)
Forward HTTP proxy + caching (Squid)

Security

Security Gateway

Everything in Network, plus inline inspection and content filtering.

Adds 3 features:

Suricata IDS/IPS (inline, NFQUEUE)
nftables firewall rules
Domain / URL filtering

AI Gateway

Everything in Security, plus an in-account AI data plane.

Adds 3 features:

AI LLM Proxy + response caching
Local AI Inference (vLLM)
AI Semantic Cache

Three ways to configure -- one source of truth

Every feature on this page is set the same way regardless of which interface you use. The dashboard, the Terraform provider, and the configuration API all write into the same place: AWS SSM Parameter Store, in your account. The gateway instances poll that configuration and reconcile the running data plane toward it.

Interface	Endpoint / surface	Best for
Dashboard (Angular)	HTTPS port `443` (TLS via HAProxy)	Interactive setup, status, day-2 operations
Configuration API (REST)	Port `8080`	Scripting, CI/CD, custom automation
Terraform provider	`cloudspectra/cloudspectra`	Declarative infrastructure-as-code
State store (read by data plane)	SSM Parameter Store	The single source of truth all three write to

flowchart LR
    DASH["Dashboard
(HTTPS 443)"]
    API["Config API
(REST 8080)"]
    TF["Terraform provider
cloudspectra/cloudspectra"]
    SSM["AWS SSM
Parameter Store
(your account)"]
    DP["Gateway data plane
(per-AZ ASG instances)"]
    DASH --> SSM
    API --> SSM
    TF --> API
    SSM -->|"poll & reconcile"| DP

For each feature below you will find: what it does, what AWS service or meter it replaces or uses, how to configure it (the dashboard panel, with a note that the same setting is available through the Terraform provider and the configuration API), and how to verify it. Every setting shown is a v1 feature; nothing here is roadmap or preview.

New to the product? Start with the Quick Start to deploy in under 15 minutes, then return here for the per-feature reference. For the data-plane internals, see Architecture. For common questions, see the FAQ.

Network Gateway tier Network

The Network Gateway tier is the base appliance. It delivers seven data-plane features that replace AWS-metered networking services with a fixed EC2 cost. The diagram below shows the common packet paths these features sit on -- outbound NAT, inbound port forwarding, the in-appliance L4 load balancer, and the forward proxy.

flowchart LR
    subgraph VPC["Your VPC"]
      PRIV["Private instances"]
    end
    GW["Cloud Spectra Gateway
(per-AZ ASG behind GWLB)"]
    EIP["Elastic IP
(stable endpoint)"]
    INET["Internet"]
    PRIV -->|"0.0.0.0/0 route"| GW
    GW -->|"sNAT"| EIP --> INET
    INET -->|"dNAT / port fwd"| EIP --> GW --> PRIV
    GW -->|"forward proxy (Squid)"| INET
    GW -->|"IPVS L4 load balance"| PRIV

Source NAT (sNAT)

What it does. Provides outbound internet access for instances in private subnets. The gateway source-translates private traffic to its Elastic IP so private instances can reach the internet without a public IP of their own.

Replaces. The AWS NAT Gateway -- specifically its hourly charge and its per-GB data-processing meter. With Cloud Spectra, outbound egress carries no per-GB networking fee beyond standard EC2 data-transfer.

Configure. In the dashboard, open NAT Gateway and enable outbound NAT. Then point each private subnet's route table at the gateway ENI for that Availability Zone (default route 0.0.0.0/0). The same setting is available through the Terraform provider (cloudspectra_nat_gateway) and the configuration API (PUT /api/cloudspectra/nat-gateway).

Setting	Meaning
`enable`	Turn outbound source NAT on or off.
Per-AZ ENI route	Each private subnet routes `0.0.0.0/0` to its own AZ's gateway ENI (avoids cross-AZ data charges).

Verify. From a private instance routed through the gateway:

curl -s https://checkip.amazonaws.com
# Returns the gateway's Elastic IP -- confirms egress is source-NATed.

Destination NAT / port forwarding (dNAT)

What it does. Forwards inbound TCP connections that arrive at the gateway's public endpoint to a private target inside the VPC. Use it to expose a service running on a private instance without giving that instance a public address.

Replaces / uses. A common pattern that would otherwise require a public-facing load balancer or a bastion. dNAT pairs naturally with the gateway's Elastic IP as the stable inbound endpoint.

Configure. In the dashboard, open Port Forwarding and add a rule mapping a listen port on the gateway to a private destination IP and port. The same setting is available through the Terraform provider and the configuration API.

Field	Meaning
Listen port	The TCP port the gateway accepts on its public endpoint.
Destination IP	The private target instance address.
Destination port	The port on the target to forward to.
Protocol	TCP.

Verify. From outside the VPC, connect to the gateway's Elastic IP on the listen port and confirm you reach the private service:

curl -s http://<GATEWAY_EIP>:<listen_port>/
# Reaches the forwarded private target.

Network Load Balancer (IPVS L4)

What it does. Provides Layer-4 load balancing inside the appliance using the Linux IPVS engine. The gateway distributes incoming TCP/UDP flows across a set of backend targets, and keeps the IPVS target set in sync with an AWS Network Load Balancer target group so you manage targets the AWS-native way.

Replaces. The metered per-LCU/NLCU cost of an AWS Network Load Balancer for L4 traffic, while still using the AWS NLB target set as the source of truth for membership.

Configure. In the dashboard, open Load Balancer, enable the IPVS L4 balancer, and select the AWS NLB target set to mirror. The same setting is available through the Terraform provider (cloudspectra_nlb_gateway) and the configuration API.

flowchart LR
    CL["Client"] --> GW["Gateway
IPVS L4 virtual service"]
    AWSNLB["AWS NLB target group
(source of truth)"] -. "sync targets" .-> GW
    GW --> T1["Target 1"]
    GW --> T2["Target 2"]
    GW --> T3["Target 3"]

Verify. Send repeated L4 connections to the virtual service and confirm they spread across the backend targets (for example, by checking which target served each request in your application logs), and confirm the IPVS target list matches the AWS NLB target group membership.

TLS termination (ACM)

What it does. Terminates HTTPS at the gateway using HAProxy, presenting a certificate issued by AWS Certificate Manager (ACM). Inbound TLS on port 443 is decrypted at the appliance; plaintext on port 80 is redirected to HTTPS.

Uses. An ACM certificate from your account. The same HAProxy listener also fronts the management dashboard on 443.

Configure. In the dashboard, open TLS Termination and select the ACM certificate to present. The same setting is available through the Terraform provider and the configuration API.

Setting	Meaning
ACM certificate ARN	The certificate HAProxy presents on port 443.
HTTP redirect	Port 80 requests are redirected to HTTPS.

Verify. Inspect the served certificate and confirm the redirect:

# Confirm the ACM certificate is presented on 443
openssl s_client -connect <GATEWAY_EIP>:443 -servername your.domain </dev/null 2>/dev/null | openssl x509 -noout -issuer -subject

# Confirm HTTP redirects to HTTPS
curl -sI http://<GATEWAY_EIP>/  # 301/302 to https://

Per-AZ Auto Scaling

What it does. Runs one Auto Scaling Group per Availability Zone. Each AZ's instances egress through that AZ's own ENI, so traffic stays within its zone. This is the design that keeps a gateway healthy in every AZ while avoiding cross-AZ data-transfer charges.

Replaces / uses. Uses EC2 Auto Scaling. The per-AZ split avoids the cross-AZ data-transfer cost a single shared appliance would incur when instances in one AZ NAT through an ENI in another.

Configure. Availability Zones are chosen at deployment (the CloudFormation template or the Terraform module). For each enabled AZ the stack creates a dedicated subnet, ENI, route table, and Auto Scaling Group. Point each AZ's private subnets at that AZ's gateway ENI. The same routing is settable through the Terraform provider and the configuration API.

flowchart TD
    subgraph AZa["AZ a"]
      ASGa["ASG a"] --> ENIa["ENI a"]
    end
    subgraph AZb["AZ b"]
      ASGb["ASG b"] --> ENIb["ENI b"]
    end
    subgraph AZc["AZ c"]
      ASGc["ASG c"] --> ENIc["ENI c"]
    end
    PA["Private subnets in AZ a"] --> ENIa
    PB["Private subnets in AZ b"] --> ENIb
    PC["Private subnets in AZ c"] --> ENIc

Tip. Always route an AZ's private subnets to that AZ's gateway ENI. Routing across AZs works but re-introduces the cross-AZ data charge this design is meant to eliminate.

Verify. Confirm that each enabled AZ has a running gateway instance and its own ENI, and that each private subnet's route table targets the ENI in its own AZ.

Vertical & horizontal scaling (GWLB)

What it does. Scales the gateway two ways. Vertically, you can change the EC2 instance size. Horizontally, the fleet scales out behind a Gateway Load Balancer (GWLB) using GENEVE encapsulation, so multiple instances share the data-plane load. GWLB connection draining lets active flows finish before an instance is removed.

Replaces / uses. Uses a Gateway Load Balancer for horizontal scale-out and a range of EC2 instance types for vertical sizing. This removes the single-box ceiling (one NIC, fixed cores) of a fixed virtual appliance.

Configure. In the dashboard, open Scaling to change instance type (vertical) and the desired/min/max instance counts per AZ (horizontal). The same settings are available through the Terraform provider and the configuration API.

Setting	Meaning
Instance type	Vertical sizing -- resize the gateway instances live.
Min / desired / max	Horizontal sizing -- how many instances per AZ behind the GWLB.
Connection draining	GWLB lets in-flight flows finish before an instance is removed.

flowchart LR
    SRC["Traffic"] --> GWLB["Gateway Load Balancer
(GENEVE)"]
    GWLB --> I1["Gateway instance 1"]
    GWLB --> I2["Gateway instance 2"]
    GWLB --> I3["Gateway instance N"]

Verify. Increase the desired count and confirm new instances register healthy behind the GWLB target group and begin taking traffic; reduce the count and confirm draining lets active flows complete before an instance is removed.

Forward HTTP proxy + caching (Squid)

What it does. Runs an outbound forward proxy based on Squid. It supports proxy authentication, HTTP response caching, domain filtering, and per-client bandwidth limits -- giving you a single controlled egress point for HTTP/HTTPS traffic.

Replaces / uses. A standalone forward-proxy deployment. Caching reduces repeated outbound fetches (and therefore egress), and the filtering/bandwidth controls give you policy at the egress edge.

Configure. In the dashboard, open Forward Proxy, enable the proxy on a configurable port, and set authentication, caching, domain rules, and bandwidth limits. The same settings are available through the Terraform provider (cloudspectra_web_proxy_gateway) and the configuration API.

Setting	Meaning
Proxy port	The listen port for the forward proxy (configurable).
Authentication	Username/password required to use the proxy.
Response caching	Caches cacheable HTTP responses to cut repeated fetches.
Domain filtering	Allow / deny outbound by domain (see Domain / URL filtering).
Bandwidth limits	Per-client egress rate caps.

Verify. Configure a client to use the proxy and confirm egress flows through it:

export https_proxy=http://proxyuser:PASSWORD@<GATEWAY_IP>:<proxy_port>
curl -s https://checkip.amazonaws.com   # Returns the gateway EIP
# Repeat a cacheable request and confirm a cache HIT in the proxy access log.

Security Gateway tier Security

The Security Gateway tier includes everything in the Network tier and adds three inline enforcement features. Together they let you inspect, allow, and deny traffic at the data plane. The diagram shows where each one sits in the packet path.

flowchart LR
    IN["Inbound / outbound packet"] --> NFT["nftables
stateless allow/deny"]
    NFT -->|"allowed"| NFQ["NFQUEUE -> Suricata
IDS/IPS inline"]
    NFT -.->|"denied"| DROP1["Drop"]
    NFQ -->|"pass"| DOM["Domain / URL filter
(firewall + Squid paths)"]
    NFQ -.->|"alert / block"| DROP2["Drop / log"]
    DOM -->|"allowed"| FWD["Forward / NAT"]
    DOM -.->|"blocked domain"| DROP3["Drop"]

Suricata IDS/IPS (inline)

What it does. Runs Suricata as an inline intrusion detection and prevention system. Packets are handed to Suricata via the Linux NFQUEUE mechanism, so Suricata can inspect them and enforce a deny or allow verdict at the data plane (not just alert).

Replaces / uses. An inline IDS/IPS layer, enforced in your account on the gateway instances. No separate inspection endpoints are required.

Configure. In the dashboard, open IDS/IPS, enable inline inspection, and choose the action mode (detect vs. prevent) and rule sets. The same settings are available through the Terraform provider and the configuration API.

Setting	Meaning
`enable`	Turn inline Suricata inspection on or off.
Mode	Detect (alert only) or prevent (drop matching traffic).
Rule sets	The signature rule sets Suricata enforces.

Sizing. Inline inspection is CPU-intensive. Run the Security and AI Gateway tiers on an instance with 4 or more vCPUs; the inline inspection services will not start on smaller instances.

Verify. Generate a benign test signature match from a routed instance and confirm it appears in the Suricata events/threat view in the dashboard; in prevent mode, confirm the matching traffic is dropped.

nftables firewall rules

What it does. Enforces stateless allow/deny rules in the Linux kernel with nftables. Rules match on source and destination CIDR, port, and protocol, and are applied at the data plane on every gateway instance.

Replaces / uses. A kernel-level packet filter on the gateway. It complements the inline Suricata layer: nftables makes fast stateless allow/deny decisions; Suricata performs deep inline inspection on the traffic nftables permits.

Configure. In the dashboard, open Firewall and add rules. The same settings are available through the Terraform provider (cloudspectra_firewall_gateway) and the configuration API.

Field	Meaning
Action	Allow or deny.
Source CIDR	Source address range the rule matches.
Destination CIDR	Destination address range the rule matches.
Port	Destination port (or range).
Protocol	TCP / UDP / etc.

Verify. Add a deny rule for a known destination/port from a routed test instance and confirm the connection is blocked; remove it and confirm the connection succeeds again.

Domain / URL filtering

What it does. Blocks or allows traffic by domain. The policy is enforced on both data paths: the firewall path and the Squid forward-proxy path. This gives you consistent domain policy whether a client routes through NAT or through the proxy.

Replaces / uses. Domain-based egress control. Enforced in-account, on the same gateway instances.

Configure. In the dashboard, open Domain Filtering, choose an allow-list or deny-list model, and add domains. The same settings are available through the Terraform provider and the configuration API.

Setting	Meaning
Mode	Allow-list (only listed domains permitted) or deny-list (listed domains blocked).
Domains	The domain patterns the policy applies to.
Enforcement paths	Applied on both the firewall path and the Squid proxy path.

Verify. Add a domain to the deny-list and confirm it is blocked from a routed instance and also blocked when going through the proxy; confirm a non-listed domain still resolves and connects.

AI Gateway tier AI Gateway

The AI Gateway tier includes everything in the Security tier and adds an in-account AI data plane. It exposes an OpenAI-compatible endpoint on port 8090, can serve models locally on GPUs in your account, and caches responses to cut cost and latency. The diagram traces a request from a client through the gateway: a cache check first, then a route either to a remote provider (Amazon Bedrock, OpenAI, Anthropic) or to a local vLLM model, with a configurable fallback.

flowchart LR
    CLIENT["Client app
OpenAI base URL -> :8090"] --> EXACT{"Exact-match
cache hit?"}
    EXACT -->|"yes"| RESP["Cached response"]
    EXACT -->|"no"| SEM{"Semantic
cache hit?"}
    SEM -->|"yes"| RESP
    SEM -->|"no"| ROUTE{"Route by model"}
    ROUTE -->|"bedrock/openai/anthropic"| REMOTE["Remote provider"]
    ROUTE -->|"local/<model>"| VLLM["Local vLLM
(in-account GPU)"]
    VLLM -.->|"pre-first-token failure
+ overflow policy"| FB["Fallback remote model"]
    REMOTE --> METER["Token metering
+ audit log"] --> RESP
    VLLM --> METER
    FB --> METER

AI LLM Proxy with response caching

What it does. An OpenAI-compatible reverse proxy for LLM API calls. It fronts Amazon Bedrock, OpenAI, and Anthropic behind a single endpoint, with exact-match response caching, token metering, and audit logging. Clients keep using their existing OpenAI client libraries -- they just point the base URL at the gateway on port 8090.

Replaces / uses. A centralized LLM egress point. Exact-match caching returns identical prior responses without re-billing the provider; token metering and audit logging give you per-call accounting in your own account.

Configure. In the dashboard, open AI Gateway, enable the proxy, and configure providers, caching, metering, and audit logging. The same settings are available through the Terraform provider (cloudspectra_ai_gateway) and the configuration API.

Setting	Meaning
`enable`	Turn the AI LLM proxy on or off.
Providers	Amazon Bedrock, OpenAI, Anthropic.
Response caching	Exact-match caching of identical requests.
Token metering	Per-call token accounting.
Audit logging	Records requests for review.

OpenAI-compatible usage. Point your OpenAI client at the gateway's base URL on port 8090 and call the standard OpenAI API routes:

# Set the base URL to the gateway on port 8090
export OPENAI_BASE_URL=http://<GATEWAY_IP>:8090/v1

# Standard /v1/chat/completions request through the gateway
curl -s http://<GATEWAY_IP>:8090/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello, world"}]
  }'

# Route to a Bedrock or Anthropic model just by changing the model name:
curl -s http://<GATEWAY_IP>:8090/v1/chat/completions \
  -H "Authorization: Bearer $PROVIDER_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "anthropic.claude-3-5-sonnet",
    "messages": [{"role": "user", "content": "Hello, world"}]
  }'

Verify. Send the same chat-completion request twice and confirm the second is served from cache (lower latency, no second provider charge), and confirm the call appears in the audit log with token counts.

Local AI Inference (vLLM)

What it does. Serves LLMs locally on GPU instances in your own account using vLLM, exposed through the same OpenAI-compatible endpoint. Local models are addressed with the local/<model> prefix in the model field. On a pre-first-token failure, the gateway can fall back to a configured remote model according to an overflow policy.

Replaces / uses. In-account GPU serving with vLLM. The fallback can target a remote provider (Bedrock / OpenAI / Anthropic) when local capacity is unavailable.

Overflow policy. The fallback behavior on a pre-first-token failure is governed by the overflow policy:

Policy	Behavior on pre-first-token failure
`queue`	Hold the request and wait for local capacity to free up.
`spill`	Spill over to the configured remote fallback model.
`reject`	Reject the request rather than send it elsewhere.

Data sovereignty. If you leave the remote fallback empty, prompt data never leaves your account -- requests are served only by the local in-account vLLM model (and queued or rejected per the overflow policy), never spilled to a third-party provider.

Configure. In the dashboard, open Local Inference, register a local vLLM model, and set the overflow policy and (optionally) the remote fallback model. The same settings are available through the Terraform provider and the configuration API.

# Call a local model via the OpenAI-compatible endpoint
curl -s http://<GATEWAY_IP>:8090/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "local/llama-3.1-8b-instruct",
    "messages": [{"role": "user", "content": "Summarize this internal doc."}]
  }'

Verify. Send a request to a local/<model> name and confirm it is served by the local vLLM instance; then exercise the overflow policy (for example, with spill, confirm a pre-first-token failure falls back to the configured remote model; with an empty fallback, confirm the request is queued or rejected and never leaves the account).

AI Semantic Cache

What it does. Adds an embedding-based cache that recognizes similar prompts, not just byte-identical ones. Where the exact-match cache only hits on an identical request, the semantic cache returns a stored response for a sufficiently similar prompt -- raising the overall cache hit rate.

Replaces / uses. Extends the response caching of the AI LLM proxy. Higher hit rates mean fewer billed provider calls and lower latency for near-duplicate prompts.

Configure. In the dashboard, open AI Gateway -- Semantic Cache and enable it. The same setting is available through the Terraform provider and the configuration API.

Setting	Meaning
`enable`	Turn the embedding-based similar-prompt cache on or off.
Layering	Runs after the exact-match cache; only consulted on an exact-match miss.

Verify. Send a prompt, then send a slightly reworded but semantically equivalent prompt, and confirm the second is served from the semantic cache (lower latency, no new provider charge).

Ports & endpoints reference

The gateway exposes the following ports. Restrict access to each to the smallest client CIDR that needs it.

Port	Service	Protocol	Tier	Notes
`443`	Management dashboard (HTTPS)	TLS	Network	TLS via HAProxy; Angular dashboard.
`8080`	Configuration API (REST)	HTTP	Network	Writes config to SSM Parameter Store.
`8090`	AI Gateway (OpenAI-compatible)	HTTP	AI Gateway	Set your OpenAI base URL here.
configurable	Forward HTTP proxy (Squid)	HTTP	Network	Proxy listen port is configurable.
`80`	HTTP redirect	HTTP	Network	Redirects to HTTPS.

Security. Lock down the dashboard (443) and the configuration API (8080) to your administrative CIDR. Restrict the AI Gateway port (8090) and the forward-proxy port to the client networks that need them. The gateway runs in your account; you control its security groups.

Cross-account / home-account IAM role

The base CloudFormation template (new-VPC or existing/BYO-VPC) ships an intentionally minimal instance role. From it the gateway boots, elects a leader, associates its Elastic IP, and runs outbound source NAT on its primary interface -- with no further configuration. Most other features, however, require a separately deployed operational IAM role -- the cross-account / home-account role -- without which those features will not work.

You deploy this operational role once in your home account (the account running the gateway) and once in each additional account you want the gateway to manage. The gateway computes every IAM policy for you; you never hand-edit policy JSON.

Base vs operational, at a glance. The base instance role grants only boot-to-ready and steady-self permissions: read/write its own SSM configuration, write its log group, complete the Auto Scaling launch hook, associate the master Elastic IP, and disable the source/destination check on the primary network interface. Everything beyond that assumes the operational role.

What the operational role unlocks

Each capability below maps to one operational function role named <name>-cross-account-roleN. When the matching role is absent -- the stack is not deployed, or you deselected the feature -- that capability is simply inert: the gateway keeps running and retries; it does not crash.

Capability	Needs the operational role?
Outbound source NAT on the gateway's primary interface	No -- works from the base template
Master Elastic IP association	No -- works from the base template
Full per-AZ NAT data plane (egress for your private subnets)	Yes
Core VPC topology build, VPC peering / inter-VPC transit	Yes
Gateway Load Balancer + per-AZ Auto Scaling infrastructure	Yes
EIP-pool DNS publishing + ACM certificate export	Yes
VPC flow logs, EventBridge / SQS integration	Yes
Vertical / horizontal scaling, spot lifecycle, teardown	Yes
AI Gateway (Amazon Bedrock) + local vLLM inference	Yes
Cross-account management of a member account	Yes (the role deployed in that account)

Naming and trust

The operational roles are named <name>-cross-account-roleN, where <name> is derived from your resource-name prefix (default cloudspectra), the manager stack name, and the manager stack ID. The gateway rebuilds these names at runtime in order to assume the roles, so the deployed names must match exactly. Each role trusts only two principals in your home account -- the gateway instance role (<name>-instance) and the CloudFormation handler role (<name>-cf-handler-role) -- gated by an external ID, and optionally by your AWS Organizations ID. Each role's permissions are inline and fully visible to your security team for review.

Setting it up

From the dashboard's Setup IAM Roles view, each account row offers three ways to deploy the operational role. The gateway pre-fills every IAM policy, so you never type permissions.

1CloudFormation (recommended)

The dashboard hands you a CloudFormation Quick-Create URL with the template and all parameters pre-filled. Open it while signed into the target account and create the stack. (If a stack already exists, the dashboard instead stages a Change Set, and the link becomes an Execute action.)

2CLI

Download the cloudspectra-setup tool from the dashboard and run cloudspectra-setup add --token <token> with credentials for the target account. The tool creates the IAM roles directly through the IAM API.

3Terraform / infrastructure as code

The dashboard also exposes a JSON payload (template URL + stack name + parameter map) you can feed into your own IaC pipeline.

After the stack reaches CREATE_COMPLETE (or the CLI finishes), use the dashboard's Add Account action to register the account so the gateway begins managing it.

One account at a time. Each account -- including your home account -- is enrolled with the same template. In v1 there is no AWS Organizations StackSets one-action rollout; you deploy the stack (or run the CLI) once per account.

Updating it over time

As Cloud Spectra ships features that need new permissions, the operational role is updated through CloudFormation -- the gateway never edits your IAM at runtime. The template itself is frozen -- only its parameter values change between releases, so an update never disturbs the roles you have already deployed.

After your gateway auto-upgrades, it regenerates the policies and -- on the dashboard's account view -- stages a CloudFormation Change Set on your existing stack against the new release. You open the link, review the diff in the CloudFormation console, and click Execute. (For an account that has no stack yet, the dashboard gives you a Quick-Create URL instead.) The cloudspectra-setup CLI is the alternative: re-run it with a fresh token to apply the update.

Same source of truth. Whether you deploy via CloudFormation or the CLI, both consume the identical backend-generated policies, so the two interfaces cannot drift. The gateway stages the change for you, but you always apply it yourself.

Choosing which features get permissions. The dashboard includes an IAM permission picker that lets you deselect optional features. Core features -- configuration, compute lifecycle, VPC cleanup, teardown, and the Region guardrail -- are always on and cannot be deselected. Deselecting a feature removes its role on the next stack update. Saving your selection persists it and prompts you to update the stack to apply -- it does not change your IAM by itself.

flowchart TD
    BASE["Base CloudFormation template
(minimal instance role)"] --> BOOT["Boot, leader election,
master EIP, primary-interface NAT"]
    DASH["Gateway dashboard"] -->|"Quick-Create URL /
Change Set / CLI / IaC"| OPS["Operational role stack
(cross-account / home-account)"]
    OPS --> FEAT["Most features:
full NAT data plane, GWLB, DNS,
flow logs, scaling, AI Gateway"]
    DASH -->|"new release:
stage Change Set"| UPD["Review diff in console,
click Execute"]
    UPD --> OPS
    TRUST["Trust: instance role +
cf-handler role, external ID"] --> OPS
    style BASE fill:#dbeafe,stroke:#3b82f6,color:#1e3a8a
    style BOOT fill:#d1fae5,stroke:#10b981,color:#065f46
    style OPS fill:#fef3c7,stroke:#f59e0b,color:#92400e
    style FEAT fill:#d1fae5,stroke:#10b981,color:#065f46
    style DASH fill:#ede9fe,stroke:#8b5cf6,color:#5b21b6
    style UPD fill:#ede9fe,stroke:#8b5cf6,color:#5b21b6
    style TRUST fill:#dbeafe,stroke:#3b82f6,color:#1e3a8a

Deployment & upgrades

There are three supported ways to deploy Cloud Spectra Gateway. All three deploy into your own AWS account and produce the same runtime: a per-AZ Auto Scaling Group behind a Gateway Load Balancer, with an Elastic IP as a stable endpoint and configuration in SSM Parameter Store.

flowchart TD
    MP["AWS Marketplace
1-click subscribe"] --> CF["CloudFormation"]
    CF --> NEWVPC["New-VPC template"]
    CF --> EXVPC["Existing / BYO-VPC template"]
    TFM["Terraform
(network mirror provider)"] --> RUNTIME["Per-AZ ASG behind GWLB
+ EIP + SSM config"]
    AMI["Standalone AMI
(no CloudFormation)"] --> RUNTIME
    NEWVPC --> RUNTIME
    EXVPC --> RUNTIME

CloudFormation (new VPC or existing/BYO VPC)

From the AWS Marketplace listing, 1-click launches a CloudFormation stack. Two base templates are provided:

Template	Use when
New-VPC	You want Cloud Spectra to create a fresh VPC, subnets, route tables, and gateway resources. The default.
Existing / BYO-VPC	You want to deploy into a VPC you already operate; Cloud Spectra adds its per-AZ subnets, ENIs, and ASGs.

See the Quick Start for the step-by-step CloudFormation walkthrough.

Terraform (via a one-time network mirror)

Deploy and configure with Terraform using the cloudspectra/cloudspectra provider plus AWS modules. The provider is installed from a network mirror -- add a one-time network_mirror block to your ~/.terraformrc so Terraform can fetch it:

# ~/.terraformrc  (one-time)
provider_installation {
  network_mirror {
    url = "https://<cloudspectra-network-mirror>/"
  }
  direct {
    exclude = ["cloudspectra/cloudspectra"]
  }
}

terraform {
  required_providers {
    aws          = { source = "hashicorp/aws", version = ">= 5.0" }
    cloudspectra = { source = "cloudspectra/cloudspectra" }
  }
}

provider "cloudspectra" {
  gateway_url    = "https://<GATEWAY_EIP>"
  admin_password = var.cloudspectra_admin_password
}

# Every feature on this page maps to a provider resource, e.g.:
resource "cloudspectra_nat_gateway"        "main" { enable = true }
resource "cloudspectra_firewall_gateway"   "main" { enable = true }
resource "cloudspectra_web_proxy_gateway"  "main" { enable = true }
resource "cloudspectra_ai_gateway"         "main" { enable = true }

Same source of truth. The Terraform provider writes the same configuration that the dashboard and the REST API write -- into SSM Parameter Store -- so you can mix interfaces (for example, declare base config in Terraform and make day-2 tweaks in the dashboard).

Standalone AMI (no CloudFormation)

You can launch the AMI directly into a subnet. It boots with outbound NAT and the management dashboard already running -- no CloudFormation stack required. This is the lightest-weight way to evaluate the gateway or to run it in environments where you manage the surrounding infrastructure yourself.

Upgrades

Because the runtime is a per-AZ Auto Scaling Group, upgrades roll through the fleet instance by instance. Gateway Load Balancer connection draining lets active flows on an instance finish before it is replaced, so planned upgrades and resizes are seamless. Configuration in SSM Parameter Store is preserved across instance replacement, so the data plane reconciles back to your declared state automatically.

Next steps. Deploy with the Quick Start, understand the internals in Architecture, or browse common questions in the FAQ.

Cloud Spectra Gateway -- User Guide v1.0.0

Introduction & how configuration works

Network Gateway

Security Gateway

AI Gateway

Three ways to configure -- one source of truth

Network Gateway tier Network

Source NAT (sNAT)

Destination NAT / port forwarding (dNAT)

Network Load Balancer (IPVS L4)

TLS termination (ACM)

Per-AZ Auto Scaling

Vertical & horizontal scaling (GWLB)

Forward HTTP proxy + caching (Squid)

Security Gateway tier Security

Suricata IDS/IPS (inline)

nftables firewall rules

Domain / URL filtering

AI Gateway tier AI Gateway

AI LLM Proxy with response caching

Local AI Inference (vLLM)

AI Semantic Cache

Ports & endpoints reference

Cross-account / home-account IAM role

What the operational role unlocks

Naming and trust

Setting it up

Updating it over time

Deployment & upgrades

CloudFormation (new VPC or existing/BYO VPC)

Terraform (via a one-time network mirror)

Standalone AMI (no CloudFormation)

Upgrades