
Data residency by AI provider
Where your prompts and outputs actually live, by vendor. As of June 2026, best-effort.
What the three layers of residency actually mean
Before reading the per-provider tables, internalize these distinctions. Every vendor slides between them.
- Inference geo — the data center where the GPU actually generated your response. This is the layer most exposed to compelled access by the host country's government, because the cleartext token stream is in memory there.
- Storage geo — where logs, abuse-monitoring buffers, and any persisted artifacts live at rest. Usually 30 days by default on commercial APIs, often reducible to 0 days under a ZDR addendum.
- Endpoint-services geo — where supporting infrastructure runs: tokenizers, content-safety classifiers, image pipelines, retrieval indexes, code interpreters. Easy to forget. A 'EU-resident' deployment that calls a US classifier is, in practice, sending your prompt to the US.
- Training reuse — separate from residency, but always asked in the same breath. Default on enterprise API tiers from Anthropic, OpenAI, Google, AWS, Mistral, and Cohere: customer prompts are NOT used to train base models. Verify on the order form.
- BAA scope — a Business Associate Agreement is a contract under HIPAA. The fact that a vendor 'has' a BAA does not mean every product is covered. Bedrock got added to AWS's HIPAA-eligible list on 10 February 2026; some Bedrock features remain out of scope. Read the schedule.
Provider index — at a glance
Quick scan of where each provider stood as of June 2026, drawn from their published trust portals and docs. Treat 'yes' as 'available under enterprise contract' unless noted otherwise.
Anthropic — what is actually shipping
OpenAI — global storage, US inference
Google Vertex AI and AWS Bedrock — the partner-operated lanes
If you want Claude or a major open model with the strictest residency posture available today, you almost certainly want it through one of the two big clouds. Both inherit hyperscaler compliance.
Google Vertex AI / Gemini
Cite: docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/data-residency
FedRAMP High since July 2025. SOC 1/2/3, ISO 9001, ISO/IEC 27001/27017/27018/27701/42001. Regional data residency GA since April 2025 for Gemini API, Vertex AI agents, and secured Gemini Apps. EU residency through europe-west1, europe-west4, and others; the `eu` multi-region endpoint guarantees EU-only processing under Google's DPA. Zero-retention terms available via DPA addendum for eligible customers.
AWS Bedrock
Cite: aws.amazon.com/bedrock/security-compliance
Stateless API by default — prompts and responses are not stored on AWS's side. SOC 1/2/3 since 15 August 2023. HIPAA-eligible since 10 February 2026 — a meaningful upgrade for healthcare workloads. FedRAMP High in GovCloud US-West. ISO and CSA STAR Level 2 in scope. EU residency through eu-west-1 (Ireland), eu-central-1 (Frankfurt), and Paris.
Microsoft Azure OpenAI
Cite: azure.microsoft.com/en-us/blog (Data Zones announcement)
Closest thing to a true global per-region inference plane for GPT-class models. 28+ regions, Global/Data Zone/Regional deployment selectors, ZDR amendment available. Inherits Azure's SOC 2, ISO 27001, FedRAMP High, HIPAA BAA.
Mistral — the EU-native default
Cohere, Together, Fireworks, Replicate — the inference-platform layer
These four sit one rung below the hyperscalers in posture. All four are usable in production. Compliance scope varies sharply.
- Cohere — primary infrastructure on GCP US-Central. Annual SOC 2 Type II. HIPAA compliant with a standard BAA available. Multi-jurisdictional DPA incorporating the EU Standard Contractual Clauses (approved 4 June 2021). The honest caveat: Cohere does not currently offer true multi-region data residency outside the US — a meaningful limit if you need EU-only inference and not just EU-resident logging.
- Together AI — US-primary with EU residency live in Sweden since September 2025. Enterprise SOC 2 Type II, HIPAA on enterprise contracts, dedicated data-residency controls and ZDR on the enterprise tier. Voice platform layer adds its own enterprise compliance bundle.
- Fireworks AI — multi-region global fleet across US, Frankfurt, Iceland, Tokyo. Fully HIPAA-eligible, SOC 2 Type II, GDPR-compliant. Zero data retention by default on the enterprise product. Airgapped EKS deployment is on offer for the most regulated customers.
- Replicate — developer-and-community surface. Strong model catalog, easy experimentation. As of June 2026 not the right choice if you need formal enterprise BAA and FedRAMP — those are simply not the posture they sell. Use Replicate for prototyping and route the production workload to a hyperscaler or to Together/Fireworks.
Groq and Cerebras — the dedicated-silicon lane
Where buyers reliably get burned
Three failure modes we keep seeing in 2026 procurement reviews. Each is avoidable on day one of vendor diligence. First, mistaking storage residency for inference residency. OpenAI's Asia residency announcement says it plainly: 'targeted for data that is stored or is at rest and not data that is being used for inference by a model, whose default location continues to be the US.' If your DPO wrote a policy that says EU data must be processed in the EU, you are not compliant on OpenAI direct API even with Asia residency turned on. Use Azure OpenAI's Regional or Data Zone deployments for that. Second, assuming SOC 2 proves geography. SOC 2 proves control design and operating effectiveness over the trust services criteria. It does not prove your data stayed in any particular country. A SOC 2 report is necessary but not sufficient for residency compliance. Pair it with a regional inference commitment in the order form. Third, assuming a vendor BAA covers every product the vendor sells. Bedrock got added to AWS's HIPAA-eligible services list on 10 February 2026. The fact that AWS has had a BAA for a decade did not retroactively cover Bedrock before that date. Read the HIPAA-eligible-services schedule, not the marketing page.
How AtomEons handles this internally
Sources
- [01]
Anthropic Privacy Center lists SOC 2 Type I & II, ISO 27001:2022, ISO/IEC 42001:2023, and HIPAA-ready configuration with BAA available.
https://privacy.claude.com/en/articles/10015870-what-certifications-has-anthropic-obtained ↗ - [02]
Anthropic's data-residency docs confirm `inference_geo` accepts only `us` or `global`; workspace geo is currently US-only; Opus 4.6 and Sonnet 4.6 support the parameter at 1.1x pricing.
https://platform.claude.com/docs/en/manage-claude/data-residency ↗ - [03]
Anthropic Trust Portal is the source for SOC 3 summary and certification verification.
https://trust.anthropic.com/ ↗ - [04]
OpenAI announced data residency in Japan, India, Singapore, and South Korea on 8 May 2025; default inference location remains the US.
https://openai.com/index/introducing-data-residency-in-asia/ ↗ - [05]
OpenAI data residency is available in Europe, UK, US, Canada, Japan, South Korea, Singapore, India, Australia, and the UAE for storage at rest.
https://openai.com/index/expanding-data-residency-access-to-business-customers-worldwide/ ↗ - [06]
OpenAI publishes its enterprise privacy posture including DPA, SOC 2, and ZDR availability for qualifying organizations.
https://openai.com/business-data/ ↗ - [07]
Azure OpenAI introduced Data Zones for EU and US in addition to Global and Regional deployment modes.
https://azure.microsoft.com/en-us/blog/enterprise-trust-in-azure-openai-service-strengthened-with-data-zones/ ↗ - [08]
Vertex AI provides regional data residency via region-pinned endpoints including europe-west1, europe-west4, and the eu multi-region endpoint.
https://docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/data-residency ↗ - [09]
Vertex AI Search and Generative AI on Vertex AI achieved FedRAMP High authorization in July 2025.
https://cloud.google.com/blog/topics/public-sector/vertex-ai-search-and-generative-ai-with-gemini-achieve-fedramp-high ↗ - [10]
Gemini Enterprise documents SOC 1/2/3, ISO 27001/27017/27018/27701/42001 compliance scope.
https://docs.cloud.google.com/gemini/enterprise/docs/compliance-security-controls ↗ - [11]
Amazon Bedrock is in scope for ISO, SOC, CSA STAR Level 2, GDPR, HIPAA eligibility, and FedRAMP High in GovCloud US-West.
https://aws.amazon.com/bedrock/security-compliance/ ↗ - [12]
AWS Alps blog confirms Bedrock's regional residency and Switzerland-specific data-protection guidance.
https://aws.amazon.com/blogs/alps/security_bedrock/ ↗ - [13]
Amazon Bedrock is available in AWS GovCloud (US) with FedRAMP High authorization.
https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-bedrock.html ↗ - [14]
Mistral AI states it complies with SOC 2 Type II and ISO 27001/27701 frameworks.
https://help.mistral.ai/en/articles/347638-do-you-have-soc-2-or-iso-27001-certification ↗ - [15]
Mistral AI defaults to EU data hosting unless customers explicitly use the US API endpoint.
https://help.mistral.ai/en/articles/347629-where-do-you-store-my-data-or-my-organization-s-data ↗ - [16]
Cohere Trust Center publishes its annual SOC 2 Type II posture and HIPAA-ready BAA availability.
https://trustcenter.cohere.com/ ↗ - [17]
Cohere documents its multi-jurisdictional DPA incorporating EU Standard Contractual Clauses and ephemeral-data configuration option.
https://cohere.com/enterprise-data-commitments ↗ - [18]
Fireworks AI publishes HIPAA, SOC 2 Type II, GDPR compliance, ZDR by default, and multi-region deployment including airgapped EKS.
https://fireworks.ai/enterprise ↗ - [19]
Groq Trust Center publishes Groq's current security and compliance posture.
https://trust.groq.com/ ↗ - [20]
Groq operates data centers across the US, Canada, Europe, and the Middle East with further 2026 expansion announced.
https://groq.com/newsroom/groq-solidifies-status-as-emerging-hyperscaler-with-new-global-deployment ↗ - [21]
Groq is building inferencing infrastructure with Aramco Digital in Saudi Arabia under local data-sovereignty rules.
https://groq.com/newsroom/aramco-digital-and-groq-announce-progress-in-building-the-worlds-largest-inferencing-data-center-in-saudi-arabia-following-leap-mou-signing ↗ - [22]
Groq secured a USD 1.5 billion Saudi Arabia investment to expand its Dammam data-center footprint.
https://www.datacenterdynamics.com/en/news/groq-secures-15bn-from-saudi-arabia-to-expand-ai-inference-infrastructure-in-the-region/ ↗ - [23]
Cerebras Systems S-1 documents its Framework Agreement with Core42 (formerly G42 Holding US LLC) for AI supercomputer deployment.
https://www.sec.gov/Archives/edgar/data/2021728/000162828026025762/exhibit1011-sx1.htm ↗ - [24]
Core42 commissioned Maximus-01 at TeraWulf's Lake Mariner data center in Buffalo, NY in November 2025 with ~9,000 AMD Instinct MI300X GPUs.
https://www.middleeastainews.com/p/core42-us-footprint-expands-adding ↗